Need authorization agreement

20-newsgroups

The 20 Newsgroups data set

The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his Newsweeder: Learning to filter netnews paper, though he does not explicitly mention this collection. The 20 newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.

## Reference: * http://qwone.com/~jason/20Newsgroups/

データとリソース

追加情報

フィールド
ソース http://qwone.com/~jason/20Newsgroups/
最終更新 10月 11, 2020, 12:13 (CST)
作成日 3月 7, 2018, 16:43 (CST)

推薦資料集:


  • 戰役人名索引

    Payment instrument Free
    Update frequency Irregular
    國家檔案有關國共戰爭之戰役心得報告之人名索引筆數。
  • 109年臺東縣非都市土地使用分區與編定筆數及面積-卑南鄉

    Payment instrument Free
    Update frequency Irregular
    109年臺東縣非都市土地使用分區與編定筆數及面積(卑南鄉)
  • 公司決算書表申報暨查核辦法

    Payment instrument Free
    Update frequency Irregular
    公司決算書表申報暨查核辦法
  • 東沙環礁國家公園物種調查結果

    Payment instrument Free
    Update frequency Irregular
    東沙環礁國家公園物種調查結果摘要,資料來源(2005-2015)
  • 空氣品質小時值_臺中市_西屯站

    Payment instrument Free
    Update frequency Irregular
    臺中市-西屯站小時值