Need authorization agreement

20-newsgroups

The 20 Newsgroups data set

The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his Newsweeder: Learning to filter netnews paper, though he does not explicitly mention this collection. The 20 newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.

## Reference: * http://qwone.com/~jason/20Newsgroups/

データとリソース

追加情報

フィールド
ソース http://qwone.com/~jason/20Newsgroups/
最終更新 10月 11, 2020, 12:13 (CST)
作成日 3月 7, 2018, 16:43 (CST)

推薦資料集:


  • 各年度留學獎學金甄試錄取人數

    Payment instrument Free
    Update frequency Irregular
    近年留學獎學金甄試各類別學生錄取人數
  • 109年度臺東縣護理之家

    Payment instrument Free
    Update frequency Irregular
    109年度臺東縣智慧福利服務躍升計畫OpenData資料收集
  • 106年3月花蓮縣各項稅捐本月實徵數簡報表

    Payment instrument Free
    Update frequency Irregular
    花蓮縣各項稅捐本月實徵數簡報表
  • 全國場址目前改善與整治進度

    Payment instrument Free
    Update frequency Irregular
    全國土壤及地下水污染場址之改善進度更新情形統計報表
  • 嘉義市AED(自動體外心臟去顫器)設置地點

    Payment instrument Free
    Update frequency Irregular
    嘉義市AED(自動體外心臟去顫器)設置地點