Need authorization agreement

20-newsgroups

The 20 Newsgroups data set

The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his Newsweeder: Learning to filter netnews paper, though he does not explicitly mention this collection. The 20 newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.

## Reference: * http://qwone.com/~jason/20Newsgroups/

データとリソース

追加情報

フィールド
ソース http://qwone.com/~jason/20Newsgroups/
最終更新 10月 11, 2020, 12:13 (CST)
作成日 3月 7, 2018, 16:43 (CST)

推薦資料集:


  • 臺閩地區營利事業自動報繳單位銷售額依公民營級距統計表

    Payment instrument Free
    Update frequency Irregular
    臺閩地區營利事業自動報繳單位銷售額依公民營級距統計表
  • 綜合所得稅所得淨額為零之各類扣除額申報統計表

    Payment instrument Free
    Update frequency Irregular
    綜合所得稅所得淨額為零之各類扣除額申報統計表 單位:金額(千元)
  • 通關即時服務窗口

    Payment instrument Free
    Update frequency Irregular
    提供通關即時服務窗口資訊
  • 跨醫院門診同藥理用藥日數重疊率-降血脂 (口服)(醫院總額指標)

    Payment instrument Free
    Update frequency Irregular
    資料來源:保險醫事服務機構醫療服務點數申報資料 分子:同一位病人在各院所的不同處方,開立同一種藥理分類之「口服降血脂藥物」,重複給藥日份加總 分母:開立「口服降血脂藥物」案件的給藥日份加總。 計算公式:(分子/分母)x 100%
  • IoW堤防結構安全基本資料

    Payment instrument Free
    Update frequency Irregular
    IoW(水資源物聯網)收集之堤防結構安全感測設備基本資料