Need authorization agreement

20-newsgroups

The 20 Newsgroups data set

The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his Newsweeder: Learning to filter netnews paper, though he does not explicitly mention this collection. The 20 newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.

## Reference: * http://qwone.com/~jason/20Newsgroups/

Data and Resources

Additional Info

Field Value
Source http://qwone.com/~jason/20Newsgroups/
Last Updated October 11, 2020, 4:13 AM (UTC+00:00)
Created March 7, 2018, 8:43 AM (UTC+00:00)

推薦資料集:


  • 106年10月測量案件統計表

    Payment instrument Free
    Update frequency Irregular
    臺東縣各鄉鎮市土地及建物複丈案件統計
  • 102年度1664條土石流潛勢溪流影響範圍圖(TWD67)

    Payment instrument Free
    Update frequency Irregular
    提供102年度1664條土石流潛勢溪流影響範圍圖(TWD67)shp下載檔案。
  • 節能減碳檢核表

    Payment instrument Free
    Update frequency Irregular
    提供民眾節能減碳檢核表下載
  • 臺北市921、331地震列管建築物

    Payment instrument Free
    Update frequency Irregular
    臺北市921、331地震列管黃單需注意之建築物。提供市民參考本市因地震列管建築物之地址資料,以維護居住安全。
  • 國家教育研究院-化學名詞-常見生物鹼名詞

    Payment instrument Free
    Update frequency Irregular
    化學名詞-常見生物鹼英中對照名詞等資訊。