20-newsgroups

The 20 Newsgroups data set

The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his paper "Newsweeder: Learning to filter netnews", though he does not explicitly mention this collection. The 20 Newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.
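One way to check the "(nearly) evenly" partition claim on any copy of the data is to count documents per newsgroup label. The following is a minimal sketch using only the Python standard library; the helper name `balance_summary` and the toy label list are illustrative assumptions, not part of the data set or its distribution.

```python
from collections import Counter


def balance_summary(labels):
    """Count documents per group and report smallest/largest group ratio.

    `labels` is any iterable with one newsgroup name per document.
    A ratio close to 1.0 means the groups are nearly the same size.
    """
    counts = Counter(labels)
    if not counts:
        raise ValueError("labels must not be empty")
    smallest = min(counts.values())
    largest = max(counts.values())
    return counts, smallest / largest


# Toy labels standing in for real per-document newsgroup names (assumed data).
toy_labels = (
    ["sci.space"] * 10
    + ["rec.autos"] * 10
    + ["talk.religion.misc"] * 6
)

counts, ratio = balance_summary(toy_labels)
print(f"{len(counts)} groups, smallest/largest ratio = {ratio:.2f}")
```

With the real collection, the same function could be given one label per document, for example the group names derived from the `target` and `target_names` fields returned by scikit-learn's `fetch_20newsgroups`, assuming that library is installed and the data can be downloaded.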

Reference: http://qwone.com/~jason/20Newsgroups/

Additional Info

Source: http://qwone.com/~jason/20Newsgroups/
Last Updated: October 11, 2020, 12:13 (CST)
Created: March 7, 2018, 16:43 (CST)
