The 20 Newsgroups data set

The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his paper "Newsweeder: Learning to filter netnews", though he does not explicitly mention this collection. The 20 Newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.
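As a quick check of the "nearly even" partitioning described above, the following is a minimal sketch that counts message files per newsgroup. It assumes the archive has been downloaded and unpacked locally into one subdirectory per newsgroup with one file per message; the function names `count_documents` and `balance_ratio` and the example path are hypothetical and not part of the data set.

```python
from collections import Counter
from pathlib import Path


def count_documents(root):
    """Map each newsgroup name to the number of message files it holds.

    Assumes `root` contains one subdirectory per newsgroup and one
    file per message (an assumption about the local, unpacked layout).
    """
    counts = Counter()
    for group_dir in sorted(Path(root).iterdir()):
        if group_dir.is_dir():
            counts[group_dir.name] = sum(
                1 for p in group_dir.iterdir() if p.is_file()
            )
    return counts


def balance_ratio(counts):
    """Smallest class size divided by the largest; 1.0 is perfectly even."""
    if not counts or max(counts.values()) == 0:
        return 0.0
    return min(counts.values()) / max(counts.values())
```

For example, `balance_ratio(count_documents("path/to/unpacked/archive"))` returns a value close to 1.0 when the documents are spread almost evenly across the newsgroups.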

Reference: http://qwone.com/~jason/20Newsgroups/

Additional Info

Source: http://qwone.com/~jason/20Newsgroups/
Last Updated: October 11, 2020, 12:13 (CST)
Created: March 7, 2018, 16:43 (CST)
