-
Dorothea
更新頻率 不定期DOROTHEA is a drug discovery dataset. Chemical compounds represented by structural molecular features must be classified as active (binding to thrombin) or inactive. This is one... -
Arcene
更新頻率 不定期ARCENE's task is to distinguish cancer versus normal patterns from mass-spectrometric data. This is a two-class classification problem with continuous input variables. This... -
Madelon
更新頻率 不定期MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The... -
Dexter
更新頻率 不定期DEXTER is a text classification problem in a bag-of-word representation. This is a two-class classification problem with sparse continuous input variables. This dataset is one... -
PAMAP2 Physical Activity Monitoring
更新頻率 不定期The PAMAP2 Physical Activity Monitoring dataset contains data of 18 different physical activities, performed by 9 subjects wearing 3 inertial measurement units and a heart rate... -
Victorian Era Authorship Attribution
更新頻率 不定期To create the largest authorship attribution dataset, we extracted works of 50 well-known authors. To have a non-exhaustive learning, in training there are 45 authors whereas,... -
OpinRank Review Dataset
更新頻率 不定期This data set contains user reviews of cars and and hotels collected from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews). -
AutoUniv
更新頻率 不定期AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of real data. Data can be generated in .csv, ARFF or C4.5... -
Gisette
更新頻率 不定期GISETTE is a handwritten digit recognition problem. The problem is to separate the highly confusible digits '4' and '9'. This dataset is one of five datasets of the NIPS 2003... -
104年度行政院農業委員會林業試驗所單位決算
更新頻率 不定期以前年度歲入來源別轉入數決算表、以前年度歲出政事別轉入數決算表、以前年度歲出機關別轉入數決算表、歲入來源別決算表、歲入類平衡表、歲出政事別決算表、歲出機關別決算表、經費類平衡表
您也可以使用API (應用程式介面) (see API 文件)註冊。