• Reuters RCV1 RCV2 Multilingual, Multiview Text Categorization Test collection

    付費方式 免費 更新頻率 不定期
    This test collection contains feature characteristics of documents originally written in five different languages and their translations, over a common set of 6 categories.
  • Cryotherapy Dataset

    付費方式 免費 更新頻率 不定期
    This dataset contains information about wart treatment results of 90 patients using cryotherapy.
  • UNIX User Data

    付費方式 免費 更新頻率 不定期
    This file contains 9 sets of sanitized user data drawn from the command histories of 8 UNIX computer users at Purdue over the course of up to 2 years.
  • Beijing PM2.5 Data

    付費方式 免費 更新頻率 不定期
    This hourly data set contains the PM2.5 data of US Embassy in Beijing. Meanwhile, meteorological data from Beijing Capital International Airport are also included.
  • LED Display Domain

    付費方式 免費 更新頻率 不定期
    From Classification and Regression Trees book; We provide here 2 C programs for generating sample databases
  • Statlog (Image Segmentation)

    付費方式 免費 更新頻率 不定期
    This dataset is an image segmentation database similar to a database already present in the repository (Image segmentation database) but in a slightly different form.
  • Buzz in social media

    付費方式 免費 更新頻率 不定期
    This data-set contains examples of buzz events from two different social networks
  • Chess (Domain Theories)

    付費方式 免費 更新頻率 不定期
    6 different domain theories for generating legal moves of chess
  • Online News Popularity

    付費方式 免費 更新頻率 不定期
    This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social...
  • Wilt

    付費方式 免費 更新頻率 不定期
    High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set from stratified...
  • One-hundred plant species leaves data set

    付費方式 免費 更新頻率 不定期
    Sixteen samples of leaf each of one-hundred plant species. For each sample, a shape descriptor, fine scale margin and texture histogram are given.
  • Bag of Words

    付費方式 免費 更新頻率 不定期
    This data set contains five text collections in the form of bags-of-words.
  • SMS Spam Collection

    付費方式 免費 更新頻率 不定期
    The SMS Spam Collection is a public set of SMS labeled messages that have been collected for mobile phone spam research.
  • Automobile

    付費方式 免費 更新頻率 不定期
    From 1985 Ward's Automotive Yearbook
  • Audiology (Original)

    付費方式 免費 更新頻率 不定期
    Nominal audiology dataset from Baylor
  • Breast Cancer Wisconsin (Prognostic)

    付費方式 免費 更新頻率 不定期
    Prognostic Wisconsin Breast Cancer Database
  • seismic-bumps

    付費方式 免費 更新頻率 不定期
    The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two of longwalls located in a Polish coal mine.
  • Wholesale customers

    付費方式 免費 更新頻率 不定期
    The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories
  • Volcanoes on Venus - JARtool experiment

    付費方式 免費 更新頻率 不定期
    The JARtool project was a pioneering effort to develop an automatic system for cataloging small volcanoes in the large set of Venus images returned by the Magellan spacecraft.
  • Badges

    付費方式 免費 更新頻率 不定期
    Badges labeled with a "+" or "-" as a function of a person's name