• PubChem Bioassay Data

    Update frequency: irregular  Views: 710  Downloads: 10
    These highly imbalanced bioassay datasets come from the different types of screening that can be performed using high-throughput screening (HTS) technology. 21 datasets were created from 12 bioassays.
  • microblogPCU

    Update frequency: irregular  Views: 516  Downloads: 5
    MicroblogPCU data was crawled from the Sina Weibo microblog. It can be used to study machine learning methods as well as for social network research.
  • UJIIndoorLoc-Mag

    Update frequency: irregular  Views: 512  Downloads: 0
    UJIIndoorLoc-Mag is an indoor localization database for testing indoor positioning systems that rely on variations in the Earth's magnetic field.
  • Autistic Spectrum Disorder Screening Data for Adolescent

    Update frequency: irregular  Views: 525  Downloads: 8
    Autistic spectrum disorder screening data for adolescents. This dataset is suited to classification and predictive tasks.
  • Gas Sensor Array Drift Dataset at Different Concentrations

    Update frequency: irregular  Views: 529  Downloads: 1
    This archive contains 13910 measurements from 16 chemical sensors exposed to 6 different gases at various concentration levels.
  • YouTube Spam Collection

    Update frequency: irregular  Views: 472  Downloads: 4
    A public set of comments collected for spam research. It comprises five datasets made up of 1,956 real messages extracted from five videos that were among the 10 most viewed on...
  • Combined Cycle Power Plant

    Update frequency: irregular  Views: 453  Downloads: 1
    The dataset contains 9568 data points collected from a Combined Cycle Power Plant over 6 years (2006-2011), while the plant was set to work at full load.
  • Taxi Service Trajectory - Prediction Challenge, ECML PKDD 2015

    Update frequency: irregular  Views: 655  Downloads: 39
    An accurate dataset describing the trajectories of all 442 taxis operating in the city of Porto, Portugal.
  • Character Font Images

    Update frequency: irregular  Views: 537  Downloads: 24
    Character images from scanned and computer-generated fonts.
  • Devanagari Handwritten Character Dataset

    Update frequency: irregular  Views: 485  Downloads: 1
    This is an image database of handwritten Devanagari characters. There are 46 classes of characters with 2000 examples each. The dataset is split into a training set (85%) and...
  • Educational Process Mining (EPM): A Learning Analytics Data Set

    Update frequency: irregular  Views: 456  Downloads: 1
    The Educational Process Mining data set is built from recordings of 115 subjects' activities, captured by a logging application while they learned with an educational simulator.
  • Dota2 Games Results

    Update frequency: irregular  Views: 471  Downloads: 3
    Dota 2 is a popular computer game with two teams of 5 players. At the start of the game each player chooses a unique hero with different strengths and weaknesses.
  • Northix

    Update frequency: irregular  Views: 486  Downloads: 1
    Northix is designed to be a schema matching benchmark problem for data integration of two entity-relationship databases.
  • Sentence Classification

    Update frequency: irregular  Views: 443  Downloads: 5
    Contains sentences from the abstracts and introductions of 30 articles, annotated with a modified Argumentative Zones annotation scheme. These articles come from biology, machine...
  • Gastrointestinal Lesions in Regular Colonoscopy

    Update frequency: irregular  Views: 505  Downloads: 4
    This dataset contains features extracted from colonoscopy videos used to detect gastrointestinal lesions. It contains 76 lesions
  • DSRC Vehicle Communications

    Update frequency: irregular  Views: 482  Downloads: 6
    This set provides data on wireless communications between vehicles and roadside units. Two separate data sets are provided: one for a normal scenario and one in the presence of...
  • Leaf

    Update frequency: irregular  Views: 458  Downloads: 4
    This dataset consists of a collection of shape and texture features extracted from digital images of leaf specimens originating from a total of 40 different plant species.
  • NoisyOffice

    Update frequency: irregular  Views: 512  Downloads: 2
    A corpus intended for cleaning (or binarization) and enhancement of noisy grayscale printed-text images using supervised learning methods. Noisy images and their corresponding...
  • USPTO Algorithm Challenge, run by NASA-Harvard Tournament Lab and TopCoder ...

    Update frequency: irregular  Views: 527  Downloads: 11
    Data used for the USPTO Algorithm Competition. Contains drawing pages from US patents with manually labeled figure and part labels.
  • REALDISP Activity Recognition Dataset

    Update frequency: irregular  Views: 551  Downloads: 11
    The REALDISP dataset is devised to evaluate techniques dealing with the effects of sensor displacement in wearable activity recognition as well as to benchmark general activity...