• gene expression cancer RNA-Seq

    Update frequency: irregular | Views: 847 | Downloads: 83
    This collection of data is part of the RNA-Seq (HiSeq) PANCAN data set; it is a random extraction of gene expressions of patients having different types of tumor.
  • Multiple Features

    Update frequency: irregular | Views: 747 | Downloads: 101
    This dataset consists of features of handwritten numerals (0-9) extracted from a collection of Dutch utility maps.
  • Predict keywords activities in a online social media

    Update frequency: irregular | Views: 501 | Downloads: 8
    The data from Twitter was collected during 360 consecutive days. It was done by querying 1497 English keywords sampled from Wikipedia. This dataset is proposed in a Learning to...
  • Mobile Robots

    Update frequency: irregular | Views: 487 | Downloads: 10
    Learning concepts from sensor data of a mobile robot; set of data sets
  • Australian Sign Language signs

    Update frequency: irregular | Views: 498 | Downloads: 3
    This data consists of samples of Auslan (Australian Sign Language) signs. Examples of 95 signs were collected from five signers, for a total of 6650 sign samples.
  • Entree Chicago Recommendation Data

    Update frequency: irregular | Views: 772 | Downloads: 9
    This data contains a record of user interactions with the Entree Chicago restaurant recommendation system.
  • CMU Face Images

    Update frequency: irregular | Views: 1309 | Downloads: 84
    This data consists of 640 black and white face images of people taken with varying pose (straight, left, right, up), expression (neutral, happy, sad, angry), eyes (wearing...
  • First-order theorem proving

    Update frequency: irregular | Views: 449 | Downloads: 1
    Given a theorem, predict which of five heuristics will give the fastest proof when used by a first-order prover. A sixth prediction declines to attempt a proof, should the...
  • URL Reputation

    Update frequency: irregular | Views: 619 | Downloads: 42
    Anonymized 120-day subset of the ICML-09 URL data containing 2.4 million examples and 3.2 million features.
  • Insurance Company Benchmark (COIL 2000)

    Update frequency: irregular | Views: 653 | Downloads: 25
    This data set used in the CoIL 2000 Challenge contains information on customers of an insurance company. The data consists of 86 variables and includes product usage data and...
  • Australian Sign Language signs (High Quality)

    Update frequency: irregular | Views: 642 | Downloads: 9
    This data consists of samples of Auslan (Australian Sign Language) signs. 27 examples of each of 95 Auslan signs were captured from a native signer using high-quality position...
  • Reuters RCV1 RCV2 Multilingual, Multiview Text Categorization Test collection

    Update frequency: irregular | Views: 714 | Downloads: 41
    This test collection contains feature characteristics of documents originally written in five different languages and their translations, over a common set of 6 categories.
  • UNIX User Data

    Update frequency: irregular | Views: 632 | Downloads: 9
    This file contains 9 sets of sanitized user data drawn from the command histories of 8 UNIX computer users at Purdue over the course of up to 2 years.
  • Buzz in social media

    Update frequency: irregular | Views: 495 | Downloads: 3
    This dataset contains examples of buzz events from two different social networks.
  • Volcanoes on Venus - JARtool experiment

    Update frequency: irregular | Views: 557 | Downloads: 28
    The JARtool project was a pioneering effort to develop an automatic system for cataloging small volcanoes in the large set of Venus images returned by the Magellan spacecraft.
  • KASANDR

    Update frequency: irregular | Views: 508 | Downloads: 6
    KASANDR is a novel, publicly available collection for recommendation systems that records the behavior of customers of the European leader in e-Commerce advertising, Kelkoo.
  • Census-Income (KDD)

    Update frequency: irregular | Views: 1022 | Downloads: 65
    This data set contains weighted census data extracted from the 1994 and 1995 current population surveys conducted by the U.S. Census Bureau.
  • Twenty Newsgroups

    Update frequency: irregular | Views: 853 | Downloads: 44
    This data set consists of 20000 messages taken from 20 newsgroups.
  • Amazon Access Samples

    Update frequency: irregular | Views: 437 | Downloads: 5
    Amazon's InfoSec is getting smarter about the way Access data is leveraged. This is an anonymized sample of access provisioned within the company.
  • EEG Database

    Update frequency: irregular | Views: 1849 | Downloads: 179
    This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp sampled at...