找到31個資料集

組織: UCI Machine Learning Repository 格式: HTML

篩選結果
  • Census-Income (KDD)

    付費方式 免費 更新頻率 不定期
    This data set contains weighted census data extracted from the 1994 and 1995 current population surveys conducted by the U.S. Census Bureau.
  • Twenty Newsgroups

    付費方式 免費 更新頻率 不定期
    This data set consists of 20000 messages taken from 20 newsgroups.
  • Synthetic Control Chart Time Series

    付費方式 免費 更新頻率 不定期
    This data consists of synthetically generated control charts.
  • MSNBC.com Anonymous Web Data

    付費方式 免費 更新頻率 不定期
    This data describes the page visits of users who visited msnbc.com on September 28, 1999. Visits are recorded at the level of URL category (see description) and are recorded in...
  • Coil 1999 Competition Data

    付費方式 免費 更新頻率 不定期
    This data set is from the 1999 Computational Intelligence and Learning (COIL) competition. The data contains measurements of river chemical concentrations and algae densities.
  • Pseudo Periodic Synthetic Time Series

    付費方式 免費 更新頻率 不定期
    This data set is designed for testing indexing schemes in time series databases. The data appears highly periodic, but never exactly repeats itself.
  • Internet Usage Data

    付費方式 免費 更新頻率 不定期
    This data contains general demographic information on internet users in 1997.
  • KDD Cup 1999 Data

    付費方式 免費 更新頻率 不定期
    This is the data set used for The Third International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-99
  • Corel Image Features

    付費方式 免費 更新頻率 不定期
    This dataset contains image features extracted from a Corel image collection. Four sets of features are available based on the color histogram, color histogram layout, color...
  • E. Coli Genes

    付費方式 免費 更新頻率 不定期
    Data giving characteristics of each ORF (potential gene) in the E. coli genome. Sequence, homology (similarity to other genes) and structural information, and function (if...
  • EEG Database

    付費方式 免費 更新頻率 不定期
    This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes placed on the scalp sampled at...
  • Movie

    付費方式 免費 更新頻率 不定期
    This data set contains a list of over 10000 films including many older, odd, and cult films. There is information on actors, casts, directors, producers, studios, etc.
  • KDD Cup 1998 Data

    付費方式 免費 更新頻率 不定期
    This is the data set used for The Second International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-98
  • Reuters-21578 Text Categorization Collection

    付費方式 免費 更新頻率 不定期
    This is a collection of documents that appeared on Reuters newswire in 1987. The documents were assembled and indexed with categories.
  • Syskill and Webert Web Page Ratings

    付費方式 免費 更新頻率 不定期
    This database contains HTML source of web pages plus the ratings of a single user on these web pages. Web pages are on four seperate subjects (Bands- recording artists; Goats;...
  • Volcanoes on Venus - JARtool experiment

    付費方式 免費 更新頻率 不定期
    The JARtool project was a pioneering effort to develop an automatic system for cataloging small volcanoes in the large set of Venus images returned by the Magellan spacecraft.
  • UNIX User Data

    付費方式 免費 更新頻率 不定期
    This file contains 9 sets of sanitized user data drawn from the command histories of 8 UNIX computer users at Purdue over the course of up to 2 years.
  • Australian Sign Language signs (High Quality)

    付費方式 免費 更新頻率 不定期
    This data consists of sample of Auslan (Australian Sign Language) signs. 27 examples of each of 95 Auslan signs were captured from a native signer using high-quality position...
  • El Nino

    付費方式 免費 更新頻率 不定期
    The data set contains oceanographic and surface meteorological readings taken from a series of buoys positioned throughout the equatorial Pacific.
  • Insurance Company Benchmark (COIL 2000)

    付費方式 免費 更新頻率 不定期
    This data set used in the CoIL 2000 Challenge contains information on customers of an insurance company. The data consists of 86 variables and includes product usage data and...