Machine Learning/Datasets

From Noisebridge

Revision as of 23:36, 14 March 2011

This page describes in detail the datasets used for the NBML Course.

Classification

MNIST Handwritten Digits
- Classify handwritten digits (0 through 9) using this very popular dataset, which provides 60,000 training examples and 10,000 test examples.
Heart Disease
- Predict whether a person has heart disease, using a subset of the 76 recorded factors.
Census Income
- Predict whether a person's income is greater or less than $50K per year.
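
The Census Income task is a binary classification problem, so the income field has to become a 0/1 target before training. The following is a minimal sketch, assuming the raw field holds the strings ">50K" and "<=50K" (possibly with stray whitespace or a trailing period); the function name income_label is hypothetical and not part of the dataset.

```python
def income_label(raw: str) -> int:
    """Map a raw income field to 1 (above 50K) or 0 (50K or below).

    Assumption: the field is the string ">50K" or "<=50K", optionally
    padded with whitespace or followed by a period.
    """
    cleaned = raw.strip().rstrip(".").upper()
    if cleaned == ">50K":
        return 1
    if cleaned == "<=50K":
        return 0
    # Fail loudly rather than silently mislabel an unexpected value.
    raise ValueError(f"unrecognized income label: {raw!r}")
```

Raising on unknown values is a deliberate choice here: a silent default would hide formatting problems in the input file.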

Regression

Boiling point in the Alps
- The boiling point of water at different barometric pressures.
Shocking Rats
- How does shocking a rat affect its ability to complete a maze?
Ice Cream Sales
- Predict the quantity of ice cream consumed based on some other variables.
Smoking and Respiratory Function
- How does smoking affect lung capacity?
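
Each of the regression datasets above can be approached first with a straight-line fit of one variable against another. The following is a minimal sketch of ordinary least squares for a single predictor, in plain Python; the function name fit_line is hypothetical, and the sample points are synthetic values chosen to lie on a known line, not measurements from any of these datasets.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two or more paired observations")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Synthetic points lying exactly on y = 2x + 1 (not real measurements).
slope, intercept = fit_line([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
```

With real data, xs would be the predictor column (for example, barometric pressure) and ys the response column (for example, boiling point).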

Time Series

Gun-related Deaths in Australia
- "Deaths from gun-related homicides and suicides and non-gun-related homicides and suicides. Australia: 1915-2004. Source: Neill and Leigh (2007)."
Immigration Rates
- "Annual immigration into the United States: thousands. 1820 – 1962. From Kendall & Ord (1990), p.13."
Percent of Men with Beards 1866-1911
- "Percent of Men with full beards, 1866 – 1911. Source: Hipel and Mcleod (1994)."

Clustering

Retrieved from "https://www.noisebridge.net/index.php?title=Machine_Learning/Datasets&oldid=17102"