KDD Competition 2010

* [[Machine_Learning/SqliteImport | Importing data into Sqlite]] for SQL'ing the data
* [[Machine_Learning/OmniscopeVisualization | Visualizing Sqlite data in Omniscope]] for understanding the data
* [http://swarmfinancial.com/ec2mapping.zip Chance mapping dataset for Vikram's EC2 presentation]
* [http://swarmfinancial.com/withIq.zip All datasets with IQ values]
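
For readers who want to try the SQLite route without the linked page, here is a minimal Python sketch of loading a tab-separated file into a table. It is not the procedure from [[Machine_Learning/SqliteImport]]. The function name <code>import_tsv</code>, the table name and the sample column names are assumptions made for illustration, and every column is stored as untyped text.

```python
import csv
import io
import sqlite3


def import_tsv(conn, table, lines):
    """Load tab-separated lines (header row first) into a new SQLite table.

    Hypothetical helper for illustration: columns are created without types,
    so every value is stored as the text that was read from the file.
    Returns the number of columns found in the header.
    """
    # QUOTE_NONE: treat quote characters in the data as ordinary text.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    header = next(reader)
    # Quote identifiers so column names containing spaces are accepted.
    cols = ", ".join('"{}"'.format(name.replace('"', '""')) for name in header)
    safe_table = table.replace('"', '""')
    conn.execute('CREATE TABLE "{}" ({})'.format(safe_table, cols))
    placeholders = ", ".join("?" for _ in header)
    conn.executemany(
        'INSERT INTO "{}" VALUES ({})'.format(safe_table, placeholders), reader
    )
    conn.commit()
    return len(header)


# Tiny made-up sample; the column names are assumed, not taken from the real files.
SAMPLE = (
    "Row\tAnon Student Id\tStep Name\tCorrect First Attempt\n"
    "1\ts1\tstepA\t1\n"
    "2\ts1\tstepB\t0\n"
)

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    import_tsv(conn, "train", io.StringIO(SAMPLE))
    for row in conn.execute('SELECT "Step Name", COUNT(*) FROM train GROUP BY 1'):
        print(row)
```

To load a real file, pass an open file object instead of the <code>io.StringIO</code> sample, and connect to a file path instead of <code>:memory:</code>.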
 
==TODOs==
 
* Vikram -- will create a guide for Mahout setup
* Thomas -- attempt clustering of skills (subskills, traced skills and rules) using Mahout
** put together a [[Machine_Learning/kdd_sample | perl script]] that takes random samples from the data, for working on smaller instances
** put together a [[Machine_Learning/kdd_r | simple R script]] for loading the data
* Andy -- define features for the sub-problems (student IQ, step difficulty) and do the remaining feature transforms:
** replace step name with unique step name
** remove given features
** add features: step success chance, student IQ, complexity
* Erin --
* Paul -- create an overview of the data: histograms, notable features, etc. Visualization?
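
The random-sampling item above can be sketched in Python as reservoir sampling, which draws a uniform sample in one pass without holding the whole file in memory. This is not the linked perl script. The function name <code>reservoir_sample</code> and the assumption that the first line is a header row are illustrative only.

```python
import random


def reservoir_sample(lines, k, seed=None):
    """Return (header, sample): up to k data lines chosen uniformly at random.

    Hypothetical helper for illustration. Assumes the first line is a header
    row, which is kept separately and never sampled.
    """
    rng = random.Random(seed)
    lines = iter(lines)
    header = next(lines)
    sample = []
    for i, line in enumerate(lines):
        if i < k:
            # Fill the reservoir with the first k data lines.
            sample.append(line)
        else:
            # Keep the new line with probability k / (i + 1),
            # overwriting a uniformly chosen slot in the reservoir.
            j = rng.randint(0, i)
            if j < k:
                sample[j] = line
    return header, sample


if __name__ == "__main__":
    demo = ["header"] + ["row{}".format(n) for n in range(1000)]
    header, rows = reservoir_sample(demo, 5, seed=42)
    print(header, len(rows))
```

Passing an open file object as <code>lines</code> samples from a data file directly. A fixed <code>seed</code> makes the sample reproducible, so everyone can work on the same smaller instance.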


== Notes ==
== Who we are ==
* Andy; Machine Learning
* Paul; Machine Learning
* Thomas; Statistics
* Erin; Maths
* [http://swarmfinancial.com/screencasts/nb/kddWekaUsage1.swf Screencast1]
* [http://swarmfinancial.com/screencasts/nb/kddWekaUsage2.swf Screencast2]
== A more step-by-step Weka example ==
* [[Machine Learning/weka]]


== How to run libSVM ==
* See the notes at [[Machine Learning/SVM]]
== How to run MOA ==
* See the notes at [[Machine Learning/moa]]