Machine Learning: Difference between revisions
Jump to navigation
Jump to search
Mschachter (talk | contribs) m (→Software Tools) |
Mschachter (talk | contribs) No edit summary |
||
(9 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
=== Join the Mailing List === | |||
https://www.noisebridge.net/mailman/listinfo/ml | |||
=== Next Meeting=== | === Next Meeting=== | ||
*When: | *When: November 2013 | ||
*Where: 2169 Mission St. (back NE corner, Church classroom) | *Where: 2169 Mission St. (back NE corner, Church classroom) | ||
*Topic: | *Topic: | ||
*Details: | *Details: We're going to pick things back up soon! | ||
*Who: | *Who: | ||
=== Take the Noisebridge ML Survey === | === Take the Noisebridge ML Survey === | ||
[http://www.surveymonkey.com/s/W2T9ZB6 Take a survey] and vote for what you want to learn! | [http://www.surveymonkey.com/s/W2T9ZB6 Take a survey] and vote for what you want to learn! | ||
=== Talks and Workshops === | === Talks and Workshops === | ||
Line 42: | Line 40: | ||
*Working with the Kinect | *Working with the Kinect | ||
*Computer Vision with OpenCV | *Computer Vision with OpenCV | ||
=== Projects === | === Projects === | ||
Line 106: | Line 100: | ||
*[http://www.mlpack.org/ MLPACK] | *[http://www.mlpack.org/ MLPACK] | ||
**High performance scalable ML Library | **High performance scalable ML Library | ||
*[http://www.torch.ch/ Torch] | |||
**MATLAB-like environment for state-of-the art ML libraries written in LUA | |||
==== Online ML ==== | ==== Online ML ==== | ||
Line 112: | Line 108: | ||
*[http://jubat.us/en/ Jubatus] | *[http://jubat.us/en/ Jubatus] | ||
**Distributed Online ML | **Distributed Online ML | ||
*[http://dogma.sourceforge.net/ DOGMA] | |||
**MATLAB-based online learning stuff | |||
*[http://code.google.com/p/libol/ libol] | |||
*[http://code.google.com/p/oll/ oll] | |||
*[http://code.google.com/p/scw-learning/ scw-learning] | |||
==== Graphical Models ==== | ==== Graphical Models ==== | ||
Line 120: | Line 121: | ||
*[http://mc-stan.org/ Stan] | *[http://mc-stan.org/ Stan] | ||
**A graphical model compiler | **A graphical model compiler | ||
*[https://github.com/kutschkem/Jayes Jayes] | |||
**Bayesian networks in Java | |||
==== Text Stuff ==== | ==== Text Stuff ==== | ||
Line 140: | Line 143: | ||
*[http://drwn.anu.edu.au/ DARWIN] | *[http://drwn.anu.edu.au/ DARWIN] | ||
**Generic C++ ML and Computer Vision Library | **Generic C++ ML and Computer Vision Library | ||
*[http://sourceforge.net/projects/petavision/ PetaVision] | |||
**Developing a real-time, full-scale model of the primate visual cortex. | |||
==== Audio Processing ==== | ==== Audio Processing ==== | ||
Line 152: | Line 157: | ||
*[http://ofer.sci.ccny.cuny.edu/sound_analysis_pro Sound Analysis Pro] | *[http://ofer.sci.ccny.cuny.edu/sound_analysis_pro Sound Analysis Pro] | ||
**Tool for analyzing animal sounds | **Tool for analyzing animal sounds | ||
*[http://luscinia.sourceforge.net/ Luscinia] | |||
**Software for archiving, measuring, and analyzing bioacoustic data | |||
*[http://wiki.python.org/moin/PythonInMusic List of Sound Tools for Python] | *[http://wiki.python.org/moin/PythonInMusic List of Sound Tools for Python] | ||
Line 165: | Line 173: | ||
*[http://cytoscape.github.io/cytoscape.js/ Cytoscape] | *[http://cytoscape.github.io/cytoscape.js/ Cytoscape] | ||
**A JavaScript graph library for analysis and visualisation | **A JavaScript graph library for analysis and visualisation | ||
*[https://plot.ly/ plot.ly] | |||
**Web-based plotting | |||
==== Cluster Computing ==== | ==== Cluster Computing ==== | ||
Line 171: | Line 181: | ||
*[http://web.mit.edu/star/cluster/ STAR: Cluster] | *[http://web.mit.edu/star/cluster/ STAR: Cluster] | ||
**Easily build your own Python computing cluster on Amazon EC2 | **Easily build your own Python computing cluster on Amazon EC2 | ||
==== Database Stuff ==== | |||
*[http://madlib.net/ MADlib] | |||
**Machine learning algorithms for in-database data | |||
*[http://www.joyent.com/products/manta Manta] | |||
**Distributed object storage | |||
==== Neural Simulation ==== | |||
*[http://nengo.ca/ Nengo] | |||
==== Other ==== | ==== Other ==== |
Revision as of 17:17, 31 October 2013
Join the Mailing List
https://www.noisebridge.net/mailman/listinfo/ml
Next Meeting
- When: November 2013
- Where: 2169 Mission St. (back NE corner, Church classroom)
- Topic:
- Details: We're going to pick things back up soon!
- Who:
Take the Noisebridge ML Survey
Take a survey and vote for what you want to learn!
Talks and Workshops
We've given lots of workshops and talks over the past year or so, here's a few. Many of the workshops we've given previously are recurring and will be given again, especially upon request!
- Intro to Machine Learning
- A Brief Tour of Statistics
- Generalized Linear Models
- Neural Nets Workshop
- Support Vector Machines
- Random Forests
- Independent Components Analysis
- Deep Nets
Code and SourceForge Site
- We have a Sourceforge Project
- We have a git repository on the project page, accessible as:
git clone git://ml-noisebridge.git.sourceforge.net/gitroot/ml-noisebridge/ml-noisebridge
- Send an email to the list if you want to become an administrator on the site to get write access to the git repo!
Future Talks and Topics, Ideas
- Random Forests in R
- Restricted Boltzmann Machines (Mike S, some day)
- Analyzing brain cells (Mike S)
- Deep Nets w/ Stacked Autoencoders (Mike S, some day)
- Generalized Linear Models (Mike S, Erin L? some day)
- Graphical Models
- Working with the Kinect
- Computer Vision with OpenCV
Projects
- Small Group Subproblems
- Fundraising
- Noisebridge Machine Learning Course
- Kaggle Social Network Contest
- KDD Competition 2010
- HIV
Datasets and Websites
- UCI Machine Learning Repository
- DataSF.org
- Infochimps
- Face Recognition Databases
- Time Series Data Library
- Data Q&A Forum
- Metaoptimize
- Quora ML Page
- A ton of Weather Data
- MLcomp
- Upload your algorithm and objectively compare it's performance to other algorithms
- Social Security Death Master File!
Software Tools
Generic ML Libraries
- Weka
- a collection of data mining tools and machine learning algorithms.
- scikits.learn
- Machine learning Python package
- scikits.statsmodels
- Statistical models to go with scipy
- PyBrain
- Does feedforward, recurrent, SOM, deep belief nets.
- LIBSVM
- c-based SVM package
- PyML
- MDP
- Modular framework, has lots of stuff!
- VirtualBox Virtual Box Image with Pre-installed Libraries listed here
- Theano: Symbolic Expressions and Transparent GPU Integration
- sympy Does symbolic math
- Waffles
- Open source C++ set of machine learning command line tools.
- RapidMiner
- Mobile Robotic Programming Toolkit
- nitime
- NeuroImaging in Python, has some good time series analysis stuff and multi-variate response fitting.
- Pandas
- Data analysis workflow in python
- PyTables
- Adds querying capabilities to HDF5 files
- statsmodels
- Regression, time series analysis, statistics stuff for python
- Vowpal Wabbit
- "Intrinsically Fast" implementation of gradient descent for large datasets
- Shogun
- Fast implementations of SVMs
- MLPACK
- High performance scalable ML Library
- Torch
- MATLAB-like environment for state-of-the art ML libraries written in LUA
Online ML
- MOA (Massive Online Analysis)
- Offshoot of weka, has all online-algorithms
- Jubatus
- Distributed Online ML
- DOGMA
- MATLAB-based online learning stuff
- libol
- oll
- scw-learning
Graphical Models
- BUGS
- MCMC for Bayesian Models
- JAGS
- Hierarchical Bayesian Models
- Stan
- A graphical model compiler
- Jayes
- Bayesian networks in Java
Text Stuff
- Beautiful Soup
- Screen-scraping tools
- SALLY
- Tool for embedding strings into vector spaces
Collaborative Filtering
- PREA
- Personalized Recommendation Algorithms Toolkit
- SVDFeature
- Collaborative Filtering and Ranking Toolkit
Computer Vision
- OpenCV
- Computer Vision Library
- Has ML component (SVM, trees, etc)
- Online tutorials here
- DARWIN
- Generic C++ ML and Computer Vision Library
- PetaVision
- Developing a real-time, full-scale model of the primate visual cortex.
Audio Processing
- Friture
- Real-time spectrogram generation
- pyo
- Real-time audio signal processing
- PYMir
- A library for reading mp3's into python, and doing analysis
- PRAAT
- Speech analysis toolkit
- Sound Analysis Pro
- Tool for analyzing animal sounds
- Luscinia
- Software for archiving, measuring, and analyzing bioacoustic data
Data Visualization
- Orange
- Strong data visualization component
- Gephi
- Graph Visualization
- ggplot
- Nice plotting package for R
- MayaVi2
- 3D Scientific Data Visualization
- Cytoscape
- A JavaScript graph library for analysis and visualisation
- plot.ly
- Web-based plotting
Cluster Computing
- Mahout
- Hadoop cluster based ML package.
- STAR: Cluster
- Easily build your own Python computing cluster on Amazon EC2
Database Stuff
Neural Simulation
Other
Presentations and other Materials
- Awesome Machine Learning Applications -- A list of cool applications of ML
- Hands-on Machine Learning, a presentation jbm gave on 2009-01-07.
- http://www.youtube.com/user/StanfordUniversity#g/c/A89DCFA6ADACE599 Stanford Machine Learning online course videos]
- Media:Brief_statistics_slides.pdf, a presentation given on statistics for the machine learning group
- LinkedIn discussion on good resources for data mining and predictive analytics
- Face Recognition Algorithms
- Max Welling's ML classnotes
Topics to Learn and Teach
NBML Course - Noisebridge Machine Learning Curriculum (work-in-progress)
CS229 - The Stanford Machine learning Course @ noisebridge
- Supervised Learning
- Linear Regression
- Linear Discriminants
- Neural Nets/Radial Basis Functions
- Support Vector Machines
- Classifier Combination [1]
- A basic decision tree builder, recursive and using entropy metrics
- Unsupervised Learning
- Hidden Markov Models
- Clustering: PCA, k-Means, Expectation-Maximization
- Graphical Modeling
- Generative Models: gaussian distribution, multinomial distributions, HMMs, Naive Bayes
- Deep Belief Networks & Restricted Boltzmann Machines
- Reinforcement Learning
- Temporal Difference Learning
- Math, Probability & Statistics
- Metric spaces and what they mean
- Fundamentals of probabilities
- Decision Theory (Bayesian)
- Maximum Likelihood
- Bias/Variance Tradeoff, VC Dimension
- Bagging, Bootstrap, Jacknife [2]
- Information Theory: Entropy, Mutual Information, Gaussian Channels
- Estimation of Misclassification [3]
- No-Free Lunch Theorem [4]
- Machine Learning SDK's
- Applications
- Collective Intelligence & Recommendation Engines