[ml] Hadoop going forward
Andreas von Hessling
vonhessling at gmail.com
Sun May 23 00:07:21 PDT 2010
Is there a way you could easily create a download comprising all the files
in your GitHub project, please? As it stands, it seems we'd need to
download each file individually in the browser. (I've never
used GitHub before, so please excuse my ignorance.)
The setup instructions work great for me as well. I'm now trying to
get a better sense of what your functions do and how to use them.
On Wed, May 19, 2010 at 11:38 PM, Vikram Oberoi <voberoi at gmail.com> wrote:
> Hey folks,
> For those of you that came out tonight, I hope the code I walked through and
> initial (albeit rough) overview of MapReduce helped. If you guys have any
> questions or requests, the best way to ask would be to:
> a) direct an email to me over ml at lists.noisebridge.net or...
> b) open an issue at the GitHub
> project: http://github.com/voberoi/hadoop-mrutils
> Either way, someone else might be able to answer first, and everyone
> will benefit from the answer, since there's a high probability that others
> will have the same questions.
> For next week, I'm going to write a script that transforms the KDD dataset
> in... some useful way. Your input on what exactly I should do here is
> most welcome. The transformation should be involved enough that the code can
> serve as an example for scripts you all might implement later.
> I'll also be taking a look at Apache Mahout (a library containing Hadoop
> MapReduce implementations of numerous machine learning algorithms) and
> writing up an example of how to use it. If you have a particular algorithm
> that you want to apply to the dataset, check if it's in the Mahout library
> and let me know.
> Finally, is any brainstorming/discussion about what we're doing happening
> anywhere other than the meetups? I'd be happy to meet again some time before
> next Wednesday to hash out some ideas and run with them, as in-person
> conversation bandwidth is *so* much higher. Alternately, we could throw out
> ideas on the list and brainstorm over email threads. It doesn't seem like
> there's a whole lot of action on the wiki other than links to resources and
> TODOs. Or is there?