Scalability Challenges in Big Data Science
Yesterday I gave a talk on scalability and machine learning at the BerlinBuzzword conference. I give an overview of different ways to scale data analysis and machine learning methods. I cover MapReduce (of course), large scale training of SVMs via stochastic gradient descent, but also stream mining, and real-time (as you know, “you don’t just scale into real-time”).
The conference continues today, follow the conference on Twitter on the #bbuzz hashtag.
Update: On scribd, the hyperlinks are somehow lost, so here is the list:
Posted by Mikio L. Braun at 2012-06-05 11:54:00 +0000