View posts by date.
Most posts haven't been categorized yet.
Mikio's Guide To Real-Time Big Data
Designing ML frameworks
Misconceptions about the CAP Theorem
More Google Big Data papers: Megastore and Spanner
Stream Processing has no Query Layer
Big Data beyond MapReduce: Google's Big Data papers
Streamdrill compared to other approaches for the Top-K-Problem
What is streamdrill good for?
jblas finally on central Maven repository
Download the streamdrill demo
Levels of Abstractions in Big Data
Why you don't want real-time analytics to be exact
Twitter in 2011 revamped
Video for talk: 'TWIMPACT: Real-Time Twitter Analysis'
Talk: On Real-Time Twitter Analysis
What is Data Science?
Scala discussion heating up?
Slides for my LinuxTag talk on Cassandra
Tuning ATLAS for jblas
Getting Started in Scala
jblas 1.2.0: A look behind the scenes
Cassandra Garbage Collection Tuning
Some Tips On Using Cassandra
Companion Objects as Classes in Scala
Why you should listen to your supervisor
The Perpetual Conference
Is Machine Learning Losing Impact?
Talk: Some Introductory Remarks on Bayesian Inference
My thoughts on the NY Times article: Troves of Personal Data, Forbidden to Researchers
Introducing Data Science Seminars
Fast Cross Validation
Analyzing Social Media Data
Peer Review and NoSQL
Machine Learning: Beyond Prediction Accuracy
A Rejected Paper Is Not The End
MLOSS workshop at ICML 2010
A Bit of Background on "Bayes vs. Frequentists"
Book Review: 'Debt: the first 5000 years' by David Graeber
Book review: "Start with Why" by Simon Sinek and Apple's Patent Wars
Short Review: Visualize This by Nathan Yau
Short Review of Edward R. Tufte's "The Visual Display of Quantitative Information"
From the Cluetrain Manifesto to Social Media
Books in Pairs: Programming in Scala vs. Programming Scala
Reclaim your data, own a piece of the cloud!
Lunch talk: Google Reader
Coffee Talk: Twitter Should Have Monetized Their API