Machine Learning in Action

A couple months ago I briefly reviewed Machine Learning for Hackers by Drew Conway and John Myles White. Today I’m looking at Machine Learning in Action by Peter Harrington and comparing the two books.

Both books are about the same size and cover many of the same topics. One difference between the two books is choice of programming language: ML for Hackers uses R for its examples, ML in Action uses Python.

ML in Action doesn’t lean heavily on Python libraries. It mostly implements its algorithms from scratch, with a little help from NumPy for linear algebra, but it does not use ML libraries such as scikit-learn. It sometimes uses Matplotlib for plotting and uses Tkinter for building a simple GUI in one chapter. The final chapter introduces Hadoop and Amazon Web Services.

ML for Hackers is a little more of a general introduction to machine learning. ML in Action contains a brief introduction to machine learning in general, but quickly moves on to specific algorithms. ML for Hackers spends a good number of pages discussing data cleaning. ML in Action starts with clean data in order to spend more time on algorithms.

ML in Action takes 8 of the top 10 algorithms in machine learning (as selected by this paper) and organizes around these algorithms. (The two algorithms out of the top 1o that didn’t make it into ML in Action were PageRank, because it has been covered well elsewhere, and EM, because its explanation requires too much mathematics.) The algorithms come first in ML in Action, illustrations second. ML for Hackers puts more emphasis on its examples and reads a bit more like a story. ML in Action reads a little more like a reference book.

//www.johndcook.com/blog/2008/06/27/wine-beer-and-statistics/#comment-170809

6 thoughts on “Machine Learning in Action”

Jordan

15 May 2012 at 09:52

So would you recommend reading both books in parallel to get the most out of them?

John

15 May 2012 at 10:00

No, I’d recommend picking one. You could pick the book whose programming language you prefer (R vs Python) or the book whose style you prefer. The styles aren’t that different, though I’d say ML for Hackers is a little more conversational and ML in Action is a little more instructional.

SteveBrooklineMA

15 May 2012 at 13:37

I see. You can have ML without EM, but you can’t have EM without ML. Maybe a different ML though.

Alex

16 May 2012 at 04:02

EM, is that EM as in Expectation-Maximization? the algorithm for approximating MaxLikelihood in statistical inference? I don’t know much on Machine Learning, but this did ring a bell…

nexone

16 May 2012 at 08:39

So which one would you recommend as first ML introduction reading?

Geoff Knauth

16 May 2012 at 09:01

Thank you so much for your insights!

Comments are closed.