Friday miscellany

by John on June 3, 2011

Eclectic

Vintage advertising
The evolution of email
Solving the wrong problem
Alternative medicines and placebos

Truth stranger than fiction

Larry Ellison suing neighbors over his view
Towing icebergs

Math

Elementary Applied Topology draft textbook
Introduction to category theory
Mathematical model of walking

Computing

Lifetimes of cryptographic hash functions
How much Gnu is there in Linux?
DragonEgg: gcc plug-in to target LLVM

Statistics and machine learning

Machine learning demos
On the accuracy of statistical procedures in Excel 2007
R reference card for data mining
Wisdom of statistically manipulated crowds

{ 1 comment… read it below or add one }

1

Jerzy 06.06.11 at 07:27

Thank you for continuing to share such interesting links! The airplane-design story is great. And the alternative medicine link made me laugh for a moment. If they’re all worthless, you should *expect* about 5% of them to look better than placebo at significance level alpha=0.05…

In the “wisdom of crowds” article, I was trying to understand why the author (Nick Carr) seems to think that the arithmetic mean is a natural measure to use while the median and geometric mean are “arcane” statistical inventions which “massage” the answers. It seems to me that the median and geometric mean are, in fact, the more “natural” approaches for questions where responses vary across many orders of magnitude, but commenters haven’t been able to convince him yet and he seems to have had enough of the discussion… Too bad.
Can you think of some simple examples that would clearly illustrate when geometric means might be more natural? For example, perhaps if we asked the crowd “How many zeros are there in a trillion?” we’d take the arithmetic mean, while if we asked them to “Write down a trillion” we’d take the geometric mean…

And that got me wondering if the arithmetic mean is so common in statistics largely because it’s easy to use in proofs: you can take derivatives of the mean (and the variance, as a sum of squared errors) but not of the median (or the sum of absolute errors). If not for this analytical simplicity, might medians be more commonly used than means?

Leave a Comment

You can use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>