From the monthly archives:

March 2009

Academic publications as spam

by John on March 4, 2009

From an article by Luis von Ahn on academic publications:

… there is an insane number of papers written every year, the vast majority of which contribute very little (or not at all) to our collective knowledge. This is basically spam. In fact, for many papers (including some of my own), the actual idea of the paper could be stated in one paragraph, but somehow people manage to write 10 pages of it.

Hat tip: Academic Productivity via @IanMulvany

{ 3 comments }

Here are three ways to evaluate a person’s performance.

  1. How good are they at their worst?
  2. How good are they on average?
  3. How good are they at their best?

Schools use the first two criteria, but the market uses the third.

Schools evaluate people at their worst

Teachers average grades to come up with semester grades, and semester grades feed into a grade point average. So in some sense schools evaluate average performance.

But in more subtle ways schools evaluate students by how good they are at their worst. To graduate, your lowest course grade in all your required courses must be passing. No amount of brilliance in one area can compensate for a failing grade in another area. Your creative writing grades are excellent, Mr. Shakespeare, but we cannot let you graduate until you pass physics.

How do you get on the honor roll? Your lowest grade has to be above a certain level. Again, what matters is how good you are at your worst.

How do you get to be valedictorian? Be good enough at every class to get an A. You have to be pretty good at everything, but you don’t have to be truly exceptional at anything.

Schools encourage perfectionism, not excellence. They encourage people to avoid mistakes, not to be creative.

Markets evaluate people at their best

Markets often evaluate people and products at their best.

If you write 100 obscure novels and one best-seller, you’re a best-selling author. If you consistently write moderately popular novels, you’re not. If you write one really good novel, you might get a Nobel prize. Imagine the Nobel committee evaluating a writer saying “Yeah, these two novels were brilliant, world-changing. But he also wrote this one novel that was mediocre. Let’s give the prize to someone whose books are consistently pretty good.”

The Ford F150 did poorly in focus groups. The average rating wasn’t good. But the people who liked it really liked it. And the F150 went on to be the most popular truck in history. All that matters in business is people who like your product enough to buy it. You don’t make any money by being everyone’s second choice.

If a company has one product that is a runaway success, the company is a success. If it has two or three runaway successes, even better. But a company can produce a few dismal failures (think Microsoft Bob or the Apple Newton) and still do quite well if their flagship products succeed.The same is true of the people behind these products. Someone can make a successful career with one big win even if they have a number of failures.

We all want others to see the best in us. There are ethical and economic reasons to look for the best in others. But years of education can incline us to look for the worst in others and in ourselves.

Related posts:

Quantity and quality
Four reasons we don’t apply the 80/20 rule
Gerald Weinberg’s law of twins

{ 13 comments }

{ 4 comments }

StackOverflow reputation statistics

by John on March 2, 2009

What is the distribution of StackOverflow user reputation scores? Here’s a graph of the number of users with reputations between 0 and 100, 100 and 200, …, 900 and 1000. (The scores go out much further, but the curve looks flat compared to the peak unless you zoom in further.)

This graph was based on a snapshot of the user reputations one day last week. The largest group, 15,219 users, had reputation less than 100. There were 2,494 users with reputation between 100 and 200, etc. The number of users in a 100-point reputation range generally decreases as the reputation score increases. The majority of users have reputation less than 100, and yet the top percentile have reputations over 4,800 and the highest reputation was 38,700. This sort of extreme disparity suggests a power law distribution.

The test for whether the reputation scores follow a power law is to plot the logarithms of the number of people with each score and look for a straight line. And after an initial steep drop off, the logs of the counts do fall roughly on a straight line.

(The graph goes out to scores below 7,700. Beyond that point, there are a few empty 100-point ranges. I stopped at 7,700 to avoid taking logs of zeros.)

The average reputation was 364, though the average does not mean much with a power law distribution. Instead of a bell shape centered around the average, there is a long tail. The average is not the middle because there is no middle to a power law.

Update: As pointed out in the comments, I should have plotted with the log of the reputation score to test for a power law distribution. Whether or not there is a power law here, however, there is a long tail and there’s no useful “middle.”

Other posts about StackOverflow:

Voting patterns on StackOverflow
StackOverflow question statistics

Other posts about power laws:

Networks and power laws
Rate of regularizing English verbs
Metabolism and power laws

{ 7 comments }

What is the shape of the Earth?

by John on March 2, 2009

To first approximation, out planet is a sphere. But how accurate is that approximation? What’s a better approximation? How good is that? This post will answer these questions and some related questions.

How well does a sphere describe the Earth’s shape? The Earth’s polar diameter is about 43 kilometers shorter than its equatorial diameter, a difference of about 0.3%.This is due to the equatorial bulge caused by the Earth’s rotation.

What’s a more accurate description of the Earth’s shape? An oblate spheroid.

What is an oblate spheroid? It’s the shape you get by spinning an ellipse around it’s minor axis. That says if you were to take a cross-section of the Earth containing the polar axis, the shape you get would be an ellipse. The polar axis would be the minor axis and the equatorial axis would be the major axis. But if you were to take a cross-section through the equator, or any plane parallel to the equator, you’d get a circle.

What is a prolate spheroid? A prolate spheroid is what you get by spinning an ellipse around its major axis.

What is an ellipsoid? An ellipsoid satisfies the following equation.

\left(\frac{x}{a}\right)^2 + \left(\frac{y}{b}\right)^2 + \left(\frac{z}{c}\right)^2 = 1

A sphere is an ellipsoid with a = b = c. An oblate spheroid is an ellipsoid with a = b > c. A prolate spheroid is an ellipsoid with a = b < c. A scalene ellipsoid is an ellipsoid for which a, b, and c are all distinct.

How good is the oblate spheroid model? The error in approximating the Earth’s shape as an oblate spheroid is less than 100 meters, two orders of magnitude better than the spherical model.

How are other planets shaped? The other planets in our solar system are also oblate spheroids with Saturn being the most oblate: the polar diameter of Saturn is about 10% shorter than its equatorial diameter.

Related post: Finding distances using longitude and latitude

{ 1 comment }