The previous post was about 12 probability distributions named after Irving Burr. This post is about 12 probability distributions named after Karl Pearson. The Pearson distributions are better known, and include some very well known distributions.

Burr’s distributions are defined by their CDFs; Pearson’s distributions are defined by their PDFs.

## Pearson’s differential equation

The densities of Pearson’s distributions all satisfy the same differential equation:

$$f'(x) = \frac{(a - x)\, f(x)}{c_0 + c_1 x + c_2 x^2}$$

This is a linear homogeneous differential equation, so any constant multiple of a solution is also a solution. However, a probability density must integrate to 1, so there is a unique probability density solution given *a*, *c*_{0}, *c*_{1}, and *c*_{2}.
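Here is a minimal numerical sketch of how normalization picks out one solution. It assumes the equation has the form *f*′(*x*)/*f*(*x*) = (*a* − *x*)/(*c*_{0} + *c*_{1}*x* + *c*_{2}*x*²) and that the denominator does not vanish on the chosen interval. The function name `pearson_density_on_grid` and the grid settings are made up for illustration.

```python
import math

def pearson_density_on_grid(a, c0, c1, c2, lo, hi, n=4001):
    """Integrate f'/f on a grid, then rescale so the density integrates to 1.

    Assumes f'(x)/f(x) = (a - x) / (c0 + c1*x + c2*x**2) and that the
    denominator is nonzero everywhere on [lo, hi].
    """
    h = (hi - lo) / (n - 1)
    xs = [lo + i * h for i in range(n)]
    g = [(a - x) / (c0 + c1 * x + c2 * x * x) for x in xs]

    # Cumulative trapezoid rule gives log f up to an additive constant.
    log_f = [0.0]
    for i in range(1, n):
        log_f.append(log_f[-1] + 0.5 * h * (g[i - 1] + g[i]))

    # Subtract the peak before exponentiating to avoid overflow.
    peak = max(log_f)
    f = [math.exp(v - peak) for v in log_f]

    # Trapezoid rule for the area; dividing by it fixes the free constant.
    area = h * (sum(f) - 0.5 * (f[0] + f[-1]))
    return xs, [v / area for v in f]

# Parameters a = 0, c0 = 1, c1 = c2 = 0 on a wide interval.
xs, pdf = pearson_density_on_grid(0.0, 1.0, 0.0, 0.0, lo=-10.0, hi=10.0)
mid = len(xs) // 2  # grid point at x = 0
print(pdf[mid], 1 / math.sqrt(2 * math.pi))
```

Whatever constant the integration starts from, it cancels when dividing by the area, which is the uniqueness claim in numerical form.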

## Well known distributions

Note that *f*(*x*) = exp(-*x*²/2) satisfies the differential equation above if we set *a* = 0, *c*_{0} = 1, and *c*_{1} = *c*_{2} = 0. This says the **normal distribution** is a Pearson distribution.

If *f*(*x*) = *x*^{m} exp(-*x*) then *f*′(*x*)/*f*(*x*) = (*m* − *x*)/*x*, so the differential equation is satisfied for *a* = *m*, *c*_{1} = 1, and *c*_{0} = *c*_{2} = 0. This says that the **exponential distribution** and more generally the **gamma distribution** are Pearson distributions.
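A quick finite-difference check of these two examples might look like the following. It assumes the equation is written as *f*′(*x*) = (*a* − *x*) *f*(*x*) / (*c*_{0} + *c*_{1}*x* + *c*_{2}*x*²), the sign convention under which *c*_{0} = 1 gives the normal density. The helper `pearson_residual` and the value *m* = 2.5 are illustrative choices.

```python
import math

def pearson_residual(f, x, a, c0, c1, c2, h=1e-5):
    """Return f'(x) - (a - x) f(x) / (c0 + c1 x + c2 x^2).

    The derivative is approximated with a central difference, so a
    residual near zero means f satisfies the (assumed) equation at x.
    """
    deriv = (f(x + h) - f(x - h)) / (2 * h)
    rhs = (a - x) * f(x) / (c0 + c1 * x + c2 * x * x)
    return deriv - rhs

def normal_kernel(x):
    return math.exp(-x * x / 2)

m = 2.5  # arbitrary exponent for the gamma-type kernel

def gamma_kernel(x):
    return x**m * math.exp(-x)

points = (0.5, 1.0, 3.0)
normal_res = [pearson_residual(normal_kernel, x, 0, 1, 0, 0) for x in points]
gamma_res = [pearson_residual(gamma_kernel, x, m, 0, 1, 0) for x in points]
print(normal_res)
print(gamma_res)
```

The residuals should be zero up to finite-difference error. Flipping the sign of *c*_{0} or *c*_{1} makes them visibly nonzero, which is a handy way to catch sign mistakes.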

You can also show that the **Cauchy distribution** and more generally the **Student *t* distribution** are Pearson distributions. So are the **beta distributions** (with a transformed range).
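For instance, here is a sketch of the Student *t* case, assuming the equation is written as *f*′/*f* = (*a* − *x*)/(*c*_{0} + *c*_{1}*x* + *c*_{2}*x*²). Take the unnormalized density with ν degrees of freedom:

$$f(x) = \left(1 + \frac{x^2}{\nu}\right)^{-(\nu+1)/2}$$

Differentiating the logarithm gives

$$\frac{f'(x)}{f(x)} = -\frac{\nu+1}{2}\cdot\frac{2x/\nu}{1 + x^2/\nu} = \frac{-(\nu+1)\,x}{\nu + x^2} = \frac{0 - x}{\dfrac{\nu}{\nu+1} + \dfrac{1}{\nu+1}\,x^2}$$

so under that convention *a* = 0, *c*_{0} = ν/(ν+1), *c*_{1} = 0, and *c*_{2} = 1/(ν+1). Setting ν = 1 gives the Cauchy case, with *c*_{0} = *c*_{2} = 1/2.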

## Table of Pearson distributions

The table below lists all Pearson distributions with their traditional names. The order of the list is a little strange for historical reasons.

The table uses Iverson’s bracket notation: a Boolean expression in brackets represents the function that is 1 when the condition holds and 0 otherwise. This way all densities are defined over the entire real line, though some of them are only positive over an interval.

The densities are presented without normalization constants; the constants are whatever they have to be for each function to integrate to 1. They can be complicated functions of the parameters, so they are left out for simplicity.

There is a lot of redundancy in the list. All the distributions are either special cases of or limiting cases of distributions I, IV, and VI.

Note that VII is the Student *t* distribution after you introduce a scaling factor.

## Moments

The Pearson distributions are determined by their first few moments, provided these exist, and these moments can be derived from the parameters in Pearson’s differential equation.

This suggests moment matching as a way to fit Pearson distributions to data: solve for the distribution parameters that make the exact moments match the empirical moments. Sometimes this works very well, though sometimes other approaches are better, depending on your criteria for what constitutes a good match.
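As a small illustration of moment matching, here is a sketch for one Pearson family only, the gamma distribution. A gamma distribution with shape *k* and scale θ has mean *k*θ and variance *k*θ², so matching the sample mean and variance gives *k* = mean²/variance and θ = variance/mean. The function name `fit_gamma_by_moments` and the simulated data are illustrative assumptions, not a general Pearson fitting routine.

```python
import random
import statistics

def fit_gamma_by_moments(data):
    """Match the sample mean and variance to a gamma(shape, scale).

    Uses mean = shape * scale and variance = shape * scale**2.
    """
    mean = statistics.fmean(data)
    var = statistics.pvariance(data, mu=mean)
    shape = mean * mean / var
    scale = var / mean
    return shape, scale

# Simulated data with known parameters (shape 3, scale 2) to test the fit.
random.seed(0)
sample = [random.gammavariate(3.0, 2.0) for _ in range(100_000)]
shape, scale = fit_gamma_by_moments(sample)
print(shape, scale)
```

With real data there are no true parameters to compare against, so the quality of the fit has to be judged some other way, such as comparing higher empirical moments or quantiles with those of the fitted distribution.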