<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Three views of the negative binomial distribution</title>
	<atom:link href="http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/</link>
	<description>The blog of John D. Cook</description>
	<lastBuildDate>Sat, 11 Feb 2012 01:10:06 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: John Cook y sus tres acercamientos a la distribución binomial negativa</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-99010</link>
		<dc:creator>John Cook y sus tres acercamientos a la distribución binomial negativa</dc:creator>
		<pubDate>Wed, 17 Aug 2011 23:16:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-99010</guid>
		<description>[...] Cook plantea acá una interesante discusión acerca de la interpretación de la distribución binomial negativa. [...]</description>
		<content:encoded><![CDATA[<p>[...] Cook plantea acá una interesante discusión acerca de la interpretación de la distribución binomial negativa. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-97213</link>
		<dc:creator>John</dc:creator>
		<pubDate>Mon, 08 Aug 2011 10:01:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-97213</guid>
		<description>Hypergeometric functions have many faces. Yes, they were studied first because of their connection to differential equations. But later people realized they were useful in combinatorics as generating functions. This latter perspective is easier to connect to probability. Think about the combinatorial properties of the coefficients in the series rather than the analytical properties of the function.</description>
		<content:encoded><![CDATA[<p>Hypergeometric functions have many faces. Yes, they were studied first because of their connection to differential equations. But later people realized they were useful in combinatorics as generating functions. This latter perspective is easier to connect to probability. Think about the combinatorial properties of the coefficients in the series rather than the analytical properties of the function.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: zhanxw</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-97165</link>
		<dc:creator>zhanxw</dc:creator>
		<pubDate>Mon, 08 Aug 2011 04:36:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-97165</guid>
		<description>Thanks Jan Galkowski for your suggested chapter. I read it from our library links. It does make sense that hypergeometric distribution got its name from its connection to hypergeometric series by a scaling factor. However, the hypergeometric series and hypergeometric function are derived when studying second-order linear ordinary differential equation, and it thus appears mysterious that why  statisticians want to bring the context of physics to statistics. Do you have more ideas? I guess there might be some anecdote beyond coincidence. Thanks.</description>
		<content:encoded><![CDATA[<p>Thanks Jan Galkowski for your suggested chapter. I read it from our library links. It does make sense that hypergeometric distribution got its name from its connection to hypergeometric series by a scaling factor. However, the hypergeometric series and hypergeometric function are derived when studying second-order linear ordinary differential equation, and it thus appears mysterious that why  statisticians want to bring the context of physics to statistics. Do you have more ideas? I guess there might be some anecdote beyond coincidence. Thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jan Galkowski</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-97131</link>
		<dc:creator>Jan Galkowski</dc:creator>
		<pubDate>Mon, 08 Aug 2011 01:46:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-97131</guid>
		<description>According to Bishop, Fienberg, and Holland in &lt;i&gt;Discrete Multivariate Analysis&lt;/i&gt;, Springer, 2007, section 13.5, pages 448-449, the hypergeometric distribution gets its name because hypergeometric probabilities are scaled terms from the hypergeometric series. This presentation context is symbolically challenged, so I just refer the reader to equations (13.5-1) through (13.5-5) of the cited text.  

By the way, I find this text &lt;i&gt;very&lt;/i&gt; useful and illuminating, in many ways.  I came to it through its extension of the Fisher Exact Test to instances where the &lt;i&gt;extended hypergeometric distribution&lt;/i&gt; is more appropriate for tables of counts, where the model suggests the populations compared may not have the same probabilities of appearance.  There&#039;s a lot more to this book. 

I have also looked at both Fienberg, &lt;i&gt;The Analysis of Cross-Classified Categorical Data&lt;/i&gt; (2nd edition, Springer, 2007) , which is useful, and Congdon&#039;s &lt;i&gt;Bayesian Models for Categorical Data&lt;i&gt;, Wiley, 2005. I find Congdon less useful, even if he incorporates some of Epstein and Fienberg, &quot;Bayesian estimation in multidimensional contingency tables&quot; (from &lt;i&gt;Computer Science and Statistics: Proceedings of the 23rd Symposium on the Interface&lt;/i&gt;, Keramidas, E, editor, &lt;i&gt;Inteface Foundation of North America&lt;/i&gt;: Fairfax, 37-47). 

Hope this helps.</description>
		<content:encoded><![CDATA[<p>According to Bishop, Fienberg, and Holland in <i>Discrete Multivariate Analysis</i>, Springer, 2007, section 13.5, pages 448-449, the hypergeometric distribution gets its name because hypergeometric probabilities are scaled terms from the hypergeometric series. This presentation context is symbolically challenged, so I just refer the reader to equations (13.5-1) through (13.5-5) of the cited text.  </p>
<p>By the way, I find this text <i>very</i> useful and illuminating, in many ways.  I came to it through its extension of the Fisher Exact Test to instances where the <i>extended hypergeometric distribution</i> is more appropriate for tables of counts, where the model suggests the populations compared may not have the same probabilities of appearance.  There&#8217;s a lot more to this book. </p>
<p>I have also looked at both Fienberg, <i>The Analysis of Cross-Classified Categorical Data</i> (2nd edition, Springer, 2007) , which is useful, and Congdon&#8217;s <i>Bayesian Models for Categorical Data</i><i>, Wiley, 2005. I find Congdon less useful, even if he incorporates some of Epstein and Fienberg, &#8220;Bayesian estimation in multidimensional contingency tables&#8221; (from </i><i>Computer Science and Statistics: Proceedings of the 23rd Symposium on the Interface</i>, Keramidas, E, editor, <i>Inteface Foundation of North America</i>: Fairfax, 37-47). </p>
<p>Hope this helps.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-97096</link>
		<dc:creator>John</dc:creator>
		<pubDate>Sun, 07 Aug 2011 21:18:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-97096</guid>
		<description>I don&#039;t really know how the hypergeometric &lt;em&gt;distribution&lt;/em&gt; gets its name, but I know how the hypergeometric &lt;em&gt;series&lt;/em&gt; was named. Hypergeometric series are a generalization of the geometric series, and I would assume the hypergeometric &lt;em&gt;distribution&lt;/em&gt; has some connection to the hypergeometric &lt;em&gt;series&lt;/em&gt;.</description>
		<content:encoded><![CDATA[<p>I don&#8217;t really know how the hypergeometric <em>distribution</em> gets its name, but I know how the hypergeometric <em>series</em> was named. Hypergeometric series are a generalization of the geometric series, and I would assume the hypergeometric <em>distribution</em> has some connection to the hypergeometric <em>series</em>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: zhanxw</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-97093</link>
		<dc:creator>zhanxw</dc:creator>
		<pubDate>Sun, 07 Aug 2011 21:11:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-97093</guid>
		<description>Thanks John for your work. The content is a well organized refresh to me, and it is first time I feel that I understand the naming of &quot;negative&quot; binomial. Do you plan to write something about how the name &quot;hypergeometric&quot; comes from? Thanks.</description>
		<content:encoded><![CDATA[<p>Thanks John for your work. The content is a well organized refresh to me, and it is first time I feel that I understand the naming of &#8220;negative&#8221; binomial. Do you plan to write something about how the name &#8220;hypergeometric&#8221; comes from? Thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joseph M Hilbe</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-35421</link>
		<dc:creator>Joseph M Hilbe</dc:creator>
		<pubDate>Fri, 26 Mar 2010 19:56:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-35421</guid>
		<description>You may want to look at my 2007 book, &quot;Negative Binomial Regression&quot; (Cambridge University Press). I am nearly finished with a substantially expanded second edition, that will come out at about 500 pages in length. I use Stata and R for examples, many of which i have written. I&#039;ve seen many data sets that are equi-dispersed, in fact, I&#039;ve seen underdispersed Poisson models. These of course cannot be modeled using the traditional NB procedure. But a generalized Poisson, as well as hurdle models and double Poisosn models can estimate under-dispersed count data. 

There are a wide variety of NB models, including one that is not a Poisson-gamma mixture, but rather derives directly from PDF as understood by the first version mentioned by John. I have called this the NB-C parameterization, and it is not related to the Poisson at all. On the other hand, with the variance defined as mu+alpha*mu^2, where alpha is the overdispersion parameter and mu is the mean, the traditonal NB, called in the literature as NB2, is parameterized such that the Poisson model is a NB with alpha=0. A geometric model is NB with alpha=1. A NB2 model can be extra-dispersed as well -- either uncer of over dispersed. The data can in fact be Poisson overdispersed but NB under-dispersed. How to identify possible overdispersion (correlation)  and its causes is important in deciding which type of count model to use for a given data situation.  If you are really interested in this subject, email me and I can perhaps refer you to sources that will help you understand the family of negative binomial models a bit better. Hilbe@asu.edu</description>
		<content:encoded><![CDATA[<p>You may want to look at my 2007 book, &#8220;Negative Binomial Regression&#8221; (Cambridge University Press). I am nearly finished with a substantially expanded second edition, that will come out at about 500 pages in length. I use Stata and R for examples, many of which i have written. I&#8217;ve seen many data sets that are equi-dispersed, in fact, I&#8217;ve seen underdispersed Poisson models. These of course cannot be modeled using the traditional NB procedure. But a generalized Poisson, as well as hurdle models and double Poisosn models can estimate under-dispersed count data. </p>
<p>There are a wide variety of NB models, including one that is not a Poisson-gamma mixture, but rather derives directly from PDF as understood by the first version mentioned by John. I have called this the NB-C parameterization, and it is not related to the Poisson at all. On the other hand, with the variance defined as mu+alpha*mu^2, where alpha is the overdispersion parameter and mu is the mean, the traditonal NB, called in the literature as NB2, is parameterized such that the Poisson model is a NB with alpha=0. A geometric model is NB with alpha=1. A NB2 model can be extra-dispersed as well &#8212; either uncer of over dispersed. The data can in fact be Poisson overdispersed but NB under-dispersed. How to identify possible overdispersion (correlation)  and its causes is important in deciding which type of count model to use for a given data situation.  If you are really interested in this subject, email me and I can perhaps refer you to sources that will help you understand the family of negative binomial models a bit better. <a href="mailto:Hilbe@asu.edu">Hilbe@asu.edu</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Myles White</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-30648</link>
		<dc:creator>John Myles White</dc:creator>
		<pubDate>Tue, 12 Jan 2010 18:11:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-30648</guid>
		<description>This was incredibly useful, John. I&#039;ve known just a little about the negative binomial distribution for a while now after reading the start of &quot;Inference and Disputed Authorship,&quot; but never fully appreciated the distribution&#039;s place. Thanks for giving such a useful buildup of motivating intuitions for it.</description>
		<content:encoded><![CDATA[<p>This was incredibly useful, John. I&#8217;ve known just a little about the negative binomial distribution for a while now after reading the start of &#8220;Inference and Disputed Authorship,&#8221; but never fully appreciated the distribution&#8217;s place. Thanks for giving such a useful buildup of motivating intuitions for it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jan Galkowski</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-30001</link>
		<dc:creator>Jan Galkowski</dc:creator>
		<pubDate>Sat, 02 Jan 2010 01:03:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-30001</guid>
		<description>I&#039;m wondering what the general expression for the variance of a mixture distribution might look like.  To be specific, suppose X(1), X(2), ..., X(n) are r.v. each having means u(1), u(2), ..., u(n), respectively, and variances v(1), v(2), ..., v(n). The k-th distribution is chosen with probability p(k), and the choice is mutually exclusive. So, in each case, it&#039;s a multinomial gating which of the n distributions are used.  In the case of n = 2, 

VAR[...] = p(1) v(1) + p(2) v(2) + p(1) p(2) [u(1) - u(2)]^2.

The question is, what does this look like in general?  It had been suggested to me that the general expression, given the intermediate definition,

 Y(i) = P(S == i) X(i)

for the variance looks like

SUM-over-i VAR[Y(i)] + 2 SUM-over-i-not-equal-j COV[Y(i),Y(j)]

where, of course, COV[U,V] = E[U V] - E[U] E[V]

Here, since Y(i) and Y(j) are mutually exclusive, E[Y(i) Y(j)] is presumably zero.

But I cannot reconcile this in the case of two r.v. with n = 2 expression.

Any suggestions?</description>
		<content:encoded><![CDATA[<p>I&#8217;m wondering what the general expression for the variance of a mixture distribution might look like.  To be specific, suppose X(1), X(2), &#8230;, X(n) are r.v. each having means u(1), u(2), &#8230;, u(n), respectively, and variances v(1), v(2), &#8230;, v(n). The k-th distribution is chosen with probability p(k), and the choice is mutually exclusive. So, in each case, it&#8217;s a multinomial gating which of the n distributions are used.  In the case of n = 2, </p>
<p>VAR[...] = p(1) v(1) + p(2) v(2) + p(1) p(2) [u(1) - u(2)]^2.</p>
<p>The question is, what does this look like in general?  It had been suggested to me that the general expression, given the intermediate definition,</p>
<p> Y(i) = P(S == i) X(i)</p>
<p>for the variance looks like</p>
<p>SUM-over-i VAR[Y(i)] + 2 SUM-over-i-not-equal-j COV[Y(i),Y(j)]</p>
<p>where, of course, COV[U,V] = E[U V] &#8211; E[U] E[V]</p>
<p>Here, since Y(i) and Y(j) are mutually exclusive, E[Y(i) Y(j)] is presumably zero.</p>
<p>But I cannot reconcile this in the case of two r.v. with n = 2 expression.</p>
<p>Any suggestions?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: EastwoodDC</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-26479</link>
		<dc:creator>EastwoodDC</dc:creator>
		<pubDate>Mon, 26 Oct 2009 02:09:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-26479</guid>
		<description>I had not ever though about the NB being a generalization of the Poisson, but that is a useful was to think of it. A colleague of vast experience tells me he has never fit a Poisson model where overdispersion was not a problem.</description>
		<content:encoded><![CDATA[<p>I had not ever though about the NB being a generalization of the Poisson, but that is a useful was to think of it. A colleague of vast experience tells me he has never fit a Poisson model where overdispersion was not a problem.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Cook y sus tres acercamientos a la distribución binomial negativa &#171; Apuntes de Estadística</title>
		<link>http://www.johndcook.com/blog/2009/09/22/negative-binomial-distribution/comment-page-1/#comment-25047</link>
		<dc:creator>John Cook y sus tres acercamientos a la distribución binomial negativa &#171; Apuntes de Estadística</dc:creator>
		<pubDate>Thu, 24 Sep 2009 17:39:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3238#comment-25047</guid>
		<description>[...] Cook plantea acá una interesante discusión acerca de la interpretación de la distribución binomial negativa. [...]</description>
		<content:encoded><![CDATA[<p>[...] Cook plantea acá una interesante discusión acerca de la interpretación de la distribución binomial negativa. [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.389 seconds -->

