<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Four types of errors</title>
	<atom:link href="http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/</link>
	<description>The blog of John D. Cook</description>
	<lastBuildDate>Sat, 11 Feb 2012 01:10:06 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Jerzy</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-120826</link>
		<dc:creator>Jerzy</dc:creator>
		<pubDate>Fri, 09 Dec 2011 17:00:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-120826</guid>
		<description>I really like the idea of Type S error but am having trouble thinking through how to apply it in practice.
Say you expect θj and θk to be pretty close, but certainly not identical. So you&#039;d like to be able to say one of these three things: &quot;θj is bigger than θk,&quot; &quot;θk is bigger than θj,&quot; or &quot;We don&#039;t have enough evidence to decide which is bigger.&quot;
Then, let&#039;s say you do what many people do in practice: perform a standard test of H0: θj=θk at significance level 0.05. Then either you fail to reject H0, saying &quot;I don&#039;t have enough evidence to decide which one is the bigger one&quot;...  or you do reject H0, then say &quot;They are statistically significantly different and θj has the higher point estimate so I am confident that θj is bigger than θk&quot; (or vice versa, depending).
Can we say anything precise about our probability of Type S error under this procedure?</description>
		<content:encoded><![CDATA[<p>I really like the idea of Type S error but am having trouble thinking through how to apply it in practice.<br />
Say you expect θj and θk to be pretty close, but certainly not identical. So you&#8217;d like to be able to say one of these three things: &#8220;θj is bigger than θk,&#8221; &#8220;θk is bigger than θj,&#8221; or &#8220;We don&#8217;t have enough evidence to decide which is bigger.&#8221;<br />
Then, let&#8217;s say you do what many people do in practice: perform a standard test of H0: θj=θk at significance level 0.05. Then either you fail to reject H0, saying &#8220;I don&#8217;t have enough evidence to decide which one is the bigger one&#8221;&#8230;  or you do reject H0, then say &#8220;They are statistically significantly different and θj has the higher point estimate so I am confident that θj is bigger than θk&#8221; (or vice versa, depending).<br />
Can we say anything precise about our probability of Type S error under this procedure?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Type R error &#8212; The Endeavour</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-120769</link>
		<dc:creator>Type R error &#8212; The Endeavour</dc:creator>
		<pubDate>Fri, 09 Dec 2011 13:01:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-120769</guid>
		<description>[...] Gelman added a couple more types of error to the standard repertoire of type I and type II errors. He suggests using type S [...]</description>
		<content:encoded><![CDATA[<p>[...] Gelman added a couple more types of error to the standard repertoire of type I and type II errors. He suggests using type S [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jean-louis</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-110188</link>
		<dc:creator>jean-louis</dc:creator>
		<pubDate>Tue, 25 Oct 2011 19:36:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-110188</guid>
		<description>Hi John, 
&lt;cite&gt;“significantly different” is related to the strength of evidence of a difference, not the size of the difference&lt;/cite&gt;
I agree, but I did not say that, did I? I guess you can measure the size of the difference with a hypothesis like θj = θk+epsilon. With an inequality hypothesis like θj &gt; θk you do not solve this problem, either. 

The type M error would do, but I would need to investigate further to know what it is all about :-)</description>
		<content:encoded><![CDATA[<p>Hi John,<br />
<cite>“significantly different” is related to the strength of evidence of a difference, not the size of the difference</cite><br />
I agree, but I did not say that, did I? I guess you can measure the size of the difference with a hypothesis like θj = θk+epsilon. With an inequality hypothesis like θj &gt; θk you do not solve this problem, either. </p>
<p>The type M error would do, but I would need to investigate further to know what it is all about <img src='http://www.johndcook.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-110184</link>
		<dc:creator>John</dc:creator>
		<pubDate>Tue, 25 Oct 2011 19:17:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-110184</guid>
		<description>jean-louis: In statistics, &quot;significantly different&quot; is related to the strength of evidence of a difference, not the size of the difference. The null hypothesis is typically that two treatments have exactly the same effect. If there is statistically significant evidence that the null is false, that doesn&#039;t mean that there is a large difference in the effects, only a large amount of evidence that there is a non-zero difference. You could have strong evidence of a small effect.</description>
		<content:encoded><![CDATA[<p>jean-louis: In statistics, &#8220;significantly different&#8221; is related to the strength of evidence of a difference, not the size of the difference. The null hypothesis is typically that two treatments have exactly the same effect. If there is statistically significant evidence that the null is false, that doesn&#8217;t mean that there is a large difference in the effects, only a large amount of evidence that there is a non-zero difference. You could have strong evidence of a small effect.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jean-louis</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-110182</link>
		<dc:creator>jean-louis</dc:creator>
		<pubDate>Tue, 25 Oct 2011 19:01:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-110182</guid>
		<description>Hi John,
&lt;cite&gt;The point is that no two treatments are ever identical.&lt;/cite&gt;
While I would agree with this, I believe it still does not contradict the hypothesis θj = θk, in which the θs do not represent the &quot;treatments&quot; (which, for the experiments to be useful, have to be different!), but rather, usually, the effects of the treatments.

I am not an expert on the topic, nor an expert on statistics, so you might need to correct me if I&#039;m wrong. How I understand the null hypothesis is: &quot;are the effects of treatment A statistically different from those of treatment B?&quot; and usually, you don&#039;t really get a clear answer, but rather a measure telling you how likely it is that they are different (and hopefully medical publications are taking great care to keep this in mind).

As for type S or M errors, I am not sure how that would work. However, it made me think of the difference between one-tailed and two-tailed hypothesis tests. As I understand it, type S or M errors would still be type I or II errors, but given a different kind of hypothesis (inequality or equality, demonstration is left as homework ;-)).

Or am I completely off-topic? Even more than all the above spam about spam?!! (no offense, just thought about the funny connection...)</description>
		<content:encoded><![CDATA[<p>Hi John,<br />
<cite>The point is that no two treatments are ever identical.</cite><br />
While I would agree with this, I believe it still does not contradict the hypothesis θj = θk, in which the θs do not represent the &#8220;treatments&#8221; (which, for the experiments to be useful, have to be different!), but rather, usually, the effects of the treatments. </p>
<p>I am not an expert on the topic, nor an expert on statistics, so you might need to correct me if I&#8217;m wrong. How I understand the null hypothesis is: &#8220;are the effects of treatment A statistically different from those of treatment B?&#8221; and usually, you don&#8217;t really get a clear answer, but rather a measure telling you how likely it is that they are different (and hopefully medical publications are taking great care to keep this in mind). </p>
<p>As for type S or M errors, I am not sure how that would work. However, it made me think of the difference between one-tailed and two-tailed hypothesis tests. As I understand it, type S or M errors would still be type I or II errors, but given a different kind of hypothesis (inequality or equality, demonstration is left as homework <img src='http://www.johndcook.com/blog/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> ). </p>
<p>Or am I completely off-topic? Even more than all the above spam about spam?!! (no offense, just thought about the funny connection&#8230;)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andy</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76689</link>
		<dc:creator>Andy</dc:creator>
		<pubDate>Mon, 18 Apr 2011 08:38:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76689</guid>
		<description>Roman: thanks!</description>
		<content:encoded><![CDATA[<p>Roman: thanks!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roman Cheplyaka</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76674</link>
		<dc:creator>Roman Cheplyaka</dc:creator>
		<pubDate>Mon, 18 Apr 2011 03:54:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76674</guid>
		<description>Andy: for example, here are the features that SpamAssassin tests for: http://wiki.apache.org/spamassassin/RulesList
(They are separate from the Bayes classifier and are more about meta-information than the contents.)</description>
		<content:encoded><![CDATA[<p>Andy: for example, here are the features that SpamAssassin tests for: <a href="http://wiki.apache.org/spamassassin/RulesList" rel="nofollow">http://wiki.apache.org/spamassassin/RulesList</a><br />
(They are separate from the Bayes classifier and are more about meta-information than the contents.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andy</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76604</link>
		<dc:creator>Andy</dc:creator>
		<pubDate>Sun, 17 Apr 2011 21:40:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76604</guid>
		<description>What spam filters do could be viewed in terms of, say, a logistic regression model predicting the probability that an email is spam as a function of a bunch of features, e.g., presence of a particular word.  Then, you&#039;ve got a slope for each feature, estimated using seen emails (the sample).  Null hypothesis: slope = 0, for each slope.

Type S error would be inferring that a feature is indicative of spam when it&#039;s indicative of a safe email or vice versa.  Type M error would be, e.g., inferring that the presence of a particular feature is more likely to indicate spam than it really is in the broader population of emails.

Back to the oldies: a type 1 error would be inferring, based on sample emails, that a feature is predictive of spam (i.e., the slope for the feature is statistically significantly different from zero) when it&#039;s not in the population of emails.  Type 2: you think, based on the sample, that a feature is not predictive when in the population it is.

I guess the notion of population is quite tricky for emails, though!  Spam detection is a really nice example.

Actually, has anyone looked at what features tend to predict whether an email is spam, or is it all kept secret and/or hidden in the CPTs of a Bayesian classifier somewhere?</description>
		<content:encoded><![CDATA[<p>What spam filters do could be viewed in terms of, say, a logistic regression model predicting the probability that an email is spam as a function of a bunch of features, e.g., presence of a particular word.  Then, you&#8217;ve got a slope for each feature, estimated using seen emails (the sample).  Null hypothesis: slope = 0, for each slope.</p>
<p>Type S error would be inferring that a feature is indicative of spam when it&#8217;s indicative of a safe email or vice versa.  Type M error would be, e.g., inferring that the presence of a particular feature is more likely to indicate spam than it really is in the broader population of emails.</p>
<p>Back to the oldies: a type 1 error would be inferring, based on sample emails, that a feature is predictive of spam (i.e., the slope for the feature is statistically significantly different from zero) when it&#8217;s not in the population of emails.  Type 2: you think, based on the sample, that a feature is not predictive when in the population it is.</p>
<p>I guess the notion of population is quite tricky for emails, though!  Spam detection is a really nice example.</p>
<p>Actually, has anyone looked at what features tend to predict whether an email is spam, or is it all kept secret and/or hidden in the CPTs of a Bayesian classifier somewhere?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76296</link>
		<dc:creator>John</dc:creator>
		<pubDate>Fri, 15 Apr 2011 21:28:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76296</guid>
		<description>Andrew does seem to assume all null hypotheses are point hypotheses. I suppose he does this because so often that is the case in practice, even though in theory a null hypothesis could be any arbitrary subset of the parameter space.</description>
		<content:encoded><![CDATA[<p>Andrew does seem to assume all null hypotheses are point hypotheses. I suppose he does this because so often that is the case in practice, even though in theory a null hypothesis could be any arbitrary subset of the parameter space.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roman Cheplyaka</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76293</link>
		<dc:creator>Roman Cheplyaka</dc:creator>
		<pubDate>Fri, 15 Apr 2011 21:20:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76293</guid>
		<description>&lt;q&gt;A spam filter does not have a point null hypothesis.&lt;/q&gt;
Exactly. I was under the (probably wrong) impression that you or Andrew Gelman were arguing for somehow replacing all type 1/type 2 errors with type S/type M errors.</description>
		<content:encoded><![CDATA[<p><q>A spam filter does not have a point null hypothesis.</q><br />
Exactly. I was under the (probably wrong) impression that you or Andrew Gelman were arguing for somehow replacing all type 1/type 2 errors with type S/type M errors.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76290</link>
		<dc:creator>John</dc:creator>
		<pubDate>Fri, 15 Apr 2011 21:13:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76290</guid>
		<description>A spam filter does not have a point null hypothesis. Type-S error is relevant if you think of a spamminess scale with 0 being neutral and increasing values corresponding to more offensive spam.</description>
		<content:encoded><![CDATA[<p>A spam filter does not have a point null hypothesis. Type-S error is relevant if you think of a spamminess scale with 0 being neutral and increasing values corresponding to more offensive spam.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roman Cheplyaka</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76285</link>
		<dc:creator>Roman Cheplyaka</dc:creator>
		<pubDate>Fri, 15 Apr 2011 20:44:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76285</guid>
		<description>Often Type 1/Type 2 errors (or &quot;false positive&quot;/&quot;false negative&quot;) make more sense. E.g. how would you formulate spam detection errors in terms of &quot;Type M&quot; and &quot;Type S&quot;?</description>
		<content:encoded><![CDATA[<p>Often Type 1/Type 2 errors (or &#8220;false positive&#8221;/&#8220;false negative&#8221;) make more sense. E.g. how would you formulate spam detection errors in terms of &#8220;Type M&#8221; and &#8220;Type S&#8221;?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76268</link>
		<dc:creator>John</dc:creator>
		<pubDate>Fri, 15 Apr 2011 18:39:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76268</guid>
		<description>Thanks. I updated the link.</description>
		<content:encoded><![CDATA[<p>Thanks. I updated the link.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mathew Woodyard</title>
		<link>http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/comment-page-1/#comment-76265</link>
		<dc:creator>Mathew Woodyard</dc:creator>
		<pubDate>Fri, 15 Apr 2011 18:29:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/04/21/four-types-of-errors/#comment-76265</guid>
		<description>Great post, as always, but I noticed that your link to Gelman&#039;s presentation is broken. It looks like it has moved here: http://www.stat.columbia.edu/~gelman/presentations/multiple_minitalk2.pdf</description>
		<content:encoded><![CDATA[<p>Great post, as always, but I noticed that your link to Gelman&#8217;s presentation is broken. It looks like it has moved here: <a href="http://www.stat.columbia.edu/~gelman/presentations/multiple_minitalk2.pdf" rel="nofollow">http://www.stat.columbia.edu/~gelman/presentations/multiple_minitalk2.pdf</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>
