<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Endeavour &#187; Clinical trials</title>
	<atom:link href="http://www.johndcook.com/blog/category/clinical-trials/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.johndcook.com/blog</link>
	<description>The blog of John D. Cook</description>
	<lastBuildDate>Fri, 10 Feb 2012 23:03:26 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Works well versus well understood</title>
		<link>http://www.johndcook.com/blog/2011/05/10/well-understood/</link>
		<comments>http://www.johndcook.com/blog/2011/05/10/well-understood/#comments</comments>
		<pubDate>Tue, 10 May 2011 18:19:48 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=8487</guid>
		<description><![CDATA[While I was looking up the Tukey quote in my earlier post, I ran another of his quotes:
The test of a good procedure is how well it works, not how well it is understood.
At some level, it&#8217;s hard to argue against this. Statistical procedures operate on empirical data, so it makes sense that the procedures [...]]]></description>
			<content:encoded><![CDATA[<p>While I was looking up the Tukey quote in my <a href="http://www.johndcook.com/blog/2011/05/09/does-the-answer-matter/">earlier post</a>, I ran another of his quotes:</p>
<blockquote><p>The test of a good procedure is how well it works, not how well it is understood.</p></blockquote>
<p>At some level, it&#8217;s hard to argue against this. Statistical procedures operate on empirical data, so it makes sense that the procedures themselves be evaluated empirically.</p>
<p>But I question whether we really know that a statistical procedure works well if it isn&#8217;t well understood. Specifically, I&#8217;m skeptical of complex statistical methods whose only credentials are a handful of simulations. &#8220;We don&#8217;t have any theoretical results, buy hey, it works well in practice. Just look at the simulations.&#8221;</p>
<p><strong>Every method works well on the scenarios its author publishes</strong>, almost by definition. If the method didn&#8217;t handle a scenario well, the author would publish a different scenario. Even if the author didn&#8217;t select the most flattering scenarios, he or she may simply not have considered unflattering scenarios. The latter is particularly understandable, almost inevitable.</p>
<p>Simulation results would have more credibility if an adversary rather than an advocate chose the scenarios. Even so, an adversary and an advocate may share the same blind spots and not explore certain situations. Unless there&#8217;s a way to argue that a set of scenarios adequately samples the space of possible inputs, it&#8217;s hard to have a great deal of confidence in a method based on simulation results alone.</p>
<p><strong>Related posts</strong>:</p>
<p><a href="http://www.johndcook.com/blog/2010/10/19/buggy-simulation-code-is-biased/">Buggy code is biased code</a><br />
<a href="http://www.johndcook.com/blog/2010/01/12/software-sins-of-omission/">Software sins of omission</a><br />
<a href="http://www.johndcook.com/blog/2011/01/12/occams-razor-bayes-theorem/">Occam&#8217;s razor and Bayes&#8217; theorem</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2011/05/10/well-understood/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>A couple preprints</title>
		<link>http://www.johndcook.com/blog/2011/01/20/a-couple-preprints/</link>
		<comments>http://www.johndcook.com/blog/2011/01/20/a-couple-preprints/#comments</comments>
		<pubDate>Thu, 20 Jan 2011 14:47:51 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Biostatistics]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=7626</guid>
		<description><![CDATA[Here are a couple new preprints.
Block-adaptive randomization.
A proposed method for limiting the size of runs in a response-adaptive clinical trial.
Skeptical and optimistic robust priors for clinical trials.
Joint work with Jairo Fúquene and Luis Pericchi from University of Puerto Rico.
]]></description>
			<content:encoded><![CDATA[<p>Here are a couple new preprints.</p>
<p><a href="http://www.bepress.com/mdandersonbiostat/paper63/">Block-adaptive randomization</a>.<br />
A proposed method for limiting the size of runs in a response-adaptive clinical trial.</p>
<p><a href="http://www.johndcook.com/SkepticalOptimistic.pdf">Skeptical and optimistic robust priors for clinical trials</a>.<br />
Joint work with Jairo Fúquene and Luis Pericchi from University of Puerto Rico.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2011/01/20/a-couple-preprints/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Why drugs often list headache as a side-effect</title>
		<link>http://www.johndcook.com/blog/2010/09/07/why-headache-side-effect/</link>
		<comments>http://www.johndcook.com/blog/2010/09/07/why-headache-side-effect/#comments</comments>
		<pubDate>Tue, 07 Sep 2010 13:51:04 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=6370</guid>
		<description><![CDATA[In an interview on Biotech Nation, Gary Cupit made an offhand remark about why so many drugs list headache as a possible side-effect: clinical trial participants are often asked to abstain from coffee during the trial. That also explains why those who receive placebo often complain of headache as well.
Cupit&#8217;s company Somnus Therapeutics makes a [...]]]></description>
			<content:encoded><![CDATA[<p>In an <a href="http://itc.conversationsnetwork.org/shows/detail4657.html">interview</a> on Biotech Nation, Gary Cupit made an offhand remark about why so many drugs list headache as a possible side-effect: <strong>clinical trial participants are often asked to abstain from coffee</strong> during the trial. That also explains why those who receive placebo often complain of headache as well.</p>
<p>Cupit&#8217;s company Somnus Therapeutics makes a sleep medication for people who have no trouble <em>going</em> to sleep but do have trouble <em>staying</em> asleep. The medication has a timed-release so that it is active only in the middle of the night when needed. One of the criteria by which the drug is evaluated is whether there is a lingering effect the next morning. Obviously researchers would like to eliminate coffee consumption as a confounding variable. But this contributes to the litany of side-effects that announcers must mumble in television commercials.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2010/09/07/why-headache-side-effect/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Subtle variation on gaining weight to become taller</title>
		<link>http://www.johndcook.com/blog/2010/09/05/subtle-variation-on-gaining-weight-to-become-taller/</link>
		<comments>http://www.johndcook.com/blog/2010/09/05/subtle-variation-on-gaining-weight-to-become-taller/#comments</comments>
		<pubDate>Sun, 05 Sep 2010 21:46:50 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Biostatistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=6273</guid>
		<description><![CDATA[Back in March I wrote a blog post asking whether gaining weight makes you taller. Weight and height are clearly associated, and from that data alone one might speculate that gaining weight could make you taller. Of course causation is in the other direction: becoming taller generally makes you gain weight.
In the 1980&#8217;s, cardiologists discovered [...]]]></description>
			<content:encoded><![CDATA[<p>Back in March I wrote a blog post asking whether <a href="http://www.johndcook.com/blog/2010/03/12/does-gaining-weight-make-you-taller/">gaining weight makes you taller</a>. Weight and height are clearly associated, and from that data alone one might speculate that gaining weight could make you taller. Of course causation is in the other direction: becoming taller generally makes you gain weight.</p>
<p>In the 1980&#8217;s, cardiologists discovered that patients with irregular heart beats for the first 12 days following a heart attack were much more likely to die. Antiarrythmic drugs became standard therapy. But in the next decade cardiologist discovered this was a bad idea. According to Philip Devereaux, &#8220;The trial didn&#8217; t just show that the drugs weren&#8217;t saving lives, it showed they were actually killing people.&#8221;</p>
<p>David Freedman relates the story above in his book <a href="http://www.amazon.com/gp/product/0316023787?ie=UTF8&amp;tag=theende-20&amp;linkCode=as2&amp;camp=1789&amp;creative=390957&amp;creativeASIN=0316023787">Wrong</a>. Freedman says</p>
<blockquote><p>In fact, notes Devereaux, the drugs killed more Americans than the Vietnam War did — roughly an average of forty thousand a year died from the drugs in the United States alone.</p></blockquote>
<p>Cardiologists had good reason to suspect that antiarrythmic drugs would save lives. In retrospect, it may be that heart-attack patients with poor prognosis have arrhythmia rather than arrhythmia causing poor prognosis. Or it may be that the association is more complicated than either explanation.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2010/09/05/subtle-variation-on-gaining-weight-to-become-taller/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>I promise I&#8217;m not trying to learn anything</title>
		<link>http://www.johndcook.com/blog/2010/03/29/i-promise-im-not-trying-to-learn-anything/</link>
		<comments>http://www.johndcook.com/blog/2010/03/29/i-promise-im-not-trying-to-learn-anything/#comments</comments>
		<pubDate>Mon, 29 Mar 2010 15:44:08 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=4802</guid>
		<description><![CDATA[Medical experiments come under greater scrutiny than ordinary medical practice. There are good reasons for such precautions, but this leads to a sort of paradox. As Frederick Mosteller observed
We have a strange double standard now. As long as a physician treats a patient intending to cure, the treatment is admissible. When the object is to [...]]]></description>
			<content:encoded><![CDATA[<p>Medical experiments come under greater scrutiny than ordinary medical practice. There are good reasons for such precautions, but this leads to a sort of paradox. As <a href="http://www.amazon.com/gp/product/0387779558?ie=UTF8&amp;tag=theende-20&amp;linkCode=as2&amp;camp=1789&amp;creative=390957&amp;creativeASIN=0387779558">Frederick Mosteller</a> observed</p>
<blockquote><p>We have a strange double standard now. As long as a physician treats a patient intending to cure, the treatment is admissible. When the object is to find out whether the treatment has value, the physician is immediately subject to many constraints.</p></blockquote>
<p>If a physician has two treatment options, A and B, he can assign either treatment as long as he believes that one is best. But if he admits that he doesn&#8217;t know which is better and says he wants to treat some patients each way in order to get a better idea how they compare, then he has to propose a study and go through a long review processes.</p>
<p>I agree with Mosteller that we have a strange double  standard, that a doctor is free to do what he wants as long as he doesn&#8217;t try to learn anything. On the other hand, review boards reduce the chances that patients will be asked to participate in ill-conceived experiments by looking for possible conflicts of interest, weaknesses in statistical design, etc. And such precautions are more necessary in experimental medicine than in more routine medicine. Still, there is more uncertainty in medicine than we may like to admit, and the line between &#8220;experimental&#8221; and &#8220;routine&#8221; can be fuzzy.</p>
<p><strong>Related posts</strong>:</p>
<p><a href="http://www.johndcook.com/blog/2009/04/08/science-versus-medicine/">Science versus medicine</a><br />
<a href="http://www.johndcook.com/blog/2008/12/28/early-evidence-based-medicine/">Early evidence-based medicine</a><br />
<a href="http://www.johndcook.com/blog/2008/03/28/dose-finding-why-start-at-the-lowest-dose/">Dose-finding: why start at the lowest dose?</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2010/03/29/i-promise-im-not-trying-to-learn-anything/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Something like a random sequence but &#8230;</title>
		<link>http://www.johndcook.com/blog/2010/02/24/random-sequence/</link>
		<comments>http://www.johndcook.com/blog/2010/02/24/random-sequence/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 12:33:37 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=4655</guid>
		<description><![CDATA[When people ask for a random sequence, they&#8217;re often disappointed with what they get.
Random sequences clump more than most folks expect. For graphical applications, quasi-random sequence may be more appropriate.These sequences are &#8220;more random than random&#8221; in the sense that they behave more like what some folks expect from randomness. They jitter around like a [...]]]></description>
			<content:encoded><![CDATA[<p>When people ask for a random sequence, they&#8217;re often disappointed with what they get.</p>
<p>Random sequences clump more than most folks expect. For graphical applications, <a href="http://www.johndcook.com/blog/2009/03/16/quasi-random-sequences-in-art-and-integration/">quasi-random sequence</a> may be more appropriate.These sequences are &#8220;more random than random&#8221; in the sense that they behave more like what some folks expect from randomness. They jitter around like a random sequence, but they don&#8217;t clump as much.</p>
<p>Researchers conducting clinical trials are dismayed when a randomized trial puts several patients in a row on the same treatment. They want to assign patients one at a time to one of two treatments with equal probability, but they also want the allocation to work out evenly. This is like saying you want to flip a coin 100 times, and you also want to get exactly 50 heads and 50 tails. You can&#8217;t guarantee both, but there are effective compromises.</p>
<p>One approach is to randomize in blocks. For example, you could randomize in blocks of 10 patients by taking a sequence of 5 A&#8217;s and 5 B&#8217;s and randomly permuting the 10 letters. This guarantees that the allocations will be balanced, but some outcomes will be predictable. At a minimum, the last assignment in each block is always predictable: you assign whatever is left. Assignments could be even more predictable: if you give n A&#8217;s in a row in a block of 2n, you know the last n assignments will be all B&#8217;s.</p>
<p>Another approach is to &#8220;encourage&#8221; balance rather than enforce it. When you&#8217;ve given more A&#8217;s than B&#8217;s you could increase the probability of assigning a B. The greater the imbalance, the more heavily you bias the randomization probability in favor of the treatment that has been assigned less. This is a sort of compromise between equal randomization and block randomization. All assignments are random, though some assignments may be more predictable than others. Large imbalances are less likely than with equal randomization, but more likely than with block randomization. You can tune how aggressively the method responds to imbalances in order to make the method more like equal randomization or more like block randomization.</p>
<p>No approach to randomization will satisfy everyone because there are conflicting requirements. Randomization is a <a href="http://www.johndcook.com/blog/2008/04/22/problems-versus-dilemmas/">dilemma</a> to be managed rather than a problem to be solved.</p>
<p><strong>Related posts</strong>:</p>
<p><a href="http://www.johndcook.com/blog/2009/03/16/quasi-random-sequences-in-art-and-integration/">Quasi-random sequences in art and integration</a><br />
<a href="http://www.johndcook.com/blog/2008/07/22/three-ways-of-tuning-an-adaptively-randomized-trial/">Three ways of tuning an adaptively randomized trial</a><br />
<a href="http://www.johndcook.com/blog/2008/02/01/population-drift/">Population drift</a><br />
<a href="http://www.johndcook.com/blog/2008/04/15/galen-and-clinical-trials/">Galen and clinical trials</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2010/02/24/random-sequence/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Malaria on the prairie</title>
		<link>http://www.johndcook.com/blog/2010/02/09/malaria-on-the-prairie/</link>
		<comments>http://www.johndcook.com/blog/2010/02/09/malaria-on-the-prairie/#comments</comments>
		<pubDate>Tue, 09 Feb 2010 17:17:45 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[Books]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=4508</guid>
		<description><![CDATA[My family loves the Little House on the Prairie books. We read them aloud to our three oldest children and we&#8217;re in the process of reading them with our fourth child. We just read the chapter describing when the entire Ingalls family came down with malaria, or &#8220;fever &#8216;n&#8217; ague&#8221; as they called it.
The family [...]]]></description>
			<content:encoded><![CDATA[<p>My family loves the <a href="http://www.amazon.com/gp/product/B002IT19WY?ie=UTF8&amp;tag=theende-20&amp;linkCode=xm2&amp;camp=1789&amp;creativeASIN=B002IT19WY">Little House on the Prairie</a> books. We read them aloud to our three oldest children and we&#8217;re in the process of reading them with our fourth child. We just read the chapter describing when the entire Ingalls family came down with malaria, or &#8220;fever &#8216;n&#8217; ague&#8221; as they called it.</p>
<p>The family had settled near a creek that was infested with mosquitoes. All the settlers around the creek bottoms came down with malaria, though at the time (circa 1870) they did not know the disease was transmitted by mosquitoes. One of the settlers, Mrs. Scott, believed that malaria was caused by eating the watermelons that grew in the creek bottoms. She had empirical evidence: everyone who had eaten the melons contracted malaria. Charles Ingalls thought that was ridiculous. After he recovered from his attack of malaria, he went down to the creek and brought back a huge watermelon and ate it. His reasoning was that &#8220;Everybody knows that fever &#8216;n&#8217; ague comes from breathing the night air.&#8221;</p>
<p>It&#8217;s easy to laugh at Mrs. Scott and Mr. Ingalls. What ignorant, superstitious people. But they were no more ignorant than their contemporaries, and both had good reasons for their beliefs. Mrs. Scott had observational data on her side. Ingalls was relying on the accepted wisdom of his day. (After all, &#8220;malaria&#8221; means &#8220;bad air.&#8221;)</p>
<p>People used to believe all kinds of things that are absurd now, particularly in regard to medicine. But they were also right about many things that are hard to enumerate now because we take them for granted. Stories of conventional wisdom being correct are not interesting, unless there was some challenge to that wisdom. The easiest examples of folk wisdom to recall may be the instances in which science initially contradicted folk wisdom but later confirmed it. For example, we have come back to believing that breast milk is best for babies and that a moderate amount of sunshine is good for you.</p>
<p><strong>Related posts</strong>:</p>
<p><a href="http://www.johndcook.com/blog/2009/11/04/little-coffee-on-the-prairie/">A little coffee on the prairie</a><br />
<a href="http://www.johndcook.com/blog/2008/04/15/galen-and-clinical-trials/">Galen and clinical trials</a><br />
<a href="http://www.johndcook.com/blog/2008/04/01/randomized-trials-of-parachute-use/">Randomized trials of parachute use</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2010/02/09/malaria-on-the-prairie/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Biostatistics software</title>
		<link>http://www.johndcook.com/blog/2010/01/13/biostatistics-software/</link>
		<comments>http://www.johndcook.com/blog/2010/01/13/biostatistics-software/#comments</comments>
		<pubDate>Wed, 13 Jan 2010 17:54:09 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=4227</guid>
		<description><![CDATA[The M. D. Anderson Cancer Center Department of Biostatistics has a software download site listing software developed by the department over many years.
The home page of the download site allows you to see all products sorted by date or by name. This page also allows search. A new page lets you see the software organized [...]]]></description>
			<content:encoded><![CDATA[<p>The M. D. Anderson Cancer Center Department of Biostatistics has a software download site listing software developed by the department over many years.</p>
<p>The <a href="http://biostatistics.mdanderson.org/SoftwareDownload/Default.aspx">home page</a> of the download site allows you to see all products sorted by date or by name. This page also allows search. A new page lets you see the software organized by <a href="http://biostatistics.mdanderson.org/SoftwareDownload/SiteAux/tags.html">tags</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2010/01/13/biostatistics-software/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Managing biological data</title>
		<link>http://www.johndcook.com/blog/2009/12/14/managing-biological-data/</link>
		<comments>http://www.johndcook.com/blog/2009/12/14/managing-biological-data/#comments</comments>
		<pubDate>Mon, 14 Dec 2009 20:12:11 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Biostatistics]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3937</guid>
		<description><![CDATA[Jon Udell&#8217;s latest Interviews with Innovators podcast features Randall Julian of Indigo BioSystems. I found this episode particularly interesting because it deals with issues I have some experience with.
The problems in managing biological data begin with how to store the raw experiment data. As Julian says
&#8230; without buying into all the hype around semantic web [...]]]></description>
			<content:encoded><![CDATA[<p>Jon Udell&#8217;s <a href="http://itc.conversationsnetwork.org/shows/detail4329.html">latest</a> Interviews with Innovators podcast features Randall Julian of <a href="http://www.indigobio.com/">Indigo BioSystems</a>. I found this episode particularly interesting because it deals with issues I have some experience with.</p>
<p>The problems in managing biological data begin with how to store the raw experiment data. As Julian says</p>
<blockquote><p>&#8230; without buying into all the hype around semantic web and so on, you would argue that a flexible schema makes more sense in a knowledge gathering or knowledge generation context than a fixed schema does.</p></blockquote>
<p>So you need something less rigid than a relational database and something with more structure than a set of Excel spreadsheets. That&#8217;s not easy, and I don&#8217;t know whether anyone has come up with an optimal solution yet. Julian said that he has seen many attempts to put vast amounts of biological data into a rigid relational database schema but hasn&#8217;t seen this approach succeed yet. My experience has been similar.</p>
<p>Representing raw experimental data isn&#8217;t enough. In fact, that&#8217;s the easy part. As Jon Udell comments during the interview</p>
<blockquote><p>It&#8217;s easy to represent data. It&#8217;s hard to represent the experiment.</p></blockquote>
<p>That is, the data must come with ample context to make sense of the data. Julian comments that without this context, the data may as well be a list of zip codes. And not only must you capture experimental context, you must describe the analysis done to the data. (See, for example, this post about researchers <a href="http://www.johndcook.com/blog/2009/09/18/make-up-your-own-rules-of-probability/">making up their own rules of probability</a>.)</p>
<p>Julian comments on how electronic data management is not nearly as common as someone unfamiliar with medical informatics might expect.</p>
<blockquote><p>So right now maybe 50% of the clinical trials in the world are done using electronic data capture technology. &#8230; that&#8217;s the thing that maybe people don&#8217;t understand about health care and the life sciences in general is that there is still a huge amount of paper out there.</p></blockquote>
<p>Part of the reason for so much paper goes back to the belief that one must choose between highly normalized relational data stores and unstructured files. Given a choice between inflexible bureaucracy and chaos, many people choose chaos. It may work about as well, and it&#8217;s much cheaper to implement. I&#8217;ve seen both extremes. I&#8217;ve also been part of a project using a flexible but structured approach that worked quite well.</p>
<p><strong>Related posts</strong>:</p>
<p><a href="http://www.johndcook.com/blog/2009/05/05/reproducible-ideas/">Posts on reproducibility</a><br />
<a href="http://www.johndcook.com/blog/2008/04/22/problems-versus-dilemmas/">Problems versus dilemmas</a><br />
<a href="http://www.johndcook.com/blog/2009/05/05/blogging-about-reproducible-research/">Blogging about reproducible research</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2009/12/14/managing-biological-data/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>A case for robust Bayesian priors</title>
		<link>http://www.johndcook.com/blog/2009/11/30/robust-bayesian-priors/</link>
		<comments>http://www.johndcook.com/blog/2009/11/30/robust-bayesian-priors/#comments</comments>
		<pubDate>Mon, 30 Nov 2009 15:44:07 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3821</guid>
		<description><![CDATA[A paper I wrote with Jairo Fúquene and Luis Pericchi is now available online.
A Case for Robust Bayesian Priors with Applications to Clinical Trials
Jairo Fúquene, John Cook, and Luis Pericchi
Bayesian Analysis (2009) 4, Number 4, pp. 817–846.
]]></description>
			<content:encoded><![CDATA[<p>A paper I wrote with Jairo Fúquene and Luis Pericchi is now available online.</p>
<p><a href="http://ba.stat.cmu.edu/journal/2009/vol04/issue04/fuquene.pdf">A Case for Robust Bayesian Priors with Applications to Clinical Trials</a><br />
Jairo Fúquene, John Cook, and Luis Pericchi<br />
Bayesian Analysis (2009) 4, Number 4, pp. 817–846.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2009/11/30/robust-bayesian-priors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bayesian clinical trials in one zip code</title>
		<link>http://www.johndcook.com/blog/2009/10/27/bayesian-clinical-trials-zip-code/</link>
		<comments>http://www.johndcook.com/blog/2009/10/27/bayesian-clinical-trials-zip-code/#comments</comments>
		<pubDate>Tue, 27 Oct 2009 11:06:05 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Biostatistics]]></category>
		<category><![CDATA[Cancer]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=3309</guid>
		<description><![CDATA[I recently ran across this quote from Mithat Gönen of Memorial Sloan-Kettering Cancer Center:
While there are certainly some at other centers, the bulk of applied Bayesian clinical trial design in this country is largely confined to a single zip code.
from &#8220;Bayesian clinical trials: no more excuses,&#8221; Clinical Trials 2009; 6; 203.
The zip code Gönen alludes [...]]]></description>
			<content:encoded><![CDATA[<p>I recently ran across this quote from Mithat Gönen of Memorial Sloan-Kettering Cancer Center:</p>
<blockquote><p>While there are certainly some at other centers, the bulk of applied Bayesian clinical trial design in this country is largely confined to a single zip code.</p></blockquote>
<p>from &#8220;Bayesian clinical trials: no more excuses,&#8221; <em>Clinical Trial</em>s 2009; 6; 203.</p>
<p>The zip code Gönen alludes to is 77030, the zip code of M. D. Anderson Cancer Center. I can&#8217;t say how much activity there is elsewhere,  but certainly we design and conduct a lot of Bayesian clinical trials at MDACC.</p>
<p><strong>Related posts</strong>:</p>
<p><a href="http://www.johndcook.com/blog/2009/04/23/cartoon-about-my-job/">Cartoon guide to cancer research</a><br />
<a href="http://www.johndcook.com/blog/2008/08/26/stopping-trials-of-ineffective-drugs-earlier/">Stopping trials of ineffective drugs sooner</a><br />
<a href="http://www.johndcook.com/blog/2008/07/22/three-ways-of-tuning-an-adaptively-randomized-trial/">Three ways of tuning an adaptively randomized clinical trial</a><br />
<a href="http://www.johndcook.com/blog/2008/02/01/population-drift/">Population drift</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2009/10/27/bayesian-clinical-trials-zip-code/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Off to Puerto Rico</title>
		<link>http://www.johndcook.com/blog/2009/05/24/off-to-puerto-rico/</link>
		<comments>http://www.johndcook.com/blog/2009/05/24/off-to-puerto-rico/#comments</comments>
		<pubDate>Sun, 24 May 2009 13:38:57 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=2362</guid>
		<description><![CDATA[I&#8217;m leaving today for San Juan. I&#8217;m giving a couple talks at a conference on clinical trials.
Puerto Rico is beautiful. (I want to say a &#8220;lovely island,&#8221; but then the song America from West Side Story gets stuck in my head.) Here are a couple photos from my last visit.


]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m leaving today for San Juan. I&#8217;m giving a couple talks at a conference on clinical trials.</p>
<p>Puerto Rico is beautiful. (I want to say a &#8220;lovely island,&#8221; but then the song <a href="http://www.youtube.com/watch?v=26P1B9FTuBk">America</a> from West Side Story gets stuck in my head.) Here are a couple photos from my last visit.</p>
<p style="text-align:center"><img src="http://www.johndcook.com/osj.jpg" alt="" width="384" height="233" /></p>
<p style="text-align:center"><img src="http://www.johndcook.com/prtree.jpg" alt="" width="384" height="267" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2009/05/24/off-to-puerto-rico/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>R package for robust priors</title>
		<link>http://www.johndcook.com/blog/2009/05/11/r-package-for-robust-priors/</link>
		<comments>http://www.johndcook.com/blog/2009/05/11/r-package-for-robust-priors/#comments</comments>
		<pubDate>Tue, 12 May 2009 02:00:21 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Rstat]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=2267</guid>
		<description><![CDATA[Jairo Fuquene has released an R package on CRAN to accompany our paper
A Case for Robust Bayesian priors with Applications to Binary Clinical Trials
Jairo A. Fuquene P., John D. Cook, Luis Raul Pericchi
]]></description>
			<content:encoded><![CDATA[<p>Jairo Fuquene has released an <a href="http://cran.r-project.org/web/packages/ClinicalRobustPriors/index.html">R package</a> on CRAN to accompany our paper</p>
<p><a href="http://www.bepress.com/mdandersonbiostat/paper44/">A Case for Robust Bayesian priors with Applications to Binary Clinical Trials</a><br />
Jairo A. Fuquene P., John D. Cook, Luis Raul Pericchi</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2009/05/11/r-package-for-robust-priors/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Science versus medicine</title>
		<link>http://www.johndcook.com/blog/2009/04/08/science-versus-medicine/</link>
		<comments>http://www.johndcook.com/blog/2009/04/08/science-versus-medicine/#comments</comments>
		<pubDate>Thu, 09 Apr 2009 02:29:56 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Biostatistics]]></category>
		<category><![CDATA[Cancer]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=1954</guid>
		<description><![CDATA[Before I started working for a cancer center, I was not aware of the tension between science and medicine. Popular perception is that the two go together hand and glove, but that&#8217;s not always true.
Physicians are trained to use their subjective judgment and to be decisive. And for good reason: making a fairly good decision [...]]]></description>
			<content:encoded><![CDATA[<p>Before I started working for a cancer center, I was not aware of the tension between science and medicine. Popular perception is that the two go together hand and glove, but that&#8217;s not always true.</p>
<p>Physicians are trained to use their subjective judgment and to be decisive. And for good reason: making a fairly good decision quickly is often better than making the best decision eventually. But scientists must be tentative, withhold judgment, and follow protocols.</p>
<p>Sometimes physician-scientists can reconcile their two roles, but sometimes they have to choose to wear one hat or the other at different times.</p>
<p>The physician-scientist tension is just one facet of the constant tension between treating each patient effectively and learning how to treat future patients more effectively. Sometimes the interests of current patients and future patients coincide completely, but not always.</p>
<p>This ethical tension is part of what makes biostatistics a separate field of statistics. In manufacturing, for example, you don&#8217;t need to balance the interests of current light bulbs and future light bulbs. If you need to destroy 1,000 light bulbs to find out how to make better bulbs in the future, no big deal. But different rules apply when experimenting on people. Clinical trials will often use statistical designs that sacrifice some statistical power in order to protect the people participating in the trial. Ethical constraints make biostatistics interesting.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2009/04/08/science-versus-medicine/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Probability that a study result is true</title>
		<link>http://www.johndcook.com/blog/2008/11/24/probability-that-a-study-result-is-true/</link>
		<comments>http://www.johndcook.com/blog/2008/11/24/probability-that-a-study-result-is-true/#comments</comments>
		<pubDate>Mon, 24 Nov 2008 16:42:22 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=948</guid>
		<description><![CDATA[Suppose a new study comes out saying a drug or a food or a habit lowers your risk of some disease. What is the probability that the study&#8217;s result is correct? Obviously this is a very important question, but one that is not raised often enough.

I&#8217;ve referred to a paper by John Ioannidis (*) several [...]]]></description>
			<content:encoded><![CDATA[<p>Suppose a new study comes out saying a drug or a food or a habit lowers your risk of some disease. What is the probability that the study&#8217;s result is correct? Obviously this is a very important question, but one that is not raised often enough.</p>
<p><span id="more-948"></span></p>
<p>I&#8217;ve referred to a paper by John Ioannidis (*) several times before, but I haven&#8217;t gone over the model he uses to support his claim that most study results are false. This post will look at some equations he derives for estimating the probability that a claimed positive result is correct.</p>
<p>First of all, let R be the ratio of positive findings to negative findings being investigated in a particular area. Of course we never know exactly what R is, but let&#8217;s pretend that somehow we knew that out of 1000 hypotheses being investigated in some area, 200 are correct. Then R would be 200/800 = 0.25. The value of R varies quite a bit, being relatively large in some fields of study and quite small in others. Imagine researchers pulling hypotheses to investigate from a hat. The probability of selecting a hypothesis that really is true would be R/(R+1) and the probability selecting a false hypothesis is 1/(R+1).</p>
<p>Let α be the probability of incorrectly declaring a false hypothesis to be true. Studies are often designed with the goal that α would be 0.05. Let β be the probability that a study would incorrectly conclude that that a true hypothesis is false. In practice, β is far more variable than α. You might find study designs with β anywhere from 0.5 down to 0.01. The design choice β = 0.20 is common in some contexts.</p>
<p>There are two ways to publish a study claiming a new result: you could have selected a true hypothesis and correctly concluded that it was true, or you could have selected a false but incorrectly concluded it was true. The former has probability (1-β)R/(R+1) and the latter has probability α/(R+1). The total probability of concluding a hypothesis is true, correctly or incorrectly, is the sum of these probabilities, i.e. ((1-β)R + α)/(R+1). The probability that a study conclusion <em>is</em> true given that you <em>concluded</em> it was true, the positive predictive value or PPV, is the ratio of (1-β)R/(R+1) to ((1-β)R + α)/(R+1). In summary, under the assumptions above, the probability of a claimed result being true is (1-β)R/((1-β)R + α).</p>
<p>If (1 &#8211; β)R &lt; α then the model say that a claim is more likely to be false than true. This can happen if R is small, i.e. there are not a large proportion of true results under investigation, and if β is large, i.e. if studies are small. If R is smaller than α, most studies will be false no matter how small you make β, i.e. no matter how large the study. This says that in a challenging area, where few of the ideas being investigated lead to progress, there will be a large proportion of false results published, even if the individual researchers are honest and careful.</p>
<p>Ioannidis develops two other models refining the model above. Suppose that because of bias, some proportion of results that would otherwise have been reported as negative are reported as positive. Call this proportion u. The derivation of the positive predictive value is similar to that in the previous model, but messier. The final result is R(1-β + uβ)/(R(1-β + uβ) + α + u &#8211; αu). If 1 &#8211; β &gt; α, which is nearly always the case, then the probability of a reported result being correct decreases as bias increases.</p>
<p>The final model considers the impact of multiple investigators testing the same hypothesis. If more people try to prove the same thing, it&#8217;s more likely that someone will get lucky and &#8220;prove&#8221; it, whether or not the thing to be proven is true. Leaving aside bias, if n investigators are testing each hypothesis, the probability that a positive claim is true is given by R(1 &#8211; β<sup>n</sup>)/(R + 1 &#8211; (1 &#8211; α)<sup>n</sup> &#8211; Rβ<sup>n</sup>). As n increases, the probability of a positive claim being true decreases.</p>
<p>The probability of a result being true is often much lower than is commonly believed. One reason is that hypothesis testing focuses on the probability of the data given a hypothesis rather than the probability of a hypothesis given the data. Calculating the probability of a hypothesis given data relies on prior probabilities, such as the factors R/(R+1) and 1/(R+1) above. These prior probabilities are elusive and controversial, but they are critical in evaluating how likely it is that claimed results are true.</p>
<p>(*) John P. A. Ioannidis, Why most published research findings are false. CHANCE volume 18, number 4, 2005.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/11/24/probability-that-a-study-result-is-true/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Sometimes it&#8217;s right under your nose</title>
		<link>http://www.johndcook.com/blog/2008/10/07/sometimes-its-right-under-your-nose/</link>
		<comments>http://www.johndcook.com/blog/2008/10/07/sometimes-its-right-under-your-nose/#comments</comments>
		<pubDate>Wed, 08 Oct 2008 02:00:41 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Creativity]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=502</guid>
		<description><![CDATA[Neptune was discovered in 1846. But Galileo&#8217;s notebooks describe a &#8220;star&#8221; he saw on 28 December 1612 and 2 January 1613 that we now know was Neptune. Galileo even noticed that his star was in a slightly different location for his two observations, but he chalked the difference up to observational error.
The men who discovered [...]]]></description>
			<content:encoded><![CDATA[<p>Neptune was discovered in 1846. But Galileo&#8217;s notebooks describe a &#8220;star&#8221; he saw on 28 December 1612 and 2 January 1613 that we now know was Neptune. Galileo even noticed that his star was in a slightly different location for his two observations, but he chalked the difference up to observational error.</p>
<p>The men who discovered Neptune were not the first to see it; they were the first to realize what they were looking at.</p>
<p><img class="aligncenter" src="http://upload.wikimedia.org/wikipedia/commons/thumb/0/06/Neptune.jpg/240px-Neptune.jpg" alt="Voyager 2 photo of Neptune via Wikipedia" width="240" height="236" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/10/07/sometimes-its-right-under-your-nose/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to pick simulation scenarios</title>
		<link>http://www.johndcook.com/blog/2008/10/06/how-to-pick-simulation-scenarios/</link>
		<comments>http://www.johndcook.com/blog/2008/10/06/how-to-pick-simulation-scenarios/#comments</comments>
		<pubDate>Mon, 06 Oct 2008 16:15:02 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=517</guid>
		<description><![CDATA[People new to simulation start by picking scenarios based on what they hope will happen. That&#8217;s OK, but it&#8217;s more important to pick scenarios that you expect are likely to happen or fear might happen.
]]></description>
			<content:encoded><![CDATA[<p>People new to simulation start by picking scenarios based on what they <em>hope</em> will happen. That&#8217;s OK, but it&#8217;s more important to pick scenarios that you <em>expect</em> are likely to happen or <em>fear</em> might happen.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/10/06/how-to-pick-simulation-scenarios/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Drug looks promising, come back in 30 years</title>
		<link>http://www.johndcook.com/blog/2008/09/07/drug-looks-promising-come-back-in-30-years/</link>
		<comments>http://www.johndcook.com/blog/2008/09/07/drug-looks-promising-come-back-in-30-years/#comments</comments>
		<pubDate>Sun, 07 Sep 2008 22:46:00 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=373</guid>
		<description><![CDATA[The most recent 60-Second Science podcast summarizes a paper in Science magazine reporting that the average interval between a drug being deemed &#8220;promising&#8221; and the first paper appearing showing clinical effectiveness is 24 years.
Note that the publication of a paper saying a drug is clinically effective is a far cry from regulatory approval. Many new [...]]]></description>
			<content:encoded><![CDATA[<p>The most recent 60-Second Science <a href="http://www.sciam.com/podcast/episode.cfm?id=2E12FC4C-ED3D-FC85-9FF7EEA4E28B0819">podcast</a> summarizes a paper in Science magazine reporting that the average interval between a drug being deemed &#8220;promising&#8221; and the first paper appearing showing clinical effectiveness is 24 years.</p>
<p>Note that the publication of a paper saying a drug is clinically effective is a far cry from regulatory approval. Many new drugs that look like an improvement after a phase II trial turn out to be no better than existing treatments, and those really are better take years to achieve regulatory approval.</p>
<p>See also <a href="../2008/02/08/false-positives-for-medical-papers/"></a></p>
<p><a href="http://www.johndcook.com/blog/2008/02/07/most-published-research-results-are-false/">False positives for medical papers<br />
Most published research results are false</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/09/07/drug-looks-promising-come-back-in-30-years/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Random inequalities VII: three or more variables</title>
		<link>http://www.johndcook.com/blog/2008/09/06/random-inequalities-vii-three-or-more-variables/</link>
		<comments>http://www.johndcook.com/blog/2008/09/06/random-inequalities-vii-three-or-more-variables/#comments</comments>
		<pubDate>Sun, 07 Sep 2008 03:33:39 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/?p=328</guid>
		<description><![CDATA[The previous posts in this series have looked at P(X &#62; Y), the probability that a sample from a random variable X is greater than a sample from an independent random variable Y. In applications, X and Y have different distributions but come from the same distribution family.
Sometimes applications require computing P(X &#62; max(Y, Z)). [...]]]></description>
			<content:encoded><![CDATA[<p>The previous posts in this series have looked at P(X &gt; Y), the probability that a sample from a random variable X is greater than a sample from an independent random variable Y. In applications, X and Y have different distributions but come from the same distribution family.</p>
<p>Sometimes applications require computing P(X &gt; max(Y, Z)). For example, an <a href="https://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=62">adaptively randomized trial</a> of three treatments may be designed to assign a treatment with probability equal to the probability that that treatment has the best response. In a trial with a binary outcome, the variables X, Y, and Z may be beta random variables representing the probability of response. In a trial with a time-to-event outcome, the variables might be gamma random variables representing survival time.</p>
<p>Sometimes we&#8217;re interested in the opposite inequality, P(X &lt; min(Y,Z)). This would be the case if we thought in terms of failures rather than responses, or wanted to minimize the time to a desirable event rather than maximizing the time to an undesirable event.</p>
<p>The maximum and minimum inequalities are related by the following equation:</p>
<p>P(X &lt; min(Y,Z)) = P(X &gt; max(Y, Z)) + 1 &#8211; P(X &gt; Y) &#8211; P(X &gt; Z).</p>
<p>These inequalities are used for safety monitoring rules as well as to determine randomization probabilities. In a trial seeking to maximize responses, a treatment arm X might be dropped if P(X &gt; max(Y,Z)) becomes too small.</p>
<p>In principle one could design an adaptively randomized trial with n treatment arms for any n ≥ 2 based on P(X<sub>1</sub> &gt; max(X<sub>2</sub>, &#8230;, X<sub>n</sub>)). In practice, the most common value of n by far is 2. Sometimes n is 3. I&#8217;m not familiar with an adaptively randomized trial with more than three arms. I&#8217;ve heard of an adaptively randomized trial that was designed with five arms, but I don&#8217;t believe the trial ran.</p>
<p>Computing P(X<sub>1</sub> &gt; max(X<sub>2</sub>, &#8230;, X<sub>n</sub>)) by numerical integration becomes more difficult as n increases. For large n, simulation may be more efficient than integration. Computing P(X<sub>1</sub> &gt; max(X<sub>2</sub>, &#8230;, X<sub>n</sub>)) for gamma random variables with n=3 was unacceptably slow in a previous version of our <a href="https://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=62">adaptive randomization software</a>. The search for a faster algorithm lead to this paper: <a href="http://www.bepress.com/mdandersonbiostat/paper30/">Numerical Evaluation of Gamma Inequalities</a>.</p>
<p>Previous posts on random inequalities:</p>
<p><a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/">Introduction</a><br />
<a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-ii-analytical-results/">Analytical results</a><br />
<a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-iii-numerical-results/">Numerical results</a><br />
<a href="http://www.johndcook.com/blog/2008/08/09/random-inequalities-iv-cauchy-distributions/">Cauchy distributions</a><br />
<a href="http://www.johndcook.com/blog/2008/08/21/random-inequalities-v-beta-distributions/">Beta distributions</a><br />
<a href="http://www.johndcook.com/blog/2008/08/30/random-inequalities-vi-gamma-distributions/">Gamma distributions</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/09/06/random-inequalities-vii-three-or-more-variables/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Random inequalities VI: gamma distributions</title>
		<link>http://www.johndcook.com/blog/2008/08/30/random-inequalities-vi-gamma-distributions/</link>
		<comments>http://www.johndcook.com/blog/2008/08/30/random-inequalities-vi-gamma-distributions/#comments</comments>
		<pubDate>Sat, 30 Aug 2008 11:55:54 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/08/30/random-inequalities-vi-gamma-distributions/</guid>
		<description><![CDATA[This post looks at computing P(X &#62; Y) where X and Y are gamma random variables. These inequalities are central to the Thall-Wooten method of monitoring single-arm clinical trials with time-to-event outcomes. They also are central to adaptively randomized clinical trials with time-to-event outcomes.
When X and Y are gamma random variables P(X &#62; Y) can [...]]]></description>
			<content:encoded><![CDATA[<p>This post looks at computing P(X &gt; Y) where X and Y are gamma random variables. These inequalities are central to the <a href="https://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=73">Thall-Wooten</a> method of monitoring single-arm clinical trials with time-to-event outcomes. They also are central to <a href="https://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=62">adaptively randomized clinical trials</a> with time-to-event outcomes.</p>
<p>When X and Y are gamma random variables P(X &gt; Y) can be computed in terms of the incomplete beta function. Suppose X has shape α<sub>X</sub> and scale β<sub>X</sub> and Y has shape α<sub>Y</sub> and scale β<sub>Y</sub>. Then β<sub>X</sub>Y/(β<sub>X</sub> Y+ β<sub>Y</sub>X) has a beta(α<sub>Y</sub>, α<sub>X</sub>) distribution. (This result is well-known in the special case of the scale parameters both equal to 1. I wrote up the more general result <a href="http://www.bepress.com/mdandersonbiostat/paper46/">here</a> but I don&#8217;t imagine I was the first to stumble on the generalization.) It follows that</p>
<blockquote><p>P(X &lt; Y) = P(B  &lt; β<sub>X</sub>/(β<sub>X</sub>+ β<sub>Y</sub>)</p></blockquote>
<p>where B is a beta(α<sub>Y</sub>, α<sub>X</sub>) random variable.</p>
<p>For more details, see <a href="http://www.bepress.com/mdandersonbiostat/paper30/">Numerical evaluation of gamma inequalities</a>.</p>
<p>Previous posts on random inequalities:</p>
<p><a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/">Introduction</a><br />
<a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-ii-analytical-results/">Analytical results</a><br />
<a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-iii-numerical-results/">Numerical results</a><br />
<a href="http://www.johndcook.com/blog/2008/08/09/random-inequalities-iv-cauchy-distributions/">Cauchy distributions</a><br />
<a href="http://www.johndcook.com/blog/2008/08/21/random-inequalities-v-beta-distributions/">Beta distributions</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/08/30/random-inequalities-vi-gamma-distributions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Stopping trials of ineffective drugs earlier</title>
		<link>http://www.johndcook.com/blog/2008/08/26/stopping-trials-of-ineffective-drugs-earlier/</link>
		<comments>http://www.johndcook.com/blog/2008/08/26/stopping-trials-of-ineffective-drugs-earlier/#comments</comments>
		<pubDate>Wed, 27 Aug 2008 00:39:43 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/08/26/stopping-trials-of-ineffective-drugs-earlier/</guid>
		<description><![CDATA[Valen Johnson and I recently posted a working paper on a method for stopping trials of ineffective drugs earlier. For Bayesians, we argue that our method is more consistently Bayesian than other methods in common use. For frequentists, we show that our method has better frequentist operating characteristics than the most commonly used safety monitoring [...]]]></description>
			<content:encoded><![CDATA[<p>Valen Johnson and I recently posted a working paper on a method for <a href="http://www.bepress.com/mdandersonbiostat/paper47/">stopping trials of ineffective drugs earlier</a>. For Bayesians, we argue that our method is more consistently Bayesian than other methods in common use. For frequentists, we show that our method has better frequentist operating characteristics than the most commonly used safety monitoring method.</p>
<p>The paper looks at binary and time-to-event trials. The results are most dramatic for the time-to-event analog of the <a href="https://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=3">Thall-Simon</a> method, the <a href="https://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=73">Thall-Wooten</a> method, as shown below.</p>
<p style="text-align: center"><img src="http://www.johndcook.com/bs_vs_thall_wooten.gif" alt="" /></p>
<p>This graph plots the probability of concluding that an experimental treatment is inferior when simulating from true mean survival times ranging from 2 to 12 months. The trial is designed to test a null hypothesis of 6 months mean survival against an alternative hypothesis of 8 months mean survival. When the true mean survival time is less than the alternative hypothesis of 8 months, the Bayes factor design is more likely to stop early. And when the true mean survival time is greater than the alternative hypothesis, the Bayes factor method is less likely to stop early.</p>
<p>The Bayes factor method also outperforms the Thall-Simon method for monitoring single-arm trials with binary outcomes. The Bayes factor method stops more often when it should and less often when it should not. However, the difference in operating characteristics is not as pronounced as in the time-to-event case.</p>
<p>The paper also compares the Bayes factor method to the frequentist mainstay, the Simon two-stage design. Because the Bayes factor method uses continuous monitoring, the method is able to use fewer patients while maintaining the type I and type II error rates of the Simon design as illustrated in the graph below.</p>
<p style="text-align: center"><img src="http://www.johndcook.com/bf_vs_simon.gif" alt="bayes factor vs simon two-stage designs " /></p>
<p>The graph above plots the number of patients used in a trial testing a null hypothesis of a 0.2 response rate against an alternative of a 0.4 response rate. Design 8 is the Bayes factor method advocated in the paper. Designs 7a and 7b are variations on the Simon two-stage design. The horizontal axis gives the true probabilities of response. We simulated true probabilities of response varying from 0 to 1 in increments of 0.05. The vertical axis gives the number of patients treated before the trial was stopped. When the true probability of response is less than the alternative hypothesis, the Bayes factor method treats fewer patients. When the true probability of response is better than the alternative hypothesis, the Bayes factor method treats slightly more patients.</p>
<p>Design 7a is the strict interpretation of the Simon method: one interim look at the data and another analysis at the end of the trial. Design 7b is the Simon method as implemented in practice, stopping when the criteria for continuing cannot be met at the next analysis. (For example, if the design says to stop if there are three or fewer responses out of the first 15 patients, then the method would stop after the 12th patient if there have been no responses.) In either case, the Bayes factor method uses fewer patients. The rejection probability curves, not shown here, show that the Bayes factor method matches (actually, slightly improves upon) the type I and type II error rates for the Simon two-stage design.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/08/26/stopping-trials-of-ineffective-drugs-earlier/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Random inequalities V: beta distributions</title>
		<link>http://www.johndcook.com/blog/2008/08/21/random-inequalities-v-beta-distributions/</link>
		<comments>http://www.johndcook.com/blog/2008/08/21/random-inequalities-v-beta-distributions/#comments</comments>
		<pubDate>Thu, 21 Aug 2008 11:51:56 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/08/21/random-inequalities-v-beta-distributions/</guid>
		<description><![CDATA[I&#8217;ve put a lot of effort into writing software for evaluating random inequality probabilities with beta distributions because such inequalities come up quite often in application. For example, beta inequalities are at the heart of the Thall-Simon method for monitoring single-arm trials and adaptively randomized trials with binary endpoints.
It&#8217;s not easy to evaluate P(X &#62; [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve put a lot of effort into writing software for evaluating random inequality probabilities with beta distributions because such inequalities come up quite often in application. For example, beta inequalities are at the heart of the <a href="http://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=12">Thall-Simon method</a> for monitoring single-arm trials and <a href="http://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=62">adaptively randomized trials</a> with binary endpoints.</p>
<p>It&#8217;s not easy to evaluate P(X &gt; Y) accurately and efficiently when X and Y are independent random variables. I&#8217;ve seen several attempts that were either inaccurate or slow, including a few attempts on my part. Efficiency is important because this calculation is often in the inner loop of a simulation study. Part of the difficulty is that the calculation depends on four parameters and no single algorithm will work well for all parameter combinations.</p>
<p>Let g(a, b, c, d) equal P(X &gt; Y) where X ~ beta(a, b) and Y ~ beta(c, d). Then the function g has several symmetries.</p>
<ul>
<li>g(a, b, c, d) = 1 &#8211; g(c, d, a, b)</li>
<li>g(a, b, c, d) = g(d, c, b, a)</li>
<li>g(a, b, c, d) = g(d, b, c, a)</li>
</ul>
<p>The first two relations were published by W. R. Thompson in 1933, but as far as I know the third relation first appeared in this <a href="http://www.bepress.com/mdandersonbiostat/paper46/">technical report</a> in 2003.</p>
<p>For special values of the parameters, the function g(a, b, c, d) can be computed in <a href="http://www.mdanderson.org/pdf/biostats_utmdabtr_005_05.pdf">closed form</a>. Some of these special cases are when</p>
<ul>
<li>one of the four parameters is an integer</li>
<li>a + b + c + d = 1</li>
<li>a + b = c + d = 1.</li>
</ul>
<p>The function g(a, b, c, d) also satisfies several recurrence relations that make it possible to bootstrap the latter two special cases into more results. Define the beta function B(a, b) as Γ(a, b)/(Γ(a) Γ(b)) and define h(a, b, c, d) as B(a+c, b+d)/( B(a, b) B(c, d) ). Then the following recurrence relations hold.</p>
<ul>
<li>g(a+1, b, c, d) = g(a, b, c, d) + h(a, b, c, d)/a</li>
<li>g(a, b+1, c, d) = g(a, b, c, d) &#8211; h(a, b, c, d)/b</li>
<li>g(a, b, c+1, d) = g(a, b, c, d) &#8211; h(a, b, c, d)/c</li>
<li>g(a, b, c, d+1) = g(a, b, c, d) + h(a, b, c, d)/d</li>
</ul>
<p>For more information about beta inequalities, see these papers:</p>
<p><a href="http://www.bepress.com/mdandersonbiostat/paper46/">Numerical computation of stochastic inequality probabilities</a><br />
<a href="http://www.mdanderson.org/pdf/biostats_utmdabtr_005_05.pdf">Exact calculation of beta inequalities</a></p>
<p>Previous posts on random inequalities: <a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/"></a></p>
<p><a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/">Introduction</a><br />
<a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-ii-analytical-results/">Analytical results</a><br />
<a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-iii-numerical-results/">Numerical results</a><br />
<a href="http://www.johndcook.com/blog/2008/08/09/random-inequalities-iv-cauchy-distributions/">Cauchy distributions</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/08/21/random-inequalities-v-beta-distributions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Statistically significant but incorrect</title>
		<link>http://www.johndcook.com/blog/2008/08/19/statistically-significant-but-incorrect/</link>
		<comments>http://www.johndcook.com/blog/2008/08/19/statistically-significant-but-incorrect/#comments</comments>
		<pubDate>Tue, 19 Aug 2008 06:00:37 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/08/19/statistically-significant-but-incorrect/</guid>
		<description><![CDATA[The Decision Science News blog has an article highlighting a tool to illustrate how often experiments with significant p-values draw false conclusions. Here&#8217;s the web site they refer to.
See also Most published research results are false.
]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.decisionsciencenews.com/?p=399">Decision Science News</a> blog has an article highlighting a tool to illustrate how often experiments with significant p-values draw false conclusions. Here&#8217;s the <a href="http://www.jerrydallal.com/LHSP/multtest.htm">web site</a> they refer to.</p>
<p>See also <a href="http://www.johndcook.com/blog/2008/02/07/most-published-research-results-are-false/">Most published research results are false</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/08/19/statistically-significant-but-incorrect/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Conflicting ideas of simplicity</title>
		<link>http://www.johndcook.com/blog/2008/08/12/conflicting-ideas-of-simplicity/</link>
		<comments>http://www.johndcook.com/blog/2008/08/12/conflicting-ideas-of-simplicity/#comments</comments>
		<pubDate>Tue, 12 Aug 2008 12:28:32 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Probability and Statistics]]></category>
		<category><![CDATA[Simplicity]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/08/12/conflicting-ideas-of-simplicity/</guid>
		<description><![CDATA[Sometimes it&#8217;s simpler to compute things exactly than to use an approximation. When you work on problems that cannot be computed exactly long enough, you start to assume everything falls in that category. I posted a tech report a few days ago about a problem in studying clinical trials that could be solved exactly even [...]]]></description>
			<content:encoded><![CDATA[<p>Sometimes it&#8217;s simpler to compute things exactly than to use an approximation. When you work on problems that cannot be computed exactly long enough, you start to assume everything falls in that category. I posted a <a href="http://www.bepress.com/mdandersonbiostat/paper45/">tech report</a> a few days ago about a problem in studying clinical trials that could be solved exactly even though it was commonly approximated by simulation.</p>
<p>This is another example of <a href="http://www.johndcook.com/blog/2008/07/29/try-the-simplest-thing-that-could-possibly-work/">trying the simplest thing that might work</a>. But it&#8217;s also an example of <strong>conflicting ideas of simplicity</strong>. It&#8217;s simpler, in a sense, to do what you&#8217;ve always done than to do something new.</p>
<p>It&#8217;s also an example of a conflict between a programmer&#8217;s idea of simplicity versus a user&#8217;s idea of simplicity. For this problem, the slower and less accurate code requires less work. It&#8217;s more straight-forward and more likely to be correct. The exact solution takes less code but more thought, and I didn&#8217;t get it right the first time. But from a user&#8217;s perspective, having exact results is simpler in several ways: no need to specify a number of replications, no need to wait for results, no need to argue over what&#8217;s real and what&#8217;s simulation noise, etc. In this case I&#8217;m the programmer and the user so I feel the tug in both directions.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/08/12/conflicting-ideas-of-simplicity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Subjecting fewer patients to ineffective treatments</title>
		<link>http://www.johndcook.com/blog/2008/08/05/subjecting-fewer-patients-to-ineffective-treatments/</link>
		<comments>http://www.johndcook.com/blog/2008/08/05/subjecting-fewer-patients-to-ineffective-treatments/#comments</comments>
		<pubDate>Wed, 06 Aug 2008 02:46:06 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Biostatistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/08/05/subjecting-fewer-patients-to-ineffective-treatments/</guid>
		<description><![CDATA[Tomorrow morning I&#8217;m giving a talk on how to subject fewer patients to ineffective treatment in clinical trials. I should have used something like the title of this post as the title of my talk, but instead my talk is called &#8220;Clinical Trial Monitoring With Bayesian Hypothesis Testing.&#8221; Classic sales mistake: emphasizing features rather than [...]]]></description>
			<content:encoded><![CDATA[<p>Tomorrow morning I&#8217;m giving a talk on how to subject fewer patients to ineffective treatment in clinical trials. I should have used something like the title of this post as the title of my talk, but instead my talk is called &#8220;Clinical Trial Monitoring With Bayesian Hypothesis Testing.&#8221; Classic sales mistake: emphasizing features rather than benefits. But the talk is at a statistical conference, so maybe the feature-oriented title isn&#8217;t so bad.</p>
<p>Ethical concerns are the main consideration that makes biostatistics a separate branch of statistics. You can&#8217;t test experimental drugs on people the way you test experimental fertilizers on crops. In human trials, you want to stop the trial early if it looks like the experimental treatment is not as effective as a comparable established treatment, but you want to keep going if it looks like the new treatment might be better. You need to establish rules before the trial starts that quantify exactly what it means to look like a treatment is doing better or worse than another treatment. There are a lot of ways of doing this quantification, and some work better than others. Within its context (single-arm phase II trials with binary or time-to-event endpoints) the method I&#8217;m presenting stops ineffective trials sooner than the methods we compare it to while stopping no more often in situations where you&#8217;d want the trial to continue.</p>
<p>If you&#8217;re not familiar with statistics, this may sound strange. Why not always stop when a treatment is worse and never stop when it&#8217;s better? Because you never know with certainty that one treatment is better than another. The more patients you test, the more sure you can be of your decision, but some uncertainty always remains. So you face a trade-off between being more confident of your conclusion and experimenting on more patients. If you think a drug is bad, you don&#8217;t want to treat thousands more patients with it in order to be extra confident that it&#8217;s bad, so you stop. But you run the risk of shutting down a trial of a treatment that really is an improvement but by chance appeared to be worse at the time you made the decision to stop. Statistics is all about such trade-offs.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/08/05/subjecting-fewer-patients-to-ineffective-treatments/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Random inequalities I: introduction</title>
		<link>http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/</link>
		<comments>http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/#comments</comments>
		<pubDate>Sat, 26 Jul 2008 14:42:20 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/</guid>
		<description><![CDATA[Many Bayesian clinical trial methods have at their core a random inequality. Some examples from M. D. Anderson: adaptive randomization, binary safety monitoring, time-to-event safety monitoring. These method depends critically on evaluating P(X &#62; Y) where X and Y are independent random variables. Roughly speaking, P(X &#62; Y) is the probability that the treatment represented [...]]]></description>
			<content:encoded><![CDATA[<p>Many Bayesian clinical trial methods have at their core a random inequality. Some examples from M. D. Anderson: <a href="https://biostatistics.mdanderson.org/softwaredownload/SingleSoftware.aspx?Software_Id=62">adaptive randomization</a>, <a href="https://biostatistics.mdanderson.org/softwaredownload/SingleSoftware.aspx?Software_Id=12">binary safety monitoring</a>, <a href="https://biostatistics.mdanderson.org/softwaredownload/SingleSoftware.aspx?Software_Id=73">time-to-event safety monitoring</a>. These method depends critically on evaluating P(X &gt; Y) where X and Y are independent random variables. Roughly speaking, P(X &gt; Y) is the probability that the treatment represented by X is better than the treatment represented by Y. In a trial with binary outcomes, X and Y may be the posterior probabilities of response on each treatment. In a trial with time-to-event outcomes, X and Y may be posterior probabilities of median survival time on two treatments.</p>
<p>People often have a little difficulty understanding what P(X &gt; Y) means. What <em>does </em>it mean? If we take a sample from X and a random sample from Y, P(X &gt;Y) is the probability that the former is larger than the latter. Most confusion around random inequalities comes from thinking of random variables as constants, not random quantities. Here are a couple examples.</p>
<p>First, suppose X and Y have normal distributions with standard deviation 1. If X has mean 4 and Y has mean 3, what is P(X &gt; Y)? Some would say 1, because X is bigger than Y. But that&#8217;s not true. X has a larger <em>mean</em> than Y, but fairly often a <em>sample</em> from Y will be larger than a <em>sample</em> from X.  P(X &gt; Y) = 0.76 in this case.</p>
<p>Next, suppose X and Y are identically distributed. Now what is P(X &gt; Y)? I&#8217;ve heard people say zero because the two random variables are equal. But they&#8217;re not equal. Their <em>distribution functions</em> are equal but the two random variables are independent. P(X &gt; y) = 1/2 by symmetry.</p>
<p>I believe there&#8217;s a psychological tendency to underestimate large inequality probabilities. (I&#8217;ve had several discussions with people who would not believe a reported inequality probability until they computed it themselves. These discussions are important when the decision whether to continue a clinical trial hinges on the result.) For example, suppose X and Y represent the probability of success in a trial in which there were 17 successes out of 30 on X and 12 successes out of 30 on Y. Using a beta distribution model, the density functions of X and Y are given below.</p>
<p style="text-align: center"><img src="http://www.johndcook.com/betaineq.png" alt="beta inequality graph" border="0" width="504" height="392" /></p>
<p>The density function for X is essentially the same as Y but shifted to the right. Clearly P(X &gt; Y) is greater than 1/2. But how much greater than a half? You might think not too much since there&#8217;s a lot of mass in the overlap of the two densities. But P(X &gt; Y) is a little more than 0.9.</p>
<p>The image above and the numerical results mentioned in this post were produced by the <a href="https://biostatistics.mdanderson.org/softwaredownload/SingleSoftware.aspx?Software_Id=9">Inequality Calculator</a> software.</p>
<p>Part II will discuss <a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-ii-analytical-results/">analytically evaluating random inequalities</a>. Part III will discuss <a href="http://www.johndcook.com/blog/2008/07/26/random-inequalities-iii-numerical-results/">numerically evaluating random inequalities</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/07/26/random-inequalities-i/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Three ways of tuning an adaptively randomized trial</title>
		<link>http://www.johndcook.com/blog/2008/07/22/three-ways-of-tuning-an-adaptively-randomized-trial/</link>
		<comments>http://www.johndcook.com/blog/2008/07/22/three-ways-of-tuning-an-adaptively-randomized-trial/#comments</comments>
		<pubDate>Tue, 22 Jul 2008 18:49:05 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/07/22/three-ways-of-tuning-an-adaptively-randomized-trial/</guid>
		<description><![CDATA[Yesterday I gave a presentation on designing clinical trials using adaptive randomization software developed at M. D. Anderson Cancer Center. The heart of the presentation is summarized in the following diagram.

(A slightly larger and clearer version if the diagram is available here.)
Traditional randomized trials use equal randomization (ER). In a two-arm trial, each treatment is given with [...]]]></description>
			<content:encoded><![CDATA[<p align="left">Yesterday I gave a presentation on designing clinical trials using <a href="https://biostatistics.mdanderson.org/SoftwareDownload/SingleSoftware.aspx?Software_Id=62">adaptive randomization software</a> developed at M. D. Anderson Cancer Center. The heart of the presentation is summarized in the following diagram.</p>
<p style="text-align: center"><img src="http://www.johndcook.com/Arand3.gif" border="0" alt="Diagram of three methods of tuning adaptively randomized trial designs" width="395" height="220" /></p>
<p>(A slightly larger and clearer version if the diagram is available <a href="http://www.johndcook.com/Arand4.gif">here</a>.)</p>
<p>Traditional randomized trials use <strong>equal randomization</strong> (ER). In a two-arm trial, each treatment is given with probability 1/2. <strong>Simple adaptive randomization</strong> (SAR) calculates the probability that a treatment is the better treatment given the data seen so far and randomizes to that treatment with that probability. For example, if it looks like there&#8217;s an 80% chance that Treatment B is better, patients will be randomized to Treatment B with probability 0.80. <strong>Myopic optimization</strong> (MO) gives each patient what appears to be the best treatment given the available data with no randomization.</p>
<p>Myopic optimization is ethically appealing, but has terrible statistical properties. Equal randomization has good statistical properties, but will put the same number of patients on each treatment, regardless of the evidence that one treatment is better. Simple adaptive randomization is a compromise position, retaining much of the <a href="http://www.mdanderson.org/pdf/biostats_utmdabtr_002_06.pdf">power</a> of equal randomization while also treating more patients on the better treatment on average.</p>
<p>The adaptive randomization software provides three ways of compromising between the operating characteristics ER and SAR.</p>
<ol>
<li>Begin the trial with a burn-in period of equal randomization followed by simple adaptive randomization.</li>
<li>Use simple adaptive randomization, except if the randomization probability drops below a certain threshold, substitute that minimum value.</li>
<li>Raise the simple adaptive randomization probability to a power between 0 and 1 to obtain a new randomization probability.</li>
</ol>
<p>Each of these three approaches reduces to ER at one extreme and SAR at the other. In between the extremes, each produces a design with operating characteristics somewhere between those of ER and SAR.</p>
<p>In the first approach, if the burn-in period is the entire trial, you simply have an ER trial. If there is no burn-in period, you have an SAR trial. In between you could have a burn-in period equal to some percentage of the total trial between 0 and 100%. A burn-in period of 20% is typical.</p>
<p>In the second approach, you could specify the minimum randomization probability as 0.5, negating the adaptive randomization and yielding ER. At the other extreme, you could set the minimum randomization probability to 0, yielding SAR. In between you could specify some non-zero randomization probability such as 0.10.</p>
<p>In the third approach, a power of zero yields ER. A power of 1 yields SAR. Unlike the other two approaches, this approach could yield designs approaching MO by using powers larger than 1. This is the most general approach since it can produce a continuum of designs with characteristics ranging from ER to MO. For more on this approach, see <a href="http://www.bepress.com/mdandersonbiostat/paper27/">Understanding the exponential tuning parameter in adaptively randomized trials</a>.</p>
<p>So with three methods to choose from, which one do you use? I did some simulations to address this question. I expected that all three methods would perform about the same. However, this is not what I found. To read more, see <a href="http://www.bepress.com/mdandersonbiostat/paper32">Comparing methods of tuning adaptive randomized trials</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/07/22/three-ways-of-tuning-an-adaptively-randomized-trial/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Cohort assignments in clinical trials</title>
		<link>http://www.johndcook.com/blog/2008/06/26/cohort-assignments-in-clinical-trials/</link>
		<comments>http://www.johndcook.com/blog/2008/06/26/cohort-assignments-in-clinical-trials/#comments</comments>
		<pubDate>Thu, 26 Jun 2008 11:41:43 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/06/26/cohort-assignments-in-clinical-trials/</guid>
		<description><![CDATA[Cohorts are very simple in theory but messy in practice. In a clinical trial, a cohort is a group of patients who receive the same treatment. For example, in dose-finding trials, it is very common to treat patients in groups of three. I&#8217;ll stick with cohorts of three just to be concrete, though nothing here depends [...]]]></description>
			<content:encoded><![CDATA[<p>Cohorts are very simple in theory but messy in practice. In a clinical trial, a cohort is a group of patients who receive the same treatment. For example, in dose-finding trials, it is very common to treat patients in groups of three. I&#8217;ll stick with cohorts of three just to be concrete, though nothing here depends particularly on this choice of cohort size.</p>
<p>If we number patients in the order in which they arrive, patients 1, 2, and 3 would be the first cohort. Patients 4, 5, and 6 would be the second cohort, etc. If it were always that simple, we could determine which cohort a patient belongs to based on their accrual number alone. To calculate a patient&#8217;s cohort number, subtract 1 from their accrual number, divide by 3, throw away any remainder, and add 1. In math symbols, the cohort number for patient #n would be 1 + ⌊(n-1)/3⌋. (See the <a href="http://www.johndcook.com/blog/2008/06/26/why-computer-scientists-count-from-zero/">next post</a>.)</p>
<p>Here&#8217;s an example of why that won&#8217;t work. Suppose you treat patients 1, 2, and 3, then discover that patient #2 was not eligible for the trial after all. (This happens regularly.) Now a 4th patient enters the trial. What cohort are they in? If patient #4 arrived <em>after</em> you discovered that patient #2 was ineligible, you could put patient #4 in the first cohort, essentially taking patient #2&#8217;s place. But if patient #4 arrived <em>before</em> you discovered that patient #2 was ineligible, then patient #4 would receive the treatent assigned to the second cohort; the first cohort would have a hole in it and only contain two patients. You could treat patient #5 with the treatment of the first cohort to try to patch the hole, but that&#8217;s more confusing. It gets even worse if you&#8217;re on to the third or fourth cohort before discovering a gap in the first cohort.</p>
<p>In addition to patients being removed from a trial due to ineligibility, patients can remove themselves from a trial at any time.</p>
<p>There are numerous other ways the naïve view of cohorts can fail. A doctor may decide to give the same treatment to only two consecutive patients, or to four consecutive patients, letting medical judgment override the dose assignment algorithm for a particular patient. A mistake could cause a patient to receive the dose intended for another cohort. Researchers may be unable to access the software needed to make the dose assignment for a new cohort and so they give a new patient the dose from the previous cohort.</p>
<p>Cohort assignments can become so tangled that it is simply not possible to look at an ordered list of patients and their treatments after the fact and determine how the patients were grouped into cohorts. Cohort assignment is to some extent a mental construct, an expression of how the researcher thought about the patients, rather than an objective grouping.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/06/26/cohort-assignments-in-clinical-trials/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Small effective sample size does not mean uninformative</title>
		<link>http://www.johndcook.com/blog/2008/06/17/small-effective-sample-size-does-not-mean-uninformative/</link>
		<comments>http://www.johndcook.com/blog/2008/06/17/small-effective-sample-size-does-not-mean-uninformative/#comments</comments>
		<pubDate>Wed, 18 Jun 2008 04:07:28 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Bayesian]]></category>
		<category><![CDATA[Probability and Statistics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/06/17/small-effective-sample-size-does-not-mean-uninformative/</guid>
		<description><![CDATA[Today I talked to a doctor about the design of a randomized clinical trial that would use a Bayesian monitoring rule. The probability of response on each arm would be modeled as a binomial with a beta prior. Simple conjugate model. The historical response rate in this disease is only 5%, and so the doctor [...]]]></description>
			<content:encoded><![CDATA[<p>Today I talked to a doctor about the design of a randomized clinical trial that would use a Bayesian monitoring rule. The probability of response on each arm would be modeled as a binomial with a beta prior. Simple conjugate model. The historical response rate in this disease is only 5%, and so the doctor had chosen a beta(0.1, 1.9) prior so that the prior mean matched the historical response rate.</p>
<p>For beta distributions, the sum of the two parameters is called the effective sample size. There is a simple and natural explanation for why a beta(a, b) distribution is said to contain as much information as a+b data observations. By this criterion, the beta(0.1, 1.9) distribution is not very informative: it only has as much influence as two observations. However, viewed in another light, a beta(0.1, 1.9) distribution is highly informative.</p>
<p>This trial was designed to stop when the posterior probability is more than  0.999 that one treatment is more effective than the other. That&#8217;s an unusually high standard of evidence for stopping a trial — a cutoff of 0.99 or smaller would be much more common — and yet a trial could stop after only six patients. If X is the probability of response on one arm and Y is the probability of response on the other, after three failures on the first treatment and three successes on the other, Pr(Y &gt; X) &gt; 0.999.</p>
<p>The explanation for how the trial could stop so early is that the prior distribution is, in an odd sense, highly informative. The trial starts with a strong assumption that each treatment is ineffective. This assumption is somewhat justified by of experience, and yet a beta(0.1, 1.9) distribution doesn&#8217;t fully capture the investigator&#8217;s prior belief.</p>
<p>(Once at least one response has been observed, the beta(0.1, 1.9) prior becomes essentially uninformative. But until then, in this context, the prior is informative.)</p>
<p>A problem with a beta prior is that there is no way to specify the mean at 0.05 without also placing a large proportion of the probability mass below 0.05. The beta prior places too little probability on better outcomes that might reasonably happen. I imagine a more diffuse prior with <em>mode</em> 0.05 rather than mean 0.05 would better describe the prior beliefs regarding the treatments.</p>
<p>The beta prior is convenient because Bayes&#8217; theorem takes a very simple form in this case: starting from a beta(a, b) prior and observing s successes and f failures, the posterior distribution is beta(a+s, b+f).  But a prior less convenient algebraically could be more <a href="http://www.johndcook.com/blog/2008/06/08/robust-priors/">robust</a> and better adept at representing prior information.</p>
<p>A more basic observation is that &#8220;informative&#8221; and &#8220;uninformative&#8221; depend on context. This is part of what motivated Jeffreys to look for prior distributions that were equally (un)informative under a set of transformations. But Jeffreys&#8217; approach isn&#8217;t the final answer. As far as I know, there&#8217;s no universally acceptable resolution to this <a href="http://www.johndcook.com/blog/2008/04/22/problems-versus-dilemmas/">dilemma</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/06/17/small-effective-sample-size-does-not-mean-uninformative/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>First cancer gene therapy to pass phase III</title>
		<link>http://www.johndcook.com/blog/2008/06/02/first-cancer-gene-therapy-to-pass-phase-iii/</link>
		<comments>http://www.johndcook.com/blog/2008/06/02/first-cancer-gene-therapy-to-pass-phase-iii/#comments</comments>
		<pubDate>Mon, 02 Jun 2008 14:23:43 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Clinical trials]]></category>
		<category><![CDATA[Cancer]]></category>
		<category><![CDATA[Genetics]]></category>

		<guid isPermaLink="false">http://www.johndcook.com/blog/2008/06/02/first-cancer-gene-therapy-to-pass-phase-iii/</guid>
		<description><![CDATA[A gene therapy developed at M. D. Anderson Cancer Center for head and neck cancer is the first such treatment to succeed in a phase III trial. See the press release for more details.
(Phase III studies are large, multi-institutional studies required for regulatory approval of new drugs.)
]]></description>
			<content:encoded><![CDATA[<p>A gene therapy developed at M. D. Anderson Cancer Center for head and neck cancer is the first such treatment to succeed in a phase III trial. See the <a href="http://www.mdanderson.org/departments/newsroom/display.cfm?id=4376DB08-82BB-4FCB-9FAB5CF9FA51DEFF&amp;method=displayFull&amp;pn=00c8a30f-c468-11d4-80fb00508b603a14">press release</a> for more details.</p>
<p>(Phase III studies are large, multi-institutional studies required for regulatory approval of new drugs.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.johndcook.com/blog/2008/06/02/first-cancer-gene-therapy-to-pass-phase-iii/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.953 seconds -->

