Statistical Inference After an Adaptive Group Sequential Design: A ...
Statistical Inference After an Adaptive Group Sequential Design: A ...
Statistical Inference After an Adaptive Group Sequential Design: A ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
596 b i o s t a t i s t i c s<br />
Tremmel<br />
istics regarding both expected sample size <strong>an</strong>d<br />
power. One possible solution c<strong>an</strong> be found in<br />
Bayesi<strong>an</strong> decision theory (27, p. 204); the necessary<br />
loss function could be constructed by assigning<br />
monetary values for patients enrolled,<br />
as well as for the type II error. This could be the<br />
subject of future investigations.<br />
Some of the adv<strong>an</strong>tages of classical GSD<br />
hinge on strict adherence to predetermined decision<br />
rules. In the typical DMB-driven clinical<br />
trial, this may turn out to be <strong>an</strong> illusion, which<br />
should lead to some of the same issues with statistical<br />
inference as described above for the<br />
aGSD. Indeed, this was the motivation behind<br />
the development of the rCI (3, p. 189). In m<strong>an</strong>y<br />
cases, the DMBs are fully unblinded to interim<br />
results, which may inform their recommendation<br />
of the timing of the next interim <strong>an</strong>alysis,<br />
as well as effect size assumptions <strong>an</strong>d target<br />
power (for <strong>an</strong> example, see Ref. 28). This may<br />
impact not only the statistical inference, but<br />
also the validity of the traditional GSD—a<br />
much more severe problem. Gr<strong>an</strong>ted, in the scenarios<br />
investigated for results-driven timing of<br />
interim <strong>an</strong>alyses for traditional GSDs, the im-<br />
pact on α was small (29). Nevertheless, it may<br />
seem cle<strong>an</strong>er to use a design that fully “legalizes”<br />
such “crimes” (<strong>an</strong>d regulators ought to encourage<br />
it)—in particular for open-label trials<br />
such as our case study.<br />
c o N c L U s i o N<br />
There is a trade-off between flexibility in trial<br />
conduct, <strong>an</strong>d accuracy of statistical inference.<br />
Generally, the flexible aGSD design will lead to<br />
wider confidence intervals. In addition, the<br />
openness of the trial causes theoretical difficulties<br />
with some aspects of statistical inference<br />
(in particular: bias) that are not all resolved.<br />
There are cases when this trade-off may favor<br />
the flexible approach—in particular, when the<br />
trial is a r<strong>an</strong>domized, open-label trial, <strong>an</strong>d/or<br />
when the size of a worthwhile effect depends on<br />
future developments.<br />
Acknowledgments—The author is indebted to Dr. C. K.<br />
Ch<strong>an</strong>g for some early suggestions. The author also<br />
owes th<strong>an</strong>ks to two <strong>an</strong>onymous reviewers for their encouragement<br />
<strong>an</strong>d thorough questioning.<br />
a P P E N D i x 1<br />
P R o o F t H a t t H E P V a L U E b a s E D o N W R i g H t ( 1 2 ) i s U N i F o R M Ly<br />
D i s t R i b U t E D F o R P o c o c k ’ s D E s i g N<br />
Pocock’s procedure defines P crit = f(α) such that the probability (under H 0 ) that the smallest observed<br />
P value is lower th<strong>an</strong> P crit is α. For this case, Wright’s adjusted P value is α′ = f −1 (P min ), where P min is the<br />
minimum P value actually observed, <strong>an</strong>d α′ is the probability, under H 0 , to observe a minimum P value<br />
as small as or smaller th<strong>an</strong> P min .<br />
α′ is a P value because it follows the uniform (0,1) distribution under H 0 : The function f −1 (.) is the<br />
probability integral tr<strong>an</strong>sformation of the null distribution of P min , that is, f −1 (.) says how likely it is, under<br />
H 0 , to obtain a minimum P value that is even smaller th<strong>an</strong> the observed minimum P value. Using<br />
upper case for the r<strong>an</strong>dom variables, prob(PMIN < P min ) = prob(A′ < α′) = α′, which defines the uniform<br />
distribution.<br />
a P P E N D i x 2<br />
D E c o M P o s i t i o N o F t H E s t R a t i F i E D U N s q U a R E D M H s t a t i s t i c<br />
The unsquared version Z MH of the M<strong>an</strong>tel-Haenszel statistic Q MH c<strong>an</strong> be shown to be a weighted average<br />
of the estimator ϑ ˆ = (πˆ 1 − πˆ 2 ) (30). For stage k, Z MH.k = (w Bk (πˆ B1k − πˆ B2k ) + w Ck (πˆ C1k − πˆ C2k ))/σ k , where<br />
w Bk is the M<strong>an</strong>tel-Haenszel weight (n B1k * n B2k )/(n B1k + n B2k ), <strong>an</strong>d the first index B designates the stratum<br />
(B for Binet stage B, <strong>an</strong>d C for Binet stage C). σ k is a function of the four margins of the 2 × 2 table: