23.02.2013 Views

Statistical Inference After an Adaptive Group Sequential Design: A ...

Statistical Inference After an Adaptive Group Sequential Design: A ...

Statistical Inference After an Adaptive Group Sequential Design: A ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

596 b i o s t a t i s t i c s<br />

Tremmel<br />

istics regarding both expected sample size <strong>an</strong>d<br />

power. One possible solution c<strong>an</strong> be found in<br />

Bayesi<strong>an</strong> decision theory (27, p. 204); the necessary<br />

loss function could be constructed by assigning<br />

monetary values for patients enrolled,<br />

as well as for the type II error. This could be the<br />

subject of future investigations.<br />

Some of the adv<strong>an</strong>tages of classical GSD<br />

hinge on strict adherence to predetermined decision<br />

rules. In the typical DMB-driven clinical<br />

trial, this may turn out to be <strong>an</strong> illusion, which<br />

should lead to some of the same issues with statistical<br />

inference as described above for the<br />

aGSD. Indeed, this was the motivation behind<br />

the development of the rCI (3, p. 189). In m<strong>an</strong>y<br />

cases, the DMBs are fully unblinded to interim<br />

results, which may inform their recommendation<br />

of the timing of the next interim <strong>an</strong>alysis,<br />

as well as effect size assumptions <strong>an</strong>d target<br />

power (for <strong>an</strong> example, see Ref. 28). This may<br />

impact not only the statistical inference, but<br />

also the validity of the traditional GSD—a<br />

much more severe problem. Gr<strong>an</strong>ted, in the scenarios<br />

investigated for results-driven timing of<br />

interim <strong>an</strong>alyses for traditional GSDs, the im-<br />

pact on α was small (29). Nevertheless, it may<br />

seem cle<strong>an</strong>er to use a design that fully “legalizes”<br />

such “crimes” (<strong>an</strong>d regulators ought to encourage<br />

it)—in particular for open-label trials<br />

such as our case study.<br />

c o N c L U s i o N<br />

There is a trade-off between flexibility in trial<br />

conduct, <strong>an</strong>d accuracy of statistical inference.<br />

Generally, the flexible aGSD design will lead to<br />

wider confidence intervals. In addition, the<br />

openness of the trial causes theoretical difficulties<br />

with some aspects of statistical inference<br />

(in particular: bias) that are not all resolved.<br />

There are cases when this trade-off may favor<br />

the flexible approach—in particular, when the<br />

trial is a r<strong>an</strong>domized, open-label trial, <strong>an</strong>d/or<br />

when the size of a worthwhile effect depends on<br />

future developments.<br />

Acknowledgments—The author is indebted to Dr. C. K.<br />

Ch<strong>an</strong>g for some early suggestions. The author also<br />

owes th<strong>an</strong>ks to two <strong>an</strong>onymous reviewers for their encouragement<br />

<strong>an</strong>d thorough questioning.<br />

a P P E N D i x 1<br />

P R o o F t H a t t H E P V a L U E b a s E D o N W R i g H t ( 1 2 ) i s U N i F o R M Ly<br />

D i s t R i b U t E D F o R P o c o c k ’ s D E s i g N<br />

Pocock’s procedure defines P crit = f(α) such that the probability (under H 0 ) that the smallest observed<br />

P value is lower th<strong>an</strong> P crit is α. For this case, Wright’s adjusted P value is α′ = f −1 (P min ), where P min is the<br />

minimum P value actually observed, <strong>an</strong>d α′ is the probability, under H 0 , to observe a minimum P value<br />

as small as or smaller th<strong>an</strong> P min .<br />

α′ is a P value because it follows the uniform (0,1) distribution under H 0 : The function f −1 (.) is the<br />

probability integral tr<strong>an</strong>sformation of the null distribution of P min , that is, f −1 (.) says how likely it is, under<br />

H 0 , to obtain a minimum P value that is even smaller th<strong>an</strong> the observed minimum P value. Using<br />

upper case for the r<strong>an</strong>dom variables, prob(PMIN < P min ) = prob(A′ < α′) = α′, which defines the uniform<br />

distribution.<br />

a P P E N D i x 2<br />

D E c o M P o s i t i o N o F t H E s t R a t i F i E D U N s q U a R E D M H s t a t i s t i c<br />

The unsquared version Z MH of the M<strong>an</strong>tel-Haenszel statistic Q MH c<strong>an</strong> be shown to be a weighted average<br />

of the estimator ϑ ˆ = (πˆ 1 − πˆ 2 ) (30). For stage k, Z MH.k = (w Bk (πˆ B1k − πˆ B2k ) + w Ck (πˆ C1k − πˆ C2k ))/σ k , where<br />

w Bk is the M<strong>an</strong>tel-Haenszel weight (n B1k * n B2k )/(n B1k + n B2k ), <strong>an</strong>d the first index B designates the stratum<br />

(B for Binet stage B, <strong>an</strong>d C for Binet stage C). σ k is a function of the four margins of the 2 × 2 table:

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!