Statistical Inference After an Adaptive Group Sequential Design: A ...

More documents

Recommendations

Info

596 b i o s t a t i s t i c s Tremmel istics regarding both expected sample size and power. One possible solution can be found in Bayesian decision theory (27, p. 204); the necessary loss function could be constructed by assigning monetary values for patients enrolled, as well as for the type II error. This could be the subject of future investigations. Some of the advantages of classical GSD hinge on strict adherence to predetermined decision rules. In the typical DMB-driven clinical trial, this may turn out to be an illusion, which should lead to some of the same issues with statistical inference as described above for the aGSD. Indeed, this was the motivation behind the development of the rCI (3, p. 189). In many cases, the DMBs are fully unblinded to interim results, which may inform their recommendation of the timing of the next interim analysis, as well as effect size assumptions and target power (for an example, see Ref. 28). This may impact not only the statistical inference, but also the validity of the traditional GSD—a much more severe problem. Granted, in the scenarios investigated for results-driven timing of interim analyses for traditional GSDs, the impact on α was small (29). Nevertheless, it may seem cleaner to use a design that fully “legalizes” such “crimes” (and regulators ought to encourage it)—in particular for open-label trials such as our case study. c o N c L U s i o N There is a trade-off between flexibility in trial conduct, and accuracy of statistical inference. Generally, the flexible aGSD design will lead to wider confidence intervals. In addition, the openness of the trial causes theoretical difficulties with some aspects of statistical inference (in particular: bias) that are not all resolved. There are cases when this trade-off may favor the flexible approach—in particular, when the trial is a randomized, open-label trial, and/or when the size of a worthwhile effect depends on future developments. Acknowledgments—The author is indebted to Dr. C. K. Chang for some early suggestions. The author also owes thanks to two anonymous reviewers for their encouragement and thorough questioning. a P P E N D i x 1 P R o o F t H a t t H E P V a L U E b a s E D o N W R i g H t ( 1 2 ) i s U N i F o R M Ly D i s t R i b U t E D F o R P o c o c k ’ s D E s i g N Pocock’s procedure defines P crit = f(α) such that the probability (under H 0 ) that the smallest observed P value is lower than P crit is α. For this case, Wright’s adjusted P value is α′ = f −1 (P min ), where P min is the minimum P value actually observed, and α′ is the probability, under H 0 , to observe a minimum P value as small as or smaller than P min . α′ is a P value because it follows the uniform (0,1) distribution under H 0 : The function f −1 (.) is the probability integral transformation of the null distribution of P min , that is, f −1 (.) says how likely it is, under H 0 , to obtain a minimum P value that is even smaller than the observed minimum P value. Using upper case for the random variables, prob(PMIN < P min ) = prob(A′ < α′) = α′, which defines the uniform distribution. a P P E N D i x 2 D E c o M P o s i t i o N o F t H E s t R a t i F i E D U N s q U a R E D M H s t a t i s t i c The unsquared version Z MH of the Mantel-Haenszel statistic Q MH can be shown to be a weighted average of the estimator ϑ ˆ = (πˆ 1 − πˆ 2 ) (30). For stage k, Z MH.k = (w Bk (πˆ B1k − πˆ B2k ) + w Ck (πˆ C1k − πˆ C2k ))/σ k , where w Bk is the Mantel-Haenszel weight (n B1k * n B2k )/(n B1k + n B2k ), and the first index B designates the stratum (B for Binet stage B, and C for Binet stage C). σ k is a function of the four margins of the 2 × 2 table:
Inference After Adaptive GSD b i o s t a t i s t i c s 597 Drug Information Journal σ k 2 = (nB1k n B2k r B.k f B.k )/[n B 2 (nB − 1)] + (n C1k n C2k r C.k f C.k )/[n C 2 (nC − 1)] where r and f indicate the number of responding and failing (not responding) patients. This Z statistic can be represented as a product of an estimator and a weight, as desired: N o t E s Z MH.k = [w Bk /(w Bk + w Ck ) ϑ ˆ Bk + w Ck /(w Bk + w Ck ) ϑˆ Ck ] * (w Bk + w Ck )/σ k = ϑˆ weighted.k * (w Bk + w Ck )/σ k 1. This was done with SeqTrial function seqDesign (8). Another software commonly used for such calculations is EAST (14). 2. This decomposition is not trivial; there are cases where it cannot be done. For the Wilcoxon test, this problem was noted before (15). R E F E R E N c E s 1. Pocock SJ. Clinical Trials: A Practical Approach. New York: Wiley; 1983. 2. Lan KKG, DeMets DL. Discrete sequential boundaries for clinical trials. Biometrika. 1983;70:659–663. 3. Jennison C, Turnbull BW. Group Sequential Methods With Applications to Clinical Trials. Boca Raton, FL: Chapman & Hall/CRC; 2000. 4. Proschan MA, Hunsberger SA. Designed extension of studies based on conditional power. Biometrics. 1995;51:1315–1324. 5. Lehmacher W, Wassmer G. Adaptive sample size calculations in group sequential trials. Biometrics. 1999;55:1286–1290. 6. Knauf WU, Lissichkov T, Aldaoud A, et al. Phase III randomized study of bendamustine versus chlorambucil in previously untreated patients with chronic lymphocytic leukemia. J Clin Oncol. 2009;27:4378–4384. 7. Fleming TR, Richardson BA. Some design issues in microbicide HIV prevention trials. J Infect Dis. 2004;190:666–674. 8. Insightful. S+ SeqTrial 2 User’s Guide. Insightful Corp.; 2002. 9. Whitehead J. On the bias of maximum likelihood estimation following a sequential test. Biometrika. 1986;73:573–581. 10. Emerson SS, Kittelson JM. A computationally simpler algorithm for the UMVUE of a normal mean following a group sequential design. Biometrics. 1997;53:365–369. 11. Coburger S, Wassmer G. Conditional point estimation in adaptive group sequential test designs. Biometric J. 2001;43:821–833. 12. Wright PS. Adjusted P-values for simultaneous inference. Biometrics. 1992;48:1005–1013. 13. Wassmer G. Planning and analyzing adaptive group sequential survival trials. Biometric J. 2006; 48:714–729. 14. Cytel Inc. East 5 (v5.2). 2008. 15. Lan KK, Wittes J. The B-value: a tool for monitoring data. Biometrics. 1988;44:579–585. 16. Tsiatis AA. The asymptotic joint distribution of the efficient scores test for the proportional hazards model calculated over time. Biometrika. 1981;68:311–315. 17. Jahn-Eimermacher A, Ingel K. Adaptive trial design: a general methodology for censored time to event data. Contemp Clin Trials. 2009;30:171– 177. 18. Marubini E, Valsecchi MG. Analysing Survival Data From Clinical Trials and Observational Studies. Chichester, UK: Wiley; 1995. 19. Bauer P, Koenig F. The reassessment of trial perspectives from interim data—a critical view. Stat Med. 2006;25:23–36. 20. Armitage P, McPherson CK, Rowe BC. Repeated significance tests on accumulating data. J R Stat Soc A. 1969;132:235–244. 21. Brannath W, Posch M, Bauer P. Recursive combination tests. J Am Stat Assoc. 2002;97:236–243. 22. Brannath W, Mehta CR, Posch M. Exact confidence bounds following adaptive group sequential tests. Biometrics. 2009;65:539–546. 23. Brannath W, Koenig F, Bauer P. Estimation in confirmatory adaptive designs with treatment selection. Adaptive Trials 2008, Barcelona. 24. Tsiatis AA, Mehta C. On the inefficiency of the adaptive design for monitoring clinical trials. Biometrika. 2003;90:367–387. 25. Jennison C, Turnbull BW. Efficient group sequential designs when there are several effect sizes under consideration. Stat Med. 2006;25: 917–932.
Page 1 and 2: Lothar T. Tremmel, PhD Sr. Director
Page 3 and 4: Inference After Adaptive GSD b i o
Page 5 and 6: Inference After Adaptive GSD b i o
Page 7: Inference After Adaptive GSD b i o

Statistical Inference After an Adaptive Group Sequential Design: A ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?