Gao X, Starmer J, Martin ER. A multiple testing correction method for ...

368 Gao et al. 

the SNPs all together if the eigenvalues can be 

derived. In the situations where the high dimensionality 

prohibits the calculation of eigenvalues, we can 

analyze the SNPs on each chromosome separately or 

according to the gene functions and then sum all of 

the Meff values together. The total Meff can be used to 

calculate the adjusted PWER. In genome-wide 

association studies, we have to partition the SNPs 

into several parts and analyze them separately. Since 

SNPs on different chromosomes are expected to be 

in linkage equilibrium in general populations, the 

genome-wide effective number of independent tests 

can be obtained by summing the chromosome 

specific Meff values. For each chromosome, we may 

use the partition-ligation approach by dividing the 

SNPs into several parts, and then sum the Meff 

values from each partition, similar to how we tested 

our Alzheimer SNP data set. The total Meff is used in 

the final adjustment calculation. Due to the interblock 

correlations that are unlikely to be captured in 

this partition-ligation approach, the total Meff may 

be slightly conservative. However, the interblock 

correlations may be reduced if we partition SNPs 

according to their haplotype block structure. 

In summary, the simpleM algorithm provides a 

highly accurate approximation to the permutationbased 

correction threshold and is easily implemented. 

Itisshowntobesimple,fastandmoreaccuratethan 

recently developed methods and is comparable to the 

permutation-based correction threshold using both 

simulated and real SNP data. The efficiency and 

accuracy of the simpleM method make it an attractive 

choice for multiple testing adjustment when there is 

high intermarker LD in the SNP data set as in 

candidate gene or genome-wide association studies. 

ACKNOWLEDGMENTS 

This work was supported in part by NIH grants 

NS39764, AG019757 and AG20135 and NIEHS T32 

ES007126. We thank Dr. Gary Beecham who prepared 

the Alzheimer data for us. We thank Dr. 

Richard Morris for initial inspiration. 

REFERENCES 

Armitage P. 1955. Tests for linear trends in proportions and 

frequencies. Biometrics 11:375–386. 

Barrett JC, Fry B, Maller J, Daly MJ. 2005. Haploview: analysis and 

visualization of LD and haplotype maps. Bioinformatics 

21:263–265. 

Benjamini Y, Hochberg Y. 1995. Controlling the false discovery 

rate: a practical and powerful approach to multiple testing. J R 

Stat Soc B 57:289–300. 

Bonferroni CE. 1935. Il calcolo delle assicurazioni su gruppi di 

teste, chapter ‘‘Studi in Onore del Professore Salvatore ortu 

Carboni’’. Rome. p 13–60. 

Bonferroni CE. 1936. Teoria statistica delle classi e calcolo delle 

probabilitá. Pubblicazioni del Istituto Superiore di Scienze 

Economiche e Commerciali di Firenze 8:3–62. 

Genet. Epidemiol. 

Cheverud JM. 2001. A simple correction for multiple comparisons 

in interval mapping genome scans. Heredity 87:52–58. 

Churchill GA, Doerge RW. 1994. Empirical threshold values for 

quantitative trait mapping. Genetics 138:963–971. 

Deng HW. 2000. Re: ‘‘biased tests of association: comparisons of 

allele frequencies when departing from Hardy-Weinberg 

proportions’’. Am J Epidemiol 151:335–336. 

Excoffier L, Slatkin M. 1995. Maximum-likelihood estimation of 

molecular haplotype frequencies in a diploid population. Mol 

Biol Evol 12:921–927. 

Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel 

B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero 

SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly 

MJ, Altshuler D. 2002. The structure of haplotype blocks in the 

human genome. Science 296:2225–2229. 

Hastie T, Tibshirani R, Friedman J. 2001. The Elements of 

Statistical Learning. Berlin: Springer. 

Hoh J, Wille A, Ott J. 2001. Trimming, weighting, and grouping 

SNPs in human case-control association studies. Genome Res 

11:2115–2119. 

Hudson RR. 2002. Generating samples under a Wright-Fisher 

neutral modal of genetic variation. Bioinformatics 18:337–338. 

Knapp M. 2001. Re:‘‘biased tests of association: comparisons of 

allele frequencies when departing from Hardy-Weinberg 

proportions’’. Am J Epidemiol 154:287–288. 

Li J, Ji L. 2005. Adjusting multiple testing in multilocus analyses using 

the eigenvalues of a correlation matrix. Heredity 95:221–227. 

Lin Z, Altman RB. 2004. Finding haplotype tagging SNPs by use of 

principal components analysis. Am J Hum Genet 75:850–861. 

Mardia KV, Kent JT, Bibby JM. 1979. Multivariate Analysis. 

London: Academic Press. 

Meng Z, Zaykin DV, Xu CF, Wagner M, Ehm MG. 2003. Selection 

of genetic markers for association analyses, using linkage 

disequilibrium and haplotypes. Am J Hum Genet 73:115–130. 

Nielsen DM, Ehm MG, Weir BS. 1999. Detecting marker-disease 

association by testing for Hardy-Weinberg disequilibrium at a 

marker locus. Am J Hum Genet 63:1531–1540. 

Nyholt DR. 2004. A simple correction for multiple testing for 

single-nucleotide polymorphisms in linkage disequilibrium 

with each other. Am J Hum Genet 74:765–769. 

Nyholt DR. 2005. Evaluation of Nyholt’s procedure for multiple 

testing correction—author’s reply. Hum Hered 60:61–62. 

Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich 

D. 2006. Principal components analysis corrects for stratification in 

genome-wide association studies. Nat Genet 38:904–909. 

Rinaldo A, Bacanu SA, Devlin B, Sonpar V, Wasserman L, Roeder 

K. 2005. Characterization of multilocus linkage disequilibrium. 

Genet Epidemiol 28:193–206. 

Risch N, Merikangas K. 1996. The future of genetic studies of 

complex human diseases. Science 273:1516–1517. 

Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, 

Parl FF, Moore JH. 2001. Multifactor-dimensionality reduction 

reveals high-order interactions among estrogen-metabolism 

genes in sporadic breast cancer. Am J Hum Genet 69:138–147. 

Salyakina D, Seaman SR, Browning BL, Dudbridge F, Muller- 

Myhsok B. 2005. Evaluation of Nyholt’s procedure for multiple 

testing correction. Hum Hered 60:19–25. 

Sasieni PD. 1997. From genotypes to genes: doubling the sample 

size. Biometrics 53:1253–1261. 

Schäfer J, Strimmer K. 2005. A shrinkage approach to large scale 

covariance-matrix estimation and implications for functional 

genomics. Stat Appl Genet Mol Biol 4:32. 

Schaid DJ. 2004. Linkage disequilibrium testing when linkage 

phase is unknown. Genetics 166:505–512.

Previous page

Next page

1

2

3

4

5

6

7

8

9

Gao X, Starmer J, Martin ER. A multiple testing correction method for ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?