13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

GENETICS OF COMMON DISEASES 397within genomic regions implicated through linkage analyses,as for example, in the cloning <strong>of</strong> the gene responsiblefor cystic fibrosis (Riordan et al. 1989).Two factors greatly increased interest in the application<strong>of</strong> LD mapping to common diseases. One was the demonstration<strong>of</strong> Risch and Merikangas (1996) <strong>of</strong> the greaterpower <strong>of</strong> association mapping over linkage studies. <strong>The</strong>yshowed that for modest effect sizes, affected sib pair analyses,for example, would require unrealistically largesample sizes. <strong>The</strong> other was the rapidly expanding coverage<strong>of</strong> the human genome with polymorphic markers.<strong>The</strong> potential application <strong>of</strong> LD mapping for commondisease generated intense interest in patterns <strong>of</strong> linkagedisequilibrium in human populations. Until 2001, thiswork was predicated in many ways on expectations <strong>of</strong>gradual, though noisy, decay <strong>of</strong> LD with physical distancein the genome. It had long been appreciated thatcertain genes, such as the β-globin gene cluster (Chakravatiet al. 1984; Jeffries et al. 2000), harbored hot spots <strong>of</strong>recombination—regions <strong>of</strong> intense homologous recombination—andalso that LD decay was <strong>of</strong>ten poorly correlatedwith distance, either because <strong>of</strong> uneven recombinationin some regions or because <strong>of</strong> the stochasticeffects.Nevertheless, many empirical studies looked at the averagedecay in LD with distance over different genomicregions, effectively ignoring the precise pattern <strong>of</strong> decaywithin a given region. In addition, models used to generateexpectation <strong>of</strong> the pattern <strong>of</strong> LD assumed uniform recombination.In his influential report, Leonid Kruglyakestablished a sort <strong>of</strong> null model for the expected extent <strong>of</strong>LD by assuming an idealized human demographic historyand assuming homogeneous recombination rates. Underthese assumptions, Kruglyak (1999) showed that usablelevels <strong>of</strong> LD (at that time set at d 2 >0.1, which correspondsto a r 2 5%) across 500 kb <strong>of</strong> Chromosome 5q31, Daly etal. (2001) argued that the pattern was better viewed asdiscrete, with stretches <strong>of</strong> sequence showing little or noLD breakdown interspersed by regions <strong>of</strong> sharper LDbreakdown. <strong>The</strong> stretches <strong>of</strong> limited haplotype diversitywere called blocks, and within them up to 95% <strong>of</strong> the observedchromosomes were accounted for by 3 or 4 haplotypes(Daly et al. 2001). Similar patterns were reported byJohnson et al., who observed a similar block-like pattern<strong>of</strong> LD among 122 SNPs across 135 kb <strong>of</strong> 9 genes in Europeanpopulations. Haplotype diversity within blockswas low, with a maximum <strong>of</strong> 6 common haplotypes (frequency>5%) observed within any one block (Johnson etal. 2001).<strong>The</strong>se publications led directly to the idea that blocks<strong>of</strong> LD, within which haplotype diversity is limited, are aprevailing characteristic <strong>of</strong> the genome. This in turn led tothe concept that a set <strong>of</strong> SNPs could represent, or tag,each <strong>of</strong> the common haplotypes in a given region. <strong>The</strong>seSNPs were first referred to as haplotype-tagging SNPs(htSNPs). In an accompanying supplement to the Johnsonet al. study, David Clayton introduced an approach for selectinghtSNPs that focused on the proportion <strong>of</strong> the haplotypediversity that could be explained by a set <strong>of</strong> tags(Johnson et al. 2001). This definition <strong>of</strong> htSNPs was notexplicitly tied to the idea <strong>of</strong> a block, but given its focus ontagging a small number <strong>of</strong> haplotypes, it has <strong>of</strong>ten beenperceived in the community as applying to the selection<strong>of</strong> htSNPs within blocks (see below).In the same issue, Jeffries et al. published data relatedto a possible cause <strong>of</strong> uneven decay <strong>of</strong> LD. Through singlesperm typing in the class II major histocompatibilitycomplex (MHC) region, they showed recombinationevents to be clustered in narrow, discrete regions (Jeffreyset al. 2001).This shows that at least in one genomic region the location<strong>of</strong> recombination hot spots corresponds with, anddrives discontinuities in, the pattern <strong>of</strong> LD. <strong>The</strong>re is currentlya great deal <strong>of</strong> debate about the extent to which LDdiscontinuities coincide with recombination rate hotspots. However, this should not distract from the centralcontribution <strong>of</strong> the Johnson et al., Daly et al., and Jeffrieset al. papers. In breaking with the strong tradition <strong>of</strong>viewing LD decay as relatively smooth and homogeneous,this work represented a striking and importantconceptual advance. After these papers, it was immediatelyunacceptable to focus on descriptions <strong>of</strong> LD that ignoredthe precise pattern <strong>of</strong> decay within a region, andsimilarly unacceptable to make predictions based onmodels that assumed uniform recombination rates.Whatever the causes <strong>of</strong> sharp LD discontinuities, theyhave important implications for the design and interpretation<strong>of</strong> association studies. <strong>The</strong> first large-scale studyexplicitly incorporating the idea <strong>of</strong> very uneven patterns<strong>of</strong> LD decay, or blocks <strong>of</strong> LD, was reported less than ayear later by Gabriel et al. (2002), who analyzed patterns<strong>of</strong> haplotype diversity from over 3700 SNPs in 51 regionsspanning 13 megabases <strong>of</strong> the genome in four populationgroups (European, African-American, Nigerian, and Chinese).<strong>The</strong> same apparent block-like pattern <strong>of</strong> LD withaccompanying lack <strong>of</strong> within-block haplotype diversityas reported by Daly et al. (2001) was observed across all

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!