13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

82 BERTRANPETIT ET AL.diversity in humans means that the density <strong>of</strong> markers ina map may be restricted, and not all SNPs will be as abundantas required to pick up an LD signal with, for example,an infrequent variant with a small contribution to acomplex disease.As explained above, population size and its fluctuationsover time have a clear effect on LD, with low effectivepopulation sizes and bottlenecks contributing to it. Ifthe history <strong>of</strong> a population is sufficiently well known, thiscan be verified. For instance, Laan and Pääbo (1997) andKaessmann et al. (2002) found that the Saami and Evenki,reindeer herders <strong>of</strong> presumably constant size, showhigher levels <strong>of</strong> LD when compared to neighboring, expandingpopulations. This has two implications: (1) Asexplained in the next section, it is possible to attempt thereverse inference; that is, to infer (although in a rathervague and qualitative manner) population history fromLD and (2) population history matters in biomedical researchaimed at deciphering the genetic architecture <strong>of</strong>complex disease making successful gene hunting morelikely in some populations. In particular, populations <strong>of</strong>constant size, such as the Saami and Evenki, may harborextensive LD that may allow the detection <strong>of</strong> the relevantgenetic associations in complex diseases. Moreover,these populations tend to be isolated, with lower geneticdiversity, which may also be reflected in a lesser allelicand genic heterogeneity in complex diseases, thus facilitatingthe discovery <strong>of</strong> such variants.Despite the hopes created by the isolated populationapproach, however, it is not clear what contribution itmay have to the understanding <strong>of</strong> genetic disease. Atworst, population history may be so complex and sopoorly known that LD in a population cannot be predictablea priori and should be determined empirically foreach population and genome region. <strong>The</strong> sources <strong>of</strong>paleodemographic data are scarce and are providedmainly by paleoanthropology, archaeology, and historicalsources, which necessarily implies large uncertainties.<strong>The</strong> example <strong>of</strong> the Icelandic population is quite revealing;from the first settlement by Vikings in the MiddleAges, the demographic history <strong>of</strong> the island is known inexceptionally excruciating detail, given the Icelandicnational passion for genealogy. However, the unknowndiversity in the settlers has precluded the use <strong>of</strong> LD as atool for association mapping in an efficient way, and themost valuable resource for biomedical research in Icelandremains the genealogical database.<strong>The</strong> genome-wide structure <strong>of</strong> LD in humans needs tobe tackled and understood if LD is to be used efficientlyas a gene discovery tool. However, the population samplingdensity required to obtain a description <strong>of</strong> the requiredlevel <strong>of</strong> detail is not clearly known. A cynical viewwould be that only the populations that can afford thecostly pharmaceutical diagnostics and treatments need tobe sampled; in an ideal world <strong>of</strong> universal health care, thelevel <strong>of</strong> detail that should be both relevant and nonredundantmay be just guessed if LD is to behave as genetic diversity.<strong>The</strong>n, there would be almost as much variationwithin as among continents, and all continents should besampled, with much more variation within Africa thanelsewhere. <strong>The</strong> old-fashioned vision <strong>of</strong> human diversityas made <strong>of</strong> monolithic Europeans, East Asians, andAfricans still prevails in genetics without a solid basis,and there is resistance to the realization that although itaccounts for a large fraction <strong>of</strong> the world population, itdoes not relate to the actual structure <strong>of</strong> LD and genomevariation.WORLDWIDE VARIATION OF LD INSPECIFIC GENE REGIONS<strong>The</strong> usage <strong>of</strong> LD as a tool to unravel population historyand to reconstruct demographic aspects <strong>of</strong> human evolutionhas been pioneered by Kenn Kidd at Yale University,and his collaborators. <strong>The</strong> first global analysis <strong>of</strong> the LDvariation in human populations was undertaken in a region<strong>of</strong> around 9.8 kb within the CD4 locus (Tishk<strong>of</strong>f etal. 1996). <strong>The</strong>se first results based on 42 worldwide populationsshowed extremely reduced LD in sub-SaharanAfrican populations and significantly higher levels <strong>of</strong>haplotype diversity than in non-African populations. Thisfact has been explained by the larger effective populationsize maintained in sub-Saharan populations and the effect<strong>of</strong> a recent bottleneck in the expansion <strong>of</strong> modern humansout <strong>of</strong> Africa. <strong>The</strong> rationale <strong>of</strong> this conclusion is based onthe fact that LD decays through time due to recombination,and therefore large and old populations tend to presentreduced levels <strong>of</strong> LD, whereas not enough time haselapsed to reduce LD in small and recently formed populationsthat preserve the strong LD produced in a bottleneck.Thus, the global LD approach was consistent withthe hypothesis <strong>of</strong> the “out <strong>of</strong> Africa” origin <strong>of</strong> modern humans,already supported by a large body <strong>of</strong> genetic evidence.However, these results were derived from a singlelocus and may be the spurious result <strong>of</strong> other processessuch as differential selective pressures among humanpopulations or simple stochastic variation. Populationhistory, nonetheless, should affect all the genome withthe same average strength, and this signal should be detectedin other genome regions.Subsequent worldwide analyses in several other generegions did show a similar pattern <strong>of</strong> lower LD amongsub-Saharan African populations and a LD increase outsideAfrica. However, there were a number <strong>of</strong> deviationsfrom this overall theme. <strong>The</strong> extreme LD pattern shownby the CD4 locus (that is, low LD in Africa and muchhigher LD elsewhere) is also present in DRD2 (Kidd et al.1998), DM (Tischk<strong>of</strong>f et al. 1998), PLAT (Tishk<strong>of</strong>f et al.2000), and COMT (DeMille et al. 2002). Such LD geographicalstructure is moderate in PAH (Kidd et al. 2000)and PKLR-GBA region (Mateu et al. 2002); whereas LDin Africans is similar to that <strong>of</strong> non-Africans in CFTR(Mateu et al. 2001), and even higher in SLC6A4 (Gelernteret al. 1999). <strong>The</strong>se different LD geographic structuresmay be explained by different recombination rates at eachlocus: Loci with higher recombination rates would haverecovered faster from the bottleneck, reaching again thelow African LD levels. Low-grained specific recombinationestimates are available (Kong et al. 2002), but theydo not seem to correlate with LD geographical patternsfor the specific genome regions, as shown in Table 1. <strong>The</strong>increase in LD caused by a bottleneck in the migration

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!