12.07.2015 Views

Initial sequencing and analysis of the human genome - Vitagenes

Initial sequencing and analysis of the human genome - Vitagenes

Initial sequencing and analysis of the human genome - Vitagenes

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

articlesvelocardi<strong>of</strong>acial/DiGeorge <strong>and</strong> Williams' syndromes 215,235±240 ).Until now, <strong>the</strong> identi®cation <strong>and</strong> characterization <strong>of</strong> segmentalduplications have been based on anecdotal reportsÐfor example,®nding that certain probes hybridize to multiple chromosomal sitesor noticing duplicated sequence at certain recurrent chromosomalbreakpoints. The availability <strong>of</strong> <strong>the</strong> entire genomic sequence willmake it possible to explore <strong>the</strong> nature <strong>of</strong> segmental duplicationsmore systematically. This <strong>analysis</strong> can begin with <strong>the</strong> current state <strong>of</strong><strong>the</strong> draft <strong>genome</strong> sequence, although caution is required becausesome apparent duplications may arise from a failure to mergesequence contigs from overlapping clones. Alternatively, erroneousFigure 31 Duplication l<strong>and</strong>scape <strong>of</strong> chromosome 21. The size <strong>and</strong> location <strong>of</strong>intrachromosomal (blue) <strong>and</strong> interchromosomal (red) duplications are depicted along <strong>the</strong>sequence <strong>of</strong> <strong>the</strong> long arm <strong>of</strong> chromosome 21. Gaps between ®nished sequence aredenoted by empty space but do not represent actual gap size.assembly <strong>of</strong> closely related sequences from nonoverlapping clonesmay underestimate <strong>the</strong> true frequency <strong>of</strong> such features, particularlyamong those segments with <strong>the</strong> highest sequence similarity. Accordingly,we adopted a conservative approach for estimating suchduplication from <strong>the</strong> available draft <strong>genome</strong> sequence.Pericentromeres <strong>and</strong> subtelomeres. We began by re-evaluating <strong>the</strong>®nished sequences <strong>of</strong> chromosomes 21 <strong>and</strong> 22. The initial papers on<strong>the</strong>se chromosomes 93,94 noted some instances <strong>of</strong> interchromosomalduplication near each centromere. With <strong>the</strong> ability now to compare<strong>the</strong>se chromosomes to <strong>the</strong> vast majority <strong>of</strong> <strong>the</strong> <strong>genome</strong>, it isapparent that <strong>the</strong> regions near <strong>the</strong> centromeres consist almostentirely <strong>of</strong> interchromosomal duplicated segments, with littleor no unique sequence. Smaller regions <strong>of</strong> interchromosomalduplication are also observed near <strong>the</strong> telomeres.Chromosome 22 contains a region <strong>of</strong> 1.5 Mb adjacent to <strong>the</strong>centromere in which 90% <strong>of</strong> sequence can now be recognized toconsist <strong>of</strong> interchromosomal duplication (Fig. 30). Conversely, 52%<strong>of</strong> <strong>the</strong> interchromosomal duplications on chromosome 22 werelocated in this region, which comprises only 5% <strong>of</strong> <strong>the</strong> chromosome.Also, <strong>the</strong> subtelomeric end consists <strong>of</strong> a 50-kb region consistingalmost entirely <strong>of</strong> interchromosomal duplications.Chromosome 21 presents a similar l<strong>and</strong>scape (Fig. 31). The ®rst1 Mb after <strong>the</strong> centromere is composed <strong>of</strong> interchromosomalrepeats, as well as <strong>the</strong> largest (. 200 kb) block <strong>of</strong> intrachromosomallyduplicated material. Again, most interchromosomal duplicationson <strong>the</strong> chromosome map to this region <strong>and</strong> <strong>the</strong> mostsubtelomeric region (30 kb) shows extensive duplication amongnonhomologous chromosomes.aSequence identity (%)10099989796959493929190100 200 300 400 500cSequence identity (%)10099989796959493929190100 200 300 400 500Sequence identity (%)10099989796959493929190600 700 800 900 1,000Sequence identity (%)10099989796959493929190600 700 800 900 1,000Sequence identity (%)100999897969594939291901,100 1,200 1,300 1,400 1,500Sequence identity (%)100999897969594939291901,100 1,200 1,300 1,400 1,500bSequence identity10099989796959493929190CTR CMD ALD320 340 360 380 420400 440dSequence identity (%)Sequence identity (%)1009998979695949392919010099989796959493929190100 200 300 400 500600Figure 32 Mosaic patterns <strong>of</strong> duplications. Panels depict various patterns <strong>of</strong> duplicationwithin <strong>the</strong> <strong>human</strong> <strong>genome</strong> (PARASIGHT). For each region, a segment <strong>of</strong> draft <strong>genome</strong>sequence (100±500 kb) is shown with both interchromosomal (red) <strong>and</strong> intrachromosomal(blue) duplications displayed along <strong>the</strong> horizontal line. Below <strong>the</strong> line, each separatesequence duplication is indicated (with a distinct colour) relative to per cent nucleotideidentity for <strong>the</strong> duplicated segment (y axis). Black bars show <strong>the</strong> relative locations <strong>of</strong> largeblocks <strong>of</strong> heterochromatic sequences (alpha, gamma <strong>and</strong> HSAT sequence). a, An activepericentromeric region on chromosome 21. b, An ancestral region from Xq28 that hascontributed various `genic' segments to pericentromeric regions. c, A pericentromeric890 © 2001 Macmillan Magazines Ltd NATURE | VOL 409 | 15 FEBRUARY 2001 | www.nature.com

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!