13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

144 RUDD, SCHUELER, AND WILLARDFigure 3. Repeat content <strong>of</strong> the junction between the short-arm euchromatin and centromere <strong>of</strong> the X chromosome. Each line andcolor illustrates a different type <strong>of</strong> repeat family present in the most proximal 1 Mb on Xp (drawn using rm2parasight; D. Locke, unpubl.).<strong>The</strong> “satellite junction” indicates an abrupt transition from the euchromatin <strong>of</strong> Xp and the first pericentromeric satellite sequences.At least three types <strong>of</strong> satellites are present here: monomeric α satellite, γ satellite, and a 35-bp satellite (HSAT4). <strong>The</strong> mostproximal annotated genes on Xp (ZXDA and ZXDB) are shown, with the direction <strong>of</strong> transcription indicated by the arrows. <strong>The</strong> “arrayjunction” indicates the transition between monomeric α satellite and DXZ1 higher-order repeats, which extend a further ~3 Mb atthe centromere. (Based on Schueler et al. 2001.)quenced (low-pass sequence sampling). BAC 285M22bridges the junction between the D17Z1-B array andmonomeric α satellite; fluorescence in situ hybridizationstudies demonstrated that this junction lies to the shortarm side <strong>of</strong> D17Z1 (Fig. 4A,C). When we compared thehigher-order repeats in 285M22 and 18L18 (n = 7), allwere 98–99% identical, indicating that they are part <strong>of</strong> ahighly homogeneous array <strong>of</strong> α satellite. Notably, however,whereas D17Z1 and D17Z1-B share a similar multimericstructure, they are only 92% identical, demonstratingthat they are distinct yet related higher-orderrepeats. (Such a relationship differs from a number <strong>of</strong>variants <strong>of</strong> D17Z1 that, although distinctive in theirhigher-order structure and their genomic localizationwithin the large D17Z1 locus, are not distinguished bylevels <strong>of</strong> overall sequence relatedness [Warburton andWillard 1990, 1995].)Although a contig extending across the full D17Z1 andD17Z1-B arrays remains to be achieved, we have establisheda complete contig linking D17Z1-B to the euchromatin<strong>of</strong> 17p (M.K. Rudd, unpubl.), essentially using thestrategy developed as part <strong>of</strong> our Xp work (Schueler et al.2001). <strong>The</strong> genomic content <strong>of</strong> this part <strong>of</strong> chromosome17 is, however, somewhat more complex than that <strong>of</strong> theX, in that there are three different blocks <strong>of</strong> monomeric αsatellite located within ~500 kb <strong>of</strong> D17Z1-B, separatedby regions <strong>of</strong> genomic sequence populated by a number<strong>of</strong> different repeat families (both satellite and non-satellite)and at least some transcribed elements. A similar pictureappears to be emerging on the long arm (17q) side <strong>of</strong>the centromere, although a contig containing stretches <strong>of</strong>monomeric α satellite and proximal 17q sequences hasnot yet been linked to the large D17Z1 array at the centromere(M.K. Rudd, unpubl.).<strong>The</strong> coexistence within pericentromeric regions <strong>of</strong> bothhigher-order repeat arrays <strong>of</strong> α satellite and more limitedstretches <strong>of</strong> monomeric α satellite without any detectablehigher-order structure appears to be a consistent theme <strong>of</strong>a number <strong>of</strong> human chromosomes (Horvath et al. 2001).Although the organizational distinction between thesetypes <strong>of</strong> α satellites (i.e., multimeric vs. monomeric) isapparent at short range with standard homology-findingprograms (e.g., Fig. 1B,C), it is also important to addressthe phylogenetic sequence relationships among the individualmonomers populating these different classes <strong>of</strong> therepeat family.To illustrate this point, we examined ~100 monomers<strong>of</strong> monomeric α satellite from each <strong>of</strong> the Xp and 17qcontigs, together with 28 monomers making up the DXZ1and D17Z1 higher-order repeats. As seen in Figure 5, thesequences fall into two distinct phylogenetic clades, correspondingprecisely to their monomeric or multimericorigin. Within the multimeric clade, the DXZ1 andD17Z1 monomers exhibit close sequence relatedness, asestablished previously (Waye and Willard 1986). Notably,the monomeric clade includes monomers from bothchromosomes (Fig. 5); in other words, clusters <strong>of</strong>monomeric α satellite from one chromosome are moreclosely related to monomeric α satellite from the otherchromosome than they are to the multimeric α satellitethat maps only a few hundred kilobases away on the samechromosome.<strong>The</strong>se relationships presumably reflect the highly efficienthomogenization mechanisms that drive and maintainthe high degree <strong>of</strong> sequence homogeneity withinhigher-order repeat arrays, even across severalmegabases <strong>of</strong> genomic DNA (Willard and Waye 1987;Durfy and Willard 1989; Schueler et al. 2001; Schindelhauerand Schwarz 2002). However, the phylogeneticdata argue that homogenization mechanisms betweenmonomeric and multimeric α satellite are poorly efficientor nonexistent, even over much shorter genomic dis-

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!