13.07.2015 Views

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

The Genom of Homo sapiens.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

MOUSE GENOME ENCYCLOPEDIA PROJECT 197presence <strong>of</strong> trehalose, the 5-kb FL cDNA product is visiblein lanes 2 and 3 without any partial products, showinglow electrophoretic mobility. In Figure 2b, naturalmRNA was used. Using the traditional protocol, thelongest size <strong>of</strong> the first-strand cDNA is around 9 kb.However, in the presence <strong>of</strong> trehalose, it is much improvedto 14–16 kb, covering almost all <strong>of</strong> the transcriptspresent.Selection Technology (CAP Trapper Technology)Figure 3a shows the structure <strong>of</strong> mRNA. <strong>The</strong> CAP siteshould be labeled to select FL cDNAs. We exploited the“diol” structure shown in this figure, which is a specificfeature <strong>of</strong> the CAP site. <strong>The</strong> CAP and poly(A) sites arethe only two sites where diol groups occur. <strong>The</strong> diolgroup can be oxidized by sodium periodine (NaIO 4 ) toproduce a dialdehyde group that could be connected to ahydrazide group. Biotin-hydrazide, which is commerciallyavailable, is very convenient for direct biotinylationin this reaction. This procedure is based on the chemicalreaction, and the efficiency is almost perfect(Carninci et al. 1996, 1997).Figure 3b shows the whole procedure <strong>of</strong> CAP trapping.After single-stranded cDNA synthesis, the CAP andpoly(A) sites are labeled using the protocol shown in Figure3a. RNase I is used to cleave remaining singlestrandedRNA, but does not act on RNA–DNA hybrids.Single-stranded RNA is cleaved by RNase I at a 5´ site.Cleavage also occurs at a poly(A) site (Fig. 3b). <strong>The</strong> nonpoly(T)complementary sequence is designed as a primerfor the first cDNA. Hybridization <strong>of</strong> mRNA with FLcDNA preserves the biotinylated sites, which would otherwisebe lost following RNase I digestion. <strong>The</strong>refore,only mRNA hybridized with full-length single-strandedcDNA is trapped by avidin beads. <strong>The</strong> first-strand cDNAis then isolated and subjected to the subsequent cloningprocess.Normalization and Subtraction TechnologyA standard library reflecting the original mRNA populationis biased, causing unpermissible redundant clonesin the process <strong>of</strong> collecting FL cDNAs. Three populationsare defined within the collected cDNAs. <strong>The</strong>se are “abundantcDNA,” “cDNAs already collected,” and “rare andnew cDNA.” We used the biotinylated RNAs <strong>of</strong> the startingmaterial, and biotinylated RNAs transcribed in vitr<strong>of</strong>rom cDNA already collected, as subtraction drivers(Carninci et al. 2000). <strong>The</strong> first-strand cDNA and thesedrivers are mixed, and hybrids are trapped using avidinbeads. Thus, the undesirable populations <strong>of</strong> first-strandcDNA are eliminated, and only rare and new cDNAs arecollected. In this protocol, we did not use PCR becausethis produces a large cloning bias. This resulted in a goodefficiency <strong>of</strong> new gene discovery in our library.New Cloning VectorWe developed cloning vectors λFLC1, λFLC2,λFLC3, and λFLC4, based on infection after packagingand in vivo circularization, using the Cre-lox recombinationsystem (Carninci et al. 2001). λFLC1 is the prototype<strong>of</strong> our vector, which is based on the insertional λ vector.λFLC2 is the substitution vector for longer cDNAs, <strong>of</strong>more than 3.5 kb on average. <strong>The</strong> super-rare cutter sitesin λFLC3 are designed to transfer the cDNA to the expressionvector. Finally, λFLC4 was designed to clonesuperlong cDNA, with an average length <strong>of</strong> around 10 kb.λFLC4 carries the F-factor replication origin, which isvery stable even when carrying very long insert DNA; forexample, that used in BAC vectors for genomic DNAcloning.Summary <strong>of</strong> the Riken cDNA LibraryRiken cDNA technology created a FL cDNA library <strong>of</strong>good quality. <strong>The</strong> average length <strong>of</strong> cDNA is 2.0–2.5 kbin a standard library, and more than 3.5 kb in λFLC2 andλFLC3. Using λFLC4, cDNAs greater than 10 kb are easilycloned. <strong>The</strong> full-length coverage <strong>of</strong> the Riken libraryis such that 90–95% <strong>of</strong> clones in the standard library encodeat least initiation codon (ATG), and 60% <strong>of</strong> clonesin the accumulated cDNA bank cover whole coding sequences(CDS). In the accumulated bank, the full-lengthrate is lower than that <strong>of</strong> the single library, because wecompromised and incorporated partial cDNAs into thebank if the sequence was novel.Figure 3. (a) Structure <strong>of</strong> mRNA and the procedure <strong>of</strong> its biotinylation.Two diol groups are oxidized by NaIO 4 , and thencoupled with biotin hydrazide. (b) Scheme <strong>of</strong> CAP Trapper forFL cDNA selection.RISA SYSTEMAs explained above, a high-throughput sequence systemwith the capacity to finalize the collection <strong>of</strong> FLcDNA within a few years had not been developed whenwe started. <strong>The</strong> required capacity was 40,000 sequence

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!