01.04.2015 Views

Comparative Genomics-Basic and Applied Research.pdf

Comparative Genomics-Basic and Applied Research.pdf

Comparative Genomics-Basic and Applied Research.pdf

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

16 <strong>Comparative</strong> <strong>Genomics</strong><br />

A. B.<br />

E.<br />

(iii)<br />

(ii)<br />

C. D.<br />

(i)<br />

FIGURE 2.1 (See color figure in the insert following page 48.) 454 Life Sciences sequencing.<br />

(A) DNA preparation: Isolated genomic DNA is fragmented, ligated to adaptors, <strong>and</strong> separated<br />

into single str<strong>and</strong>s. (B) Emulsion PCR: Single-str<strong>and</strong>ed DNAs are bound to beads under<br />

conditions that favor one DNA molecule per bead. An oil-PCR reaction mixture is added to<br />

encapsulate bead–DNA complexes into single oil droplets, onto which PCR amplification is<br />

performed to create beads containing several million copies of the same template sequence.<br />

(C) Deposition of the PCR-amplified beads into individual wells in the PTP is followed by<br />

the addition of smaller beads immobilized with ATP surfurylase <strong>and</strong> luciferase (D), which<br />

convert inorganic pyrophosphate into a light signal. (E) Schematic of the GS20 instrument,<br />

which consists of the following subsystems: (i) fluidic assembly for delivery of dATP, dCTP,<br />

dGTP, <strong>and</strong> dTTP reagents; (ii) PTP; <strong>and</strong> (iii) CCD camera. Figure reprinted from Margulies<br />

et al., Nature 437, 376–380, 2005, by permission from Macmillan Publishers Ltd., copyright<br />

(2005).<br />

high-resolution charged coupled device (CCD) camera, permitting detection of the light<br />

generated from each PTP well undergoing the pyrosequencing reaction (Figure 2.1E).<br />

With a pass rate of ~50% <strong>and</strong> a read length of 100 bases, one run will produce about<br />

30–40 million bases of sequence data in 4–5 hours.<br />

The Genome Sequencer 20 (GS20) instrument was launched by 454 Life Sciences<br />

in 2005. More than 40 articles have since been published on the GS20 platform,<br />

describing sequencing of bacterial genomes, 28,30–34 surveying microbial environments<br />

(i.e., metagenomics), 35–40 profiling expressed sequence tags (ESTs), 41–44 <strong>and</strong> wholegenome<br />

surveys of ancient DNA. 45–47 Many of these studies highlight the advantages<br />

<strong>and</strong> disadvantages of the GS20, depending on the intended goals of the research effort.<br />

For example, Hofreuter et al. reported the sequencing <strong>and</strong> characterization of the<br />

highly pathogenic Campylobacter jejuni strain 81-176. 34 Two 454 Life Sciences runs<br />

were performed, generating 60,905,794 high-quality bases from 558,331 successful<br />

reads (i.e., the average read length was 109 bases). A de novo assembly produced a<br />

genome with 34x coverage (i.e., on average, each nucleotide in the assembly was called<br />

by 34 different reads) in 43 contigs (contiguous sequence represented by two or more

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!