Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
DIRECT DETERMINATION OF GENOME SEQUENCES<br />
Thursday, 2nd June 15:50 La Fonda Ballroom Tech Talk (TT‐2.01)<br />
David Jaffe, Neil Weisenfeld, Vijay Kumar, Kamila Belhocine, Rajiv Bharadwaj,<br />
Deanna Church, Paul Hardenbol, Jill Herschleb, Chris Hindson, Yuan Li, Patrick Marks,<br />
Pranav Patel, Andrew Price, Michael Schnall Levin, Alex Wong, Indira Wu<br />
10x Genomics, Inc<br />
We introduce a new method for determining the genome sequence of an organism. Our method has<br />
the following key advantages:<br />
1. Our starting material consists of 1 ng of high molecular weight DNA, as compared to typical requirements<br />
of 1,000‐10,000 ng or more.<br />
2. We create a single library, as compared to typical methods which require creation of multiple<br />
libraries and often multiple data types. This makes our process fundamentally more robust than<br />
typical methods.<br />
3. Our costs are about ten times lower.<br />
4.The entire process, including assembly, is not organism‐specific. For example, there are no<br />
parameters to specify to the algorithm.<br />
5. We produce a genome sequence that mirrors the actual chromosomes in the sample, as contrasted<br />
with prior methods for which a contig is a mélange of homologous sequences.<br />
To accomplish this, we create a single 10X Genomics Linked‐Read (Chromium Genome) library and<br />
sequence it on the HiSeq X instrument. The data type consists of barcoded pools of reads, each<br />
originating from several very long molecules, with each molecule represented by many reads, and<br />
shallowly covered. Our new turn‐key software, the Supernova Assembler, exploits these pools, first to<br />
create local assemblies, that can capture difficult regions, and then to phase homologous chromosomes.<br />
Using human genomes, we obtain contigs of size 100 kb, in scaffolds of size larger than 10 Mb, and<br />
composed of phase blocks of size 3‐4 Mb. Notably, these phase block lengths greatly exceed those<br />
obtained from any other method, in spite of being constructed from dramatically less expensive data.<br />
We rigorously assess the accuracy of our assemblies, including by means of a HGP sample for which<br />
340 Mb of finished sequence is available. We further demonstrate our method on a wide range of<br />
organisms (both animal and plant), obtained from diverse starting materials. For many purposes, our<br />
method supplants all prior methods for obtaining genome sequences by providing a direct and<br />
inexpensive path to the true sequence of the sample.<br />
118