Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
IMPROVING GENOME ANALYSIS USING<br />
LINKED-READS<br />
Friday, 3rd June 14:00 La Fonda Ballroom Talk (OS‐9.01)<br />
Deanna Church, Kristina Giorda, Cassandra Jabara, Sofia Kyriazopoulu Panagiotopoulou,<br />
Andrew Wei Xu, Heather Ordonez, Haynes Heaton, Mark Pratt, Patrick Marks,<br />
Paul Hardenbol, Adrian Fehr, Michael Schnall Levin<br />
10x Genomics, Inc<br />
High‐throughput sequencing (HTS) has revolutionized genome analysis. Tens of thousands of<br />
genomes and hundreds of thousands of exomes have been analyzed globally allowing for new biological<br />
insights at both population and individual levels. Despite these advances, it has become increasingly<br />
clear that traditional methods are insufficient for providing a complete view of the genome.<br />
Paralogous sequences can often confound alignment, leaving biomedically important regions of the<br />
genome with low quality alignments and variant calls. Extracting information on large‐scale events,<br />
includingcopy number variants (CNVs) and complex structural variants (SVs), is challenging using<br />
only short read data. Further, haplotype‐level resolution in a single individual is not attainable<br />
using short read analysis. To address these problems, we have developed a technology that allows<br />
for the retention of long range information while retaining the power, accuracy, and scalability of<br />
short read sequencing technologies, producing a data type referred to as ‘Linked‐Reads’ that enables a<br />
more complete analysis of a genome. At its core, haplotype‐level dilution of long input molecules into<br />
over 1 million barcoded partitions allows for high‐resolution reference‐based analysis. We have<br />
demonstrated the ability to reconstruct individual haplotypes that span several megabases and have<br />
validated these haplotype reconstructions using trio sequencing data. Coupling Linked‐Reads with<br />
novel algorithms that take advantage of these linkages allows for improved performance in regions<br />
of the genome that are typically inaccessible due to the presence of paralogous sequence. Validation<br />
of these variant calls has been challenging as they typically fall outside the Genome In a Bottle<br />
(GIAB) high confidence regions, but we have confirmed several hundred of these using orthogonal<br />
sequencing technologies. The power of the long range linkages also enables the improved detection<br />
of complex structural variants. In addition to identifying copy number variants (CNVs) we detect<br />
inter and intra‐chromosomal events as well as more complex structural rearrangements. Linked‐<br />
Read technology can be used in both a genome and targeted sequencing context, allowing access<br />
to a broader range of applications. The development of Linked‐Reads is an important step in the<br />
evolution of genome analysis by allowing access to more of the genome, resolving complex variants<br />
and reconstructing long‐range haplotypes.<br />
143