01.06.2016 Views

Sequencing

SFAF2016%20Meeting%20Guide%20Final%203

SFAF2016%20Meeting%20Guide%20Final%203

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />

IMPROVING GENOME ANALYSIS USING<br />

LINKED-READS<br />

Friday, 3rd June 14:00 La Fonda Ballroom Talk (OS‐9.01)<br />

Deanna Church, Kristina Giorda, Cassandra Jabara, Sofia Kyriazopoulu Panagiotopoulou,<br />

Andrew Wei Xu, Heather Ordonez, Haynes Heaton, Mark Pratt, Patrick Marks,<br />

Paul Hardenbol, Adrian Fehr, Michael Schnall Levin<br />

10x Genomics, Inc<br />

High‐throughput sequencing (HTS) has revolutionized genome analysis. Tens of thousands of<br />

genomes and hundreds of thousands of exomes have been analyzed globally allowing for new biological<br />

insights at both population and individual levels. Despite these advances, it has become increasingly<br />

clear that traditional methods are insufficient for providing a complete view of the genome.<br />

Paralogous sequences can often confound alignment, leaving biomedically important regions of the<br />

genome with low quality alignments and variant calls. Extracting information on large‐scale events,<br />

includingcopy number variants (CNVs) and complex structural variants (SVs), is challenging using<br />

only short read data. Further, haplotype‐level resolution in a single individual is not attainable<br />

using short read analysis. To address these problems, we have developed a technology that allows<br />

for the retention of long range information while retaining the power, accuracy, and scalability of<br />

short read sequencing technologies, producing a data type referred to as ‘Linked‐Reads’ that enables a<br />

more complete analysis of a genome. At its core, haplotype‐level dilution of long input molecules into<br />

over 1 million barcoded partitions allows for high‐resolution reference‐based analysis. We have<br />

demonstrated the ability to reconstruct individual haplotypes that span several megabases and have<br />

validated these haplotype reconstructions using trio sequencing data. Coupling Linked‐Reads with<br />

novel algorithms that take advantage of these linkages allows for improved performance in regions<br />

of the genome that are typically inaccessible due to the presence of paralogous sequence. Validation<br />

of these variant calls has been challenging as they typically fall outside the Genome In a Bottle<br />

(GIAB) high confidence regions, but we have confirmed several hundred of these using orthogonal<br />

sequencing technologies. The power of the long range linkages also enables the improved detection<br />

of complex structural variants. In addition to identifying copy number variants (CNVs) we detect<br />

inter and intra‐chromosomal events as well as more complex structural rearrangements. Linked‐<br />

Read technology can be used in both a genome and targeted sequencing context, allowing access<br />

to a broader range of applications. The development of Linked‐Reads is an important step in the<br />

evolution of genome analysis by allowing access to more of the genome, resolving complex variants<br />

and reconstructing long‐range haplotypes.<br />

143

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!