01.06.2016 Views

Sequencing

SFAF2016%20Meeting%20Guide%20Final%203

SFAF2016%20Meeting%20Guide%20Final%203

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />

BUILDING REFERENCE MATERIALS FOR MIXED<br />

MICROBIAL DETECTION WITH NEXT GENERATION<br />

SEQUENCING<br />

Friday, 3rd June 10:30 La Fonda Ballroom Talk (OS‐7.04)<br />

Jason Kralj, Scott Jackson<br />

National Institute of Standards and Technology<br />

Myriad new sequencing and analytical technologies have emerged within the realm of metagenomic<br />

sequencing. This has promised to transform the way clinical and environmental samples are analyzed.<br />

Indeed, short (e.g. PGM Ion Torrent, MiSeq and HiSeq from Illumina) and long (Pacific Biosciences,<br />

Nanopore) read platforms have much to offer in terms of ability to detect the faintest traces of pathogens<br />

within a complex mixture of organisms. In addition, scores of post‐sequencing analysis software packages<br />

offer a range of speed and sensitivity options to analyze the large data files, each with unique algorithms,<br />

databases, and interfaces. All this choice presents a serious question—does this particular analysis set work<br />

for my sample? While it is inconceivable that one could image the entire scope of experiments and sample<br />

types for microbial systems, there are a few basic analyses that would lend confidence to the abilities of these<br />

emerging tools.<br />

We have endeavored to develop mixtures of microbial DNA with the purpose of identifying lim‐ its of<br />

detection, contamination, bias, noise, and general strengths/weaknesses of various shotgun metagenomics<br />

sequencing platfor Purified DNA represents an ideal condition that can be tightly controlled and analyzed,<br />

without other confounding pre‐analytical variabilities such as extraction efficiency. Preliminary<br />

experiments using NIST reference materials (RMs) and prospective RMs have allowed us to make idealized<br />

mixtures of human, S. aureus, S. enterica subsp. enterica LT2,<br />

P. aeruginosa, and C. sporogenes. Each well‐defined mixture (one equigenomic, the other a dilution<br />

series) demonstrated the breadth of results from multiple analysis tools, and enable the end user to<br />

recognize the limits of these syste<br />

The results indicated that at DNA “concentrations” approaching 1:10 000 (in a background of<br />

human DNA), or a few thousand total reads, detection was not challenging. However, accurate<br />

identification highly depended upon the underlying database used for each tool. Furthermore,<br />

some tools’ filtering algorithms accurately discriminate accurate reads from reads containing<br />

significant numbers of sequencing errors, reducing the number of false positives. With this<br />

knowledge, we’ve proposed in silico analyses that will utilize reference data sets to inform new<br />

mixture designs that will provide benchmark analyses for the informatics tools. Meanwhile, we<br />

are expanding our suite of organisms to provide greater breadth of genomic characteristics (e.g.<br />

near neighbors, high/low G+C content, repetitive sequences) to stress test the sequencing<br />

pipeline. Together, these will enable developers to better ascertain the capabilities of their<br />

systems, and give regulatory agencies the tools and analyses needed to confidently evaluate the<br />

new tools in mixed microbial detection.<br />

135

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!