Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
BUILDING REFERENCE MATERIALS FOR MIXED<br />
MICROBIAL DETECTION WITH NEXT GENERATION<br />
SEQUENCING<br />
Friday, 3rd June 10:30 La Fonda Ballroom Talk (OS‐7.04)<br />
Jason Kralj, Scott Jackson<br />
National Institute of Standards and Technology<br />
Myriad new sequencing and analytical technologies have emerged within the realm of metagenomic<br />
sequencing. This has promised to transform the way clinical and environmental samples are analyzed.<br />
Indeed, short (e.g. PGM Ion Torrent, MiSeq and HiSeq from Illumina) and long (Pacific Biosciences,<br />
Nanopore) read platforms have much to offer in terms of ability to detect the faintest traces of pathogens<br />
within a complex mixture of organisms. In addition, scores of post‐sequencing analysis software packages<br />
offer a range of speed and sensitivity options to analyze the large data files, each with unique algorithms,<br />
databases, and interfaces. All this choice presents a serious question—does this particular analysis set work<br />
for my sample? While it is inconceivable that one could image the entire scope of experiments and sample<br />
types for microbial systems, there are a few basic analyses that would lend confidence to the abilities of these<br />
emerging tools.<br />
We have endeavored to develop mixtures of microbial DNA with the purpose of identifying lim‐ its of<br />
detection, contamination, bias, noise, and general strengths/weaknesses of various shotgun metagenomics<br />
sequencing platfor Purified DNA represents an ideal condition that can be tightly controlled and analyzed,<br />
without other confounding pre‐analytical variabilities such as extraction efficiency. Preliminary<br />
experiments using NIST reference materials (RMs) and prospective RMs have allowed us to make idealized<br />
mixtures of human, S. aureus, S. enterica subsp. enterica LT2,<br />
P. aeruginosa, and C. sporogenes. Each well‐defined mixture (one equigenomic, the other a dilution<br />
series) demonstrated the breadth of results from multiple analysis tools, and enable the end user to<br />
recognize the limits of these syste<br />
The results indicated that at DNA “concentrations” approaching 1:10 000 (in a background of<br />
human DNA), or a few thousand total reads, detection was not challenging. However, accurate<br />
identification highly depended upon the underlying database used for each tool. Furthermore,<br />
some tools’ filtering algorithms accurately discriminate accurate reads from reads containing<br />
significant numbers of sequencing errors, reducing the number of false positives. With this<br />
knowledge, we’ve proposed in silico analyses that will utilize reference data sets to inform new<br />
mixture designs that will provide benchmark analyses for the informatics tools. Meanwhile, we<br />
are expanding our suite of organisms to provide greater breadth of genomic characteristics (e.g.<br />
near neighbors, high/low G+C content, repetitive sequences) to stress test the sequencing<br />
pipeline. Together, these will enable developers to better ascertain the capabilities of their<br />
systems, and give regulatory agencies the tools and analyses needed to confidently evaluate the<br />
new tools in mixed microbial detection.<br />
135