Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
VISUAL ANALYTIC TOOL FOR INVESTIGATING<br />
UNEXPLAINED RESPIRATORY DISEASE OUTBREAKS<br />
THROUGH THE USE OF TARGETED AMPLICON<br />
RE-SEQUENCING<br />
Wednesday, 1st June 20:00 La Fonda Mezzanine (2nd Floor) Poster (PS‐2b.05)<br />
Shatavia Morrison, Heta Desai, Bernard Wolff, Alvaro Benitez, Maureen Diaz, Jonas Winchell<br />
Centers for Disease Control and Prevention<br />
The infectious etiology of respiratory outbreaks remains unidentified in ~50% of Unexplained<br />
Respiratory Diseases Outbreaks (URDO) investigated by CDC, even when traditional multi‐pathogen<br />
detection methods, such as real‐time PCR are used. The current URDO panel provides the basic<br />
level of detection (presence vs. absence) for a known set of respiratory etiologic agents. By transforming<br />
the foundation of this approach to a next generation sequencing‐based (NGS) method, we<br />
are able to generate a more comprehensive data set for all targeted agents that may be present in<br />
clinical specimens, along with the ability to identify novel or rare pathogens. A continued challenge<br />
in the “–omics” assays is the data analysis step. Existing metagenomics visualization tools are<br />
either too convoluted for user interpretation, non‐interactive, or lack the ability to display parent‐child<br />
taxonomy relationships.<br />
We present an interactive visual analytic tool that allows for analyses of multiple metagenomics<br />
datasets simultaneously in order to identify the etiologic agent(s) during URDO outbreaks. By using<br />
this tool with in‐house mock clinical samples, we were able to detect the seeded organisms. These<br />
datasets were generated with the Illumina platform. After sequence read data cleansing, Kraken was<br />
used to assign taxonomic labels using a k‐mer based approach. The Kraken output was used as<br />
the input into the URDO visualization. This visualization has two components: (i) the back‐end<br />
infrastructure and (ii) the front‐end web‐visualization component. The back‐end consists of a MySQL<br />
database on a MAMP web server and PHP scripts. The MySQL database contains the parent‐child<br />
relationships for all the bacteria and virus taxonomy lineages classified in NCBI and PHP scripts<br />
that allow for the communication between the back‐end and front‐end visualization. The front‐end<br />
visualization was built with HTML5 and D3.js. The D3.js is a data‐driven framework that allows for<br />
easy manipulation of text data into visual glyphs. It allows the user to upload multiple files and<br />
generate independent visual interpretations for each dataset as well as generate a cumulative view of<br />
all samples to allow for inter‐level comparisons. The web page navigation is based on taxonomy<br />
classification hierarchical schema. This allows the user to select at which taxonomy level they would<br />
like to begin their analysis. The visualization is divided into two visual panes for easy interpretation<br />
of parent‐child relationships. Using this approach we are able to display many‐ to‐many comparisons.<br />
The size of the bubbles represent the number of reads associated with each organism. The user is<br />
able to hover over the bubble glyph to retrieve data such as the percentage of reads represented in<br />
the dataset. The summative view allows the user to highlight a specific classification unit and its<br />
corresponding units will highlight across samples to make it easy for the user to identify any specific<br />
classification group of interest.<br />
This tool is useful for exploring metagenomics datasets. The framework joins scalable technologies<br />
together to make it modular to incorporate additional characterization features such as antibiotic<br />
resistance features, serogrouping, or receptor use. Ultimately, this tool will vastly improve URDO<br />
investigations.<br />
93