01.06.2016 Views

Sequencing

SFAF2016%20Meeting%20Guide%20Final%203

SFAF2016%20Meeting%20Guide%20Final%203

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />

VISUAL ANALYTIC TOOL FOR INVESTIGATING<br />

UNEXPLAINED RESPIRATORY DISEASE OUTBREAKS<br />

THROUGH THE USE OF TARGETED AMPLICON<br />

RE-SEQUENCING<br />

Wednesday, 1st June 20:00 La Fonda Mezzanine (2nd Floor) Poster (PS‐2b.05)<br />

Shatavia Morrison, Heta Desai, Bernard Wolff, Alvaro Benitez, Maureen Diaz, Jonas Winchell<br />

Centers for Disease Control and Prevention<br />

The infectious etiology of respiratory outbreaks remains unidentified in ~50% of Unexplained<br />

Respiratory Diseases Outbreaks (URDO) investigated by CDC, even when traditional multi‐pathogen<br />

detection methods, such as real‐time PCR are used. The current URDO panel provides the basic<br />

level of detection (presence vs. absence) for a known set of respiratory etiologic agents. By transforming<br />

the foundation of this approach to a next generation sequencing‐based (NGS) method, we<br />

are able to generate a more comprehensive data set for all targeted agents that may be present in<br />

clinical specimens, along with the ability to identify novel or rare pathogens. A continued challenge<br />

in the “–omics” assays is the data analysis step. Existing metagenomics visualization tools are<br />

either too convoluted for user interpretation, non‐interactive, or lack the ability to display parent‐child<br />

taxonomy relationships.<br />

We present an interactive visual analytic tool that allows for analyses of multiple metagenomics<br />

datasets simultaneously in order to identify the etiologic agent(s) during URDO outbreaks. By using<br />

this tool with in‐house mock clinical samples, we were able to detect the seeded organisms. These<br />

datasets were generated with the Illumina platform. After sequence read data cleansing, Kraken was<br />

used to assign taxonomic labels using a k‐mer based approach. The Kraken output was used as<br />

the input into the URDO visualization. This visualization has two components: (i) the back‐end<br />

infrastructure and (ii) the front‐end web‐visualization component. The back‐end consists of a MySQL<br />

database on a MAMP web server and PHP scripts. The MySQL database contains the parent‐child<br />

relationships for all the bacteria and virus taxonomy lineages classified in NCBI and PHP scripts<br />

that allow for the communication between the back‐end and front‐end visualization. The front‐end<br />

visualization was built with HTML5 and D3.js. The D3.js is a data‐driven framework that allows for<br />

easy manipulation of text data into visual glyphs. It allows the user to upload multiple files and<br />

generate independent visual interpretations for each dataset as well as generate a cumulative view of<br />

all samples to allow for inter‐level comparisons. The web page navigation is based on taxonomy<br />

classification hierarchical schema. This allows the user to select at which taxonomy level they would<br />

like to begin their analysis. The visualization is divided into two visual panes for easy interpretation<br />

of parent‐child relationships. Using this approach we are able to display many‐ to‐many comparisons.<br />

The size of the bubbles represent the number of reads associated with each organism. The user is<br />

able to hover over the bubble glyph to retrieve data such as the percentage of reads represented in<br />

the dataset. The summative view allows the user to highlight a specific classification unit and its<br />

corresponding units will highlight across samples to make it easy for the user to identify any specific<br />

classification group of interest.<br />

This tool is useful for exploring metagenomics datasets. The framework joins scalable technologies<br />

together to make it modular to incorporate additional characterization features such as antibiotic<br />

resistance features, serogrouping, or receptor use. Ultimately, this tool will vastly improve URDO<br />

investigations.<br />

93

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!