18.09.2015 Views

Abstracts

ngsfinalprogram

ngsfinalprogram

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Oral Presentation <strong>Abstracts</strong><br />

utility of Salmonella serotyping when integrated<br />

into a platform of WGS-based pathogen<br />

subtyping and characterization.<br />

n S4:5<br />

PATRIC PIPELINE<br />

F. Xia 1 , T. Brettin 2 , S. Boisvert 2 , N. R. Conrad 2 ,<br />

J. J. Davis 1 , T. Disz 1 , J. Edirisinghe 2 , R. A. Edwards<br />

3 , C. Henry 1 , R. W. Kenyon 4 , D. Machi 4 ,<br />

C. Mao 4 , G. J. Olsen 5 , R. Olson 2 , R. Overbeek 6 ,<br />

B. Parrello 6 , G. D. Pusch 6 , M. P. Shukla 2 , B. W.<br />

Sobral 4 , R. L. Stevens 1 , V. Vonstein 6 , A. Warren<br />

4 , R. Will 4 , H. Yoo 4 , A. R. Wattam 4 ;<br />

1<br />

University of Chicago, Chicago, IL, 2 Argonne<br />

National Laboratory, Lemont, IL, 3 San Diego<br />

State University, San Diego, CA, 4 Virginia<br />

Tech, Blacksburg, VA, 5 University of Illinois<br />

at Urbana and Champaign, Urbana, IL, 6 Fellowship<br />

for Interpretation of Genomes, Burr<br />

Ridge, IL.<br />

Recent advances in DNA sequencing technology<br />

accompanied by plummeting per-base cost<br />

is making sequence-based applications more<br />

amenable. While a plethora of bioinformatics<br />

databases and workflows exist, their capabilities<br />

are often hampered by the inconsistent<br />

use of analysis tools. PATRIC, the NIAIDfunded<br />

comprehensive bacterial bioinformatics<br />

resource, has integrated more than 30,000<br />

consistently annotated prokaryote genomes<br />

with a focus on human pathogenic species.<br />

Here we present PATRIC’s new computational<br />

services that support the assembly, annotation<br />

and metabolic modeling of user-supplied<br />

genomes in the same consistent fashion. These<br />

services, integrated with PATRIC’s collections<br />

of specialty genes such as antibiotic resistance<br />

determinants and virulence factors, will enable<br />

users to rapidly process newly sequenced<br />

pathogens and investigate key pathogenic<br />

determinants in foodborne outbreaks using the<br />

powerful visualization and comparative analysis<br />

tools in PATRIC. We have implemented<br />

the new services with three principles in mind.<br />

(1) Controlled vocabulary. At the heart of<br />

PATRIC’s annotation service is a controlled<br />

vocabulary for functional annotation derived<br />

from the curated subsystems and protein families<br />

in the RAST and SEED systems [2]. Similarly,<br />

the new model reconstruction service<br />

relies on our curated biochemistry data [5].<br />

These curation efforts ensure newly sequenced<br />

genomes can be automatically annotated and<br />

modeled and readily compared with existing<br />

reference data. (2) Modular design. In genome<br />

assembly as well as other bioinformatic analyses,<br />

there is often no single tool best suited<br />

for all occasions [4]. We have added support<br />

for more than 30 tools for error correction,<br />

contig assembly, scaffolding, contig evaluation,<br />

consensus building, gene calling, overlap<br />

removal, as well as many custom algorithms<br />

[3]. These modules are condensed into a few<br />

curated workflows to ensure convenient and<br />

efficient execution as well as consistent quality<br />

control. (3) Integrated analysis. The new<br />

workspace allows users to upload their own<br />

data for analysis, and upon completion the<br />

private results are immediately integrated into<br />

PATRIC. This enables users to take advantage<br />

of PATRIC’s data (drug targets, omics, AMR<br />

and other clinical metadata) and comparative<br />

tools (protein family sorter, phylogeny, heat<br />

maps, etc). In addition to these services, we<br />

are actively building support for batch analysis<br />

and SNP-level comparative analysis for closely<br />

related genomes. URL: https://www.patricbrc.<br />

org. References: [1] Gillespie, et al. “PATRIC:<br />

... (2011). [2] Overbeek, et al. “The SEED<br />

... (RAST).” Nucleic acids research (2014):<br />

D206-D214. [3] Brettin, et al. “RASTtk ...”<br />

Scientific reports 5 (2015). [4] Earl, et al. “Assemblathon<br />

...” Genome research (2011). [5]<br />

Henry, et al. “High-throughput ... models.”<br />

Nature biotechnology (2010).<br />

22<br />

ASM Conferences

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!