Abstracts

More documents

Recommendations

Info

Oral Presentation <strong>Abstracts</strong> utility of Salmonella serotyping when integrated into a platform of WGS-based pathogen subtyping and characterization. n S4:5 PATRIC PIPELINE F. Xia 1 , T. Brettin 2 , S. Boisvert 2 , N. R. Conrad 2 , J. J. Davis 1 , T. Disz 1 , J. Edirisinghe 2 , R. A. Edwards 3 , C. Henry 1 , R. W. Kenyon 4 , D. Machi 4 , C. Mao 4 , G. J. Olsen 5 , R. Olson 2 , R. Overbeek 6 , B. Parrello 6 , G. D. Pusch 6 , M. P. Shukla 2 , B. W. Sobral 4 , R. L. Stevens 1 , V. Vonstein 6 , A. Warren 4 , R. Will 4 , H. Yoo 4 , A. R. Wattam 4 ; 1 University of Chicago, Chicago, IL, 2 Argonne National Laboratory, Lemont, IL, 3 San Diego State University, San Diego, CA, 4 Virginia Tech, Blacksburg, VA, 5 University of Illinois at Urbana and Champaign, Urbana, IL, 6 Fellowship for Interpretation of Genomes, Burr Ridge, IL. Recent advances in DNA sequencing technology accompanied by plummeting per-base cost is making sequence-based applications more amenable. While a plethora of bioinformatics databases and workflows exist, their capabilities are often hampered by the inconsistent use of analysis tools. PATRIC, the NIAIDfunded comprehensive bacterial bioinformatics resource, has integrated more than 30,000 consistently annotated prokaryote genomes with a focus on human pathogenic species. Here we present PATRIC’s new computational services that support the assembly, annotation and metabolic modeling of user-supplied genomes in the same consistent fashion. These services, integrated with PATRIC’s collections of specialty genes such as antibiotic resistance determinants and virulence factors, will enable users to rapidly process newly sequenced pathogens and investigate key pathogenic determinants in foodborne outbreaks using the powerful visualization and comparative analysis tools in PATRIC. We have implemented the new services with three principles in mind. (1) Controlled vocabulary. At the heart of PATRIC’s annotation service is a controlled vocabulary for functional annotation derived from the curated subsystems and protein families in the RAST and SEED systems [2]. Similarly, the new model reconstruction service relies on our curated biochemistry data [5]. These curation efforts ensure newly sequenced genomes can be automatically annotated and modeled and readily compared with existing reference data. (2) Modular design. In genome assembly as well as other bioinformatic analyses, there is often no single tool best suited for all occasions [4]. We have added support for more than 30 tools for error correction, contig assembly, scaffolding, contig evaluation, consensus building, gene calling, overlap removal, as well as many custom algorithms [3]. These modules are condensed into a few curated workflows to ensure convenient and efficient execution as well as consistent quality control. (3) Integrated analysis. The new workspace allows users to upload their own data for analysis, and upon completion the private results are immediately integrated into PATRIC. This enables users to take advantage of PATRIC’s data (drug targets, omics, AMR and other clinical metadata) and comparative tools (protein family sorter, phylogeny, heat maps, etc). In addition to these services, we are actively building support for batch analysis and SNP-level comparative analysis for closely related genomes. URL: https://www.patricbrc. org. References: [1] Gillespie, et al. “PATRIC: ... (2011). [2] Overbeek, et al. “The SEED ... (RAST).” Nucleic acids research (2014): D206-D214. [3] Brettin, et al. “RASTtk ...” Scientific reports 5 (2015). [4] Earl, et al. “Assemblathon ...” Genome research (2011). [5] Henry, et al. “High-throughput ... models.” Nature biotechnology (2010). 22 ASM Conferences
Oral Presentation <strong>Abstracts</strong> n S4:6 CFSAN SNP PIPELINE: A WHOLE GENOME SEQUENCE DATA ANALYSIS PIPELINE FOR FOOD-BORNE PATHOGENS Y. Luo, J. Pettengill, J. Baugher, H. Rand, S. Davis; FDA/CFSAN, College Park, MD. In support of the analysis of whole genome sequence data (WGS) for closely related pathogens in food-borne outbreaks, the Center for Food Safety and Applied Nutrition (CFSAN) at the FDA has developed a reference-based software pipeline for high quality SNP identification and analysis. This software pipeline combines into a single package the mapping of WGS reads to a reference genome, processing of those mapping files, identification of variant sites, and production of a SNP matrix. Additional features include a summary table of the results, soft-links to minimize data storage, and the ability to switch between workstations and computer clusters with minimal effort. The CF- SAN SNP Pipeline is currently used in production mode to analyze WGS data from isolates related to food-borne illnesses. The pipeline is used when outbreak investigations are ongoing to link samples and to provide information for decision-makers. It is also used retrospectively to aid in the analysis of closed outbreaks. The CFSAN SNP Pipeline is reference-based, and so a reference must be provided. Isolate sequence data must be in fastq format but can either be paired-end or single-read data. All analysis steps are run automatically, and only depend on the proper organization of the input files and identification of a suitable reference. Additionally, each of the analysis steps can be run using individual shell scripts. The addition of new samples is very straightforward, and result files from previous portions of the analysis that do not need to be regenerated are reused. This greatly reduces the computational time when adding new samples as the mapping and pileup steps are not redone. The pipeline will run without problems on current workstations, and will run on high performance computing clusters with either Torque or Grid Engine job schedulers. The CFSAN SNP Pipeline is written in a combination of Bash and Python. The code is designed to run on Linux platforms with bash and python. BioPython must be installed in tandem with three executable software dependencies, Bowtie2, SAMtools, and VarScan. Substantial effort has been devoted to making the software robust, well-documented, and easy to use. The following links provide for access to the source code, the documentation, and the Python package. Also provided is the current publication reference. Source code: https://github.com/CFSAN-Biostatistics/snppipeline. Documentation: http://snp-pipeline. rtfd.org. PyPI package: https://pypi.python. org/pypi/snp-pipeline. Reference publication: Pettengill JB, Luo Y, Davis S, Chen Y, Gonzalez-Escalona N, Ottesen A, Rand H, Allard MW, Strain E An evaluation of alternative methods for constructing phylogenies from whole genome sequence data: A case study with Salmonella. n S4:7 ASSEMBLING WHOLE GENOMES FROM MIXED MICROBIAL COMMUNITIES USING HI-C I. Liachko 1 , J. N. Burton 1 , L. Sycuro 2 , A. H. Wiser 2 , D. N. Fredricks 2 , M. J. Dunham 1 , J. Shendure 1 ; 1 University of Washington, Seattle, WA, 2 Fred Hutchinson Cancer Research Center, Seattle, WA. Assembly of whole genomes from next-generation sequencing is inhibited by the lack of contiguity information in short-read sequencing. This limitation also impedes metagenome assembly, since one cannot tell which sequences originate from the same species within a population. We have overcome these bottlenecks by adapting a chromosome conformation capture technique (Hi-C) for the deconvolution of metagenomes and the scaffolding of de novo assemblies of individual genomes. In modeling the 3D structure of a genome, chromosome ASM Conference on Rapid Next-Generation Sequencing and Bioinformatic Pipelines for Enhanced Molecular Epidemiologic Investigation of Pathogens 23
Page 1 and 2: Final Program and Abstracts ASM Con
Page 3 and 4: Table of Contents ASM Conferences I
Page 5 and 6: Program Committee Marc Allard U.S.
Page 7 and 8: Travel Grants ASM STUDENT TRAVEL GR
Page 9 and 10: Scientific Program Friday, Septembe
Page 11 and 12: 2:30 - 2:45 pm Integrating Core Gen
Page 13 and 14: Scientific Program 10:45 - 11:15 am
Page 15 and 16: Scientific Program 4:00 - 4:30 pm M
Page 17 and 18: Oral Presentation Abstracts n S1:3
Page 19 and 20: Oral Presentation Abstracts world r
Page 21 and 22: Oral Presentation Abstracts governm
Page 23: Oral Presentation Abstracts laborat
Page 27 and 28: Oral Presentation Abstracts all of
Page 29 and 30: Oral Presentation Abstracts on orga
Page 31 and 32: Oral Presentation Abstracts automat
Page 33 and 34: Oral Presentation Abstracts n S7:10
Page 35 and 36: Oral Presentation Abstracts or para
Page 37 and 38: Oral Presentation Abstracts 4 diffe
Page 39 and 40: Oral Presentation Abstracts diagnos
Page 41 and 42: Poster Abstracts n 1 A BIOSURVEILLA
Page 43 and 44: Poster Abstracts comparative analys
Page 45 and 46: Poster Abstracts Maryland School of
Page 47 and 48: Poster Abstracts the results are pr
Page 49 and 50: Poster Abstracts the inferred trans
Page 51 and 52: Poster Abstracts tories. Due to its
Page 53 and 54: Poster Abstracts route reconstructi
Page 55 and 56: Poster Abstracts comparable to a re
Page 57 and 58: Poster Abstracts n 24 MOLECULAR CHA
Page 59 and 60: Poster Abstracts and fosfomycin-res
Page 61 and 62: Poster Abstracts loads commonly see
Page 63 and 64: Poster Abstracts n 32 KLEBSIELLA PN
Page 65 and 66: Poster Abstracts that resolved clin
Page 67 and 68: Poster Abstracts to cluster strains
Page 69 and 70: Poster Abstracts evolution. Determi
Page 71 and 72: Poster Abstracts n 43 CORRELATING P
Page 73 and 74: Poster Abstracts bacteria. However,
Page 75 and 76:
Poster Abstracts averaged 11.3% (28
Page 77 and 78:
Poster Abstracts diversity. The div
Page 79 and 80:
Poster Abstracts cury. The numbers
Page 81 and 82:
Poster Abstracts n 58 SHORT TANDEM
Page 83 and 84:
Poster Abstracts lent, although thi
Page 85 and 86:
Poster Abstracts tive comparator fo
Page 87 and 88:
Poster Abstracts n 68 COMPARATIVE P
Page 89 and 90:
Poster Abstracts isolates with vary
Page 91 and 92:
Poster Abstracts to Salmonella to e
Page 93 and 94:
Poster Abstracts gonococcal infecti
Page 95 and 96:
Poster Abstracts sess the species d
Page 97 and 98:
Poster Abstracts is a value that ca
Page 99 and 100:
Poster Abstracts when compared with
Page 101 and 102:
Poster Abstracts pulsed-field gel e
Page 103 and 104:
Poster Abstracts able area for inte
Page 105 and 106:
Poster Abstracts n 93 NEXT-GENERATI
Page 107 and 108:
Poster Abstracts species. The compo
Page 109 and 110:
Poster Abstracts pathogens. We’ll
Page 111 and 112:
Poster Abstracts ing information as
Page 113 and 114:
Poster Abstracts tive and recommend
Page 115 and 116:
Deng, X. S4:4 de Pinna, E. 83, S2:5
Page 117 and 118:
Musser, K. A. 74 Musser, K. A. 79 M
Page 119:
American Society for Microbiology 1
show all

Abstracts

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?