Abstracts

More documents

Recommendations

Info

Poster <strong>Abstracts</strong> ily integrate rich epidemiologic data, providing a barrier to wider use by public health workers. Canada’s Integrated Rapid Infectious Disease Analysis (IRIDA) web-based bioinformatics platform provides a model approach that aims to address these issues and complement other international bioinformatics pipelines for genomic epidemiology. The IRIDA development team comprises five interconnected working groups: 1) Ontology and Database; 2) Microbial Typing; 3) Architecture and API; 4) Tools Development; 5) User Experience. Teams are embedded in Canadian national and provincial public health agencies, and in academia, to engage end users and stakeholders during design and implementation phases of the project. IRIDA implements secure storage of WGS data, epidemiological and application metadata, data analysis pipelines, visualization of results, and a federated data sharing model intended to facilitate secure communication within and between provincial and federal public health institutions in Canada. Metadata is encoded in an application ontology following community recognized standards and extending existing OBO domain ontologies (http:// www.obofoundry.org/) to promote interoperability. Data analysis pipelines and execution provenance are transparently implemented using Galaxy, and federated data sharing and analysis is realized with a common REST API across platform instances. Data analysis tools being further developed include SNV- Phyl for phylogeographic analysis, in silico microbial typing capability, and IslandViewer/ visualization tools for antimicrobial resistance, virulence factor, and genomic island analysis. Linkage with other international genomic epidemiology initiatives, involving public genomic data release with more limited metadata, is also envisaged. A publicly available academic version of IRIDA that does not provide access to potentially sensitive epidemiologic metadata will provide IRIDA’s analysis tools for wider research use. An initial IRIDA version is being tested in the public health environment using current outbreak data, enabling further refinement of ontology and tool development. IRIDA is free, open-source software that may be a useful platform for other countries with autonomous health regions that wish to empower their public health workers’ genomic analysis capabilities. See http://www.irida.ca for more information. n 9 PIPELINE FOR THE AUTOMATIC IDENTIFICATION OF PATHOGENS A. Andrusch, P. Dabrowski, A. Nitsche; Robert Koch Institute, Berlin, GERMANY. Special diagnostics of infectious diseases nowadays rely on the gold standard of nucleic acid detection that is the polymerase chain reaction (PCR). The specific detection by PCR has its limit in that every pathogen has to be tested for with an own assay. This limitation can be overcome by adopting the molecular open view capacities that next-generation sequencing (NGS) can provide. NGS-based methods allow for the representative sequencing of all nucleic acids contained in clinical samples, enabling the downstream analysis of all generated reads for various known pathogens at once. This comes at the price of necessary filtering steps for the removal of background reads originating from the patient. Beyond that, NGS cannot only extend the diagnostic possibilities provided by PCR, but also serve as a stepping stone in the detection of hitherto unknown and novel pathogens. The ‘Pipeline for the Automatic Identification of Pathogens’ (PAIPline) presented here, is a new complete workflow for the pathogen search in NGS datasets. It includes several steps for the preprocessing and quality control of the raw data to ensure that only information-rich reads are evaluated. It furthermore includes steps for the assignment of reads to their respective taxons based on reliable, established referencebased algorithms. Filtering of background reads, contaminants and organisms of low interest as well as the evaluation of ambiguous read information is automatically done before 44 ASM Conferences
Poster <strong>Abstracts</strong> the results are presented. Analysis results are shown in a highly accessible manner, allowing the user to gain a quick overview as well as permitting deep analysis. The performance of the PAIPline was benchmarked on real and artificial datasets of known compositions and compared to competing tools. The results and discussed features show that the presented approach is a viable strategy for the identification of pathogen sequences in NGS datasets. n 10 SEPARATION OF FOREGROUND AND BACKGROUND READS IN MIXED NGS DATASETS S. Tausch, A. Nitsche, B. Renard, P. Dabrowski; Robert Koch Institute, Berlin, GERMANY. NGS is a valuable technology for rapid and indepth analysis of clinical samples, as it allows sequencing of a pathogen’s whole genome directly from patient material within as little as 26 hours. However, the follow-up analysis is severely slowed down by the abundance of reads originating from the host. Thus, in order to exploit the full potential of the technology for rapid diagnostics, a method for rapid in silico removal of host reads is necessary. Commonly, a mapping-based approach is used to separate reads: either reads mapping to a background reference or reads not mapping to a foreground reference are discarded. However, while the former approach is highly specific in discarding only true background reads and the latter is highly sensitive in only keeping foreground reads, neither offers a good balance. Hence we have aimed at developing a novel tool specifically geared towards both specific and sensitive separation of foreground and background reads. In order to determine whether a read belongs to the foreground or the background, we train markov chains of an order k from 4 to 12 on user-provided sets of foreground and background reference sequences, where each state is a k-mer of length k and each transition is one of the four possible bases A, C, G and T. We then calculate the difference of log likelihoods of each transition observed within a read with regards to the foreground and the background markov chains. This difference is then used as a score for the separation of reads, with scores smaller than 0 indicating a background read and scores larger than 0 indicating a foreground read. We have tested our tool on several datasets, including Cowpoxvirus sequenced from a human host. In all cases, our tool was faster than any competing tool (achieving speeds of up to 10 Megabases/second using 4 CPUs), including Kraken and mapping via bowtie2. At the same time, we consistently achieved the best F-Score of all tested tools. Our tool is developed in python and java and available for download from http://sourceforge.net/projects/ rambok/ We have developed a freely available, easy to use, rapid and both highly sensitive and specific tool for the separation of foreground and background reads in mixed NGS datasets. We believe that this will be highly useful as an initial filtering step for anyone analyzing viral sequences via NGS. n 11 A RAPID AND SCALABLE SINGLE NUCLEOTIDE POLYMORPHISM DISCOVERY AND VALIDATION PIPELINE FOR OUTBREAK INVESTIGATION OF BACTERIAL PATHOGENS B. Rusconi 1 , A. L. Rodriguez 2 , S. S. Koenig 1 , M. Eppinger 1 ; 1 University of Texas at San Antonio - South Texas Center For Emerging Infectious Diseases (STCEID), San Antonio, TX, 2 University of Texas at San Antonio -Computational Biology Initiative, San Antonio, TX. Background: Assuring a timely and effective response in the control of bacterial outbreaks is challenging, as discriminatory power becomes of particular importance to distinguish outbreak isolates that form tight clonal complexes with only few genetic polymorphisms. The increase of throughput and concomitant ASM Conference on Rapid Next-Generation Sequencing and Bioinformatic Pipelines for Enhanced Molecular Epidemiologic Investigation of Pathogens 45
Page 1 and 2: Final Program and Abstracts ASM Con
Page 3 and 4: Table of Contents ASM Conferences I
Page 5 and 6: Program Committee Marc Allard U.S.
Page 7 and 8: Travel Grants ASM STUDENT TRAVEL GR
Page 9 and 10: Scientific Program Friday, Septembe
Page 11 and 12: 2:30 - 2:45 pm Integrating Core Gen
Page 13 and 14: Scientific Program 10:45 - 11:15 am
Page 15 and 16: Scientific Program 4:00 - 4:30 pm M
Page 17 and 18: Oral Presentation Abstracts n S1:3
Page 19 and 20: Oral Presentation Abstracts world r
Page 21 and 22: Oral Presentation Abstracts governm
Page 23 and 24: Oral Presentation Abstracts laborat
Page 27 and 28: Oral Presentation Abstracts all of
Page 29 and 30: Oral Presentation Abstracts on orga
Page 31 and 32: Oral Presentation Abstracts automat
Page 35 and 36: Oral Presentation Abstracts or para
Page 37 and 38: Oral Presentation Abstracts 4 diffe
Page 39 and 40: Oral Presentation Abstracts diagnos
Page 41 and 42: Poster Abstracts n 1 A BIOSURVEILLA
Page 43 and 44: Poster Abstracts comparative analys
Page 45: Poster Abstracts Maryland School of
Page 49 and 50: Poster Abstracts the inferred trans
Page 51 and 52: Poster Abstracts tories. Due to its
Page 53 and 54: Poster Abstracts route reconstructi
Page 55 and 56: Poster Abstracts comparable to a re
Page 57 and 58: Poster Abstracts n 24 MOLECULAR CHA
Page 59 and 60: Poster Abstracts and fosfomycin-res
Page 61 and 62: Poster Abstracts loads commonly see
Page 63 and 64: Poster Abstracts n 32 KLEBSIELLA PN
Page 65 and 66: Poster Abstracts that resolved clin
Page 67 and 68: Poster Abstracts to cluster strains
Page 69 and 70: Poster Abstracts evolution. Determi
Page 71 and 72: Poster Abstracts n 43 CORRELATING P
Page 73 and 74: Poster Abstracts bacteria. However,
Page 75 and 76: Poster Abstracts averaged 11.3% (28
Page 77 and 78: Poster Abstracts diversity. The div
Page 79 and 80: Poster Abstracts cury. The numbers
Page 81 and 82: Poster Abstracts n 58 SHORT TANDEM
Page 83 and 84: Poster Abstracts lent, although thi
Page 85 and 86: Poster Abstracts tive comparator fo
Page 87 and 88: Poster Abstracts n 68 COMPARATIVE P
Page 89 and 90: Poster Abstracts isolates with vary
Page 91 and 92: Poster Abstracts to Salmonella to e
Page 93 and 94: Poster Abstracts gonococcal infecti
Page 95 and 96: Poster Abstracts sess the species d
Page 97 and 98:
Poster Abstracts is a value that ca
Page 99 and 100:
Poster Abstracts when compared with
Page 101 and 102:
Poster Abstracts pulsed-field gel e
Page 103 and 104:
Poster Abstracts able area for inte
Page 105 and 106:
Poster Abstracts n 93 NEXT-GENERATI
Page 107 and 108:
Poster Abstracts species. The compo
Page 109 and 110:
Poster Abstracts pathogens. We’ll
Page 111 and 112:
Poster Abstracts ing information as
Page 113 and 114:
Poster Abstracts tive and recommend
Page 115 and 116:
Deng, X. S4:4 de Pinna, E. 83, S2:5
Page 117 and 118:
Musser, K. A. 74 Musser, K. A. 79 M
Page 119:
American Society for Microbiology 1
show all

Abstracts

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?