18.09.2015 Views

Abstracts

ngsfinalprogram

ngsfinalprogram

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Poster <strong>Abstracts</strong><br />

when compared with hqSNPs inferred on<br />

uncorrected reads. In a cluster of Salmonella<br />

Paratyphi B var. (L+) tartrate+, the Bayes-<br />

Hammer-cleaned reads and FastX-trimmed<br />

reads yielded 314 and 311 informative hqSNP<br />

positions, respectively. In the same set, the<br />

uncorrected reads yielded 292 informative<br />

hqSNP positions, unexpectedly outperforming<br />

the Quake-cleaned reads with 241 hqSNP<br />

positions. In the fifth data set, a cluster of E.<br />

coli O157:H7, FastX-trimmed reads yielded<br />

66 hqSNP positions and Quake-cleaned reads<br />

yielded 65 hqSNP positions. BayesHammercleaned<br />

reads and uncorrected reads both performed<br />

worse for the O157:H7 set, with 61 and<br />

59 hqSNP postions, respectively. In the Enteritidis<br />

and Baildon sets, BayesHammer-corrected<br />

reads yielded hqSNP phylogenies with<br />

median bootstrap support values, 61% and<br />

24%, respectively, across all internal branches<br />

of each tree. By comparison, FastX-trimmed or<br />

untrimmed reads had a median bootstrap support<br />

of at least 49% for the Baildon cluster and<br />

0% bootstrap support for the Enteritidis cluster.<br />

For the Paratyphi B hqSNP phylogeny, median<br />

bootstrap values were 100% across all four<br />

read-trimming methods, but ranged from 1%<br />

support in the untrimmed tree up to 20% support<br />

in the FastX tree. Quake-corrected reads<br />

yielded hqSNP trees with the highest median<br />

bootstrap values for the Salmonella Newport<br />

cluster (median support = 51%) and the E. coli<br />

O157 cluster (median support =27%). These<br />

results indicate that BayesHammer enables<br />

discovery of the largest numbers of hqSNPs<br />

and while BayesHammer and Quake both perform<br />

well for inferring hqSNPs phylogenetic<br />

trees. Yet, as Quake may decrease the counts<br />

of phylogenetically-informative SNP positions,<br />

BayesHammer and FastX are likely the best<br />

first-pass cleaning/correction tools for hqSNPs<br />

pipelines.<br />

n 85<br />

EVALUATION OF WHOLE GENOME<br />

SEQUENCING TO CONFIRM OR REFUTE<br />

CLONALITY OF CONVENTIONAL GENOTYPE-<br />

DEFINED CLUSTERS OF MYCOBACTERIUM<br />

TUBERCULOSIS<br />

L. Cowan, J. Posey;<br />

Centers for Disease Control and Prevention,<br />

Atlanta, GA.<br />

Since 2004, the Division of Tuberculosis Elimination<br />

has conducted genotyping surveillance<br />

of Mycobacterium tuberculosis. Genotyping<br />

data is integrated with patient demographic<br />

and clinical data and routinely analyzed to<br />

identify suspected outbreaks. However, the<br />

discriminatory power of the current genotyping<br />

methods are sometimes insufficient, and some<br />

suspected outbreaks identified by genotyping<br />

include a mix of outbreak and sporadic cases<br />

or do not represent an outbreak. Genomic surveillance<br />

has the power to increase the accuracy<br />

of outbreak detection systems by analyzing<br />

the entire genome versus less than 1% using<br />

conventional methods. We conducted whole<br />

genome sequencing (WGS) for 20 (number of<br />

isolates per cluster, 10 - 100) suspected large<br />

outbreaks identified by routine genotyping.<br />

Reference-guided assemblies of Illumina sequence<br />

read sets were created using LaserGene<br />

SeqMan NGen (DNAStar). Polymorphisms<br />

between the clustered isolates were identified<br />

by comparing assembled genomes including<br />

coverage statistics read depth and distribution<br />

of base calls for reliable differences. The final<br />

set of polymorphisms were confirmed in each<br />

assembly by visually checking each position in<br />

the mapped reads of the assembly. The number<br />

of single nucleotide polymorphisms (SNPs)<br />

identified for each cluster ranged from 8 to<br />

101. WGS exhibited a higher level of resolution<br />

for each cluster as compared to conventional<br />

genotyping methods and identified cases<br />

that were not involved in recent transmission<br />

among the samples analyzed. The preliminary<br />

results indicate that WGS data could result in<br />

more focused targeting of limited public health<br />

ASM Conference on Rapid Next-Generation Sequencing and Bioinformatic<br />

Pipelines for Enhanced Molecular Epidemiologic Investigation of Pathogens<br />

97

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!