Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
IMPORTANCE OF WGS FOR IDENTIFYING<br />
ENVIRONMENTAL SOURCES OF LEGIONELLA<br />
PNEUMOPHILA DURING OUTBREAK INVESTIGATIONS<br />
OF LEGIONNAIRE’S DISEASE<br />
Thursday, 2nd June 11:40 La Fonda Ballroom Talk (OS‐5.04)<br />
Brian Raphael, Shatavia Morrison, Jeffrey Mercante, Natalia<br />
Kozak Muiznieks, Jonas Winchell<br />
Centers for Disease Control and Prevention<br />
Legionnaire’s disease (LD) is a severe pneumonia caused by various Legionellae (most commonly L. pneumophila)<br />
which can colonize man‐made water systems and cause infections in humans when aerosolized. Confirming the<br />
environmental sources associated with an outbreak is important for controlling disease transmission. We evaluated<br />
the utility of whole genome sequencing (WGS) com‐ pared to other available subtyping methods in order to better<br />
understand genetic relationships among strains isolated during LD outbreak investigations. We also assessed<br />
various WGS analysis methods to determine their ability to cluster outbreak‐related isolates.<br />
L. pneumophila is a genetically diverse species consisting of multiple serogroups (sg) and three known subspecies;<br />
subsp. pneumophila, subsp. fraseri, and subsp. pascullei. In order to more fully understand differences in gene<br />
content among these strains, our laboratory has generated three complete genome sequences of L. pneumophila<br />
subsp. pascullei strains isolated nearly 30 years apart from the same facility and compared these to other previously<br />
determined L. pneumophila subsp. sequences. Although these isolates switched from sg5 to sg1, we identified<br />
minimal changes in gene content other than variation in the LPS biosynthesis region.<br />
L. pneumophila subtyping has relied on Sequence Based Typing (SBT) where the nucleotide se‐ quences of seven<br />
loci are utilized to generate a Sequence Type (ST). ST1 is the most common ST associated with sporadic LD cases.<br />
We demonstrated that outbreak‐specific clades can be detected among draft genomes of ST1 isolates (N=50)<br />
representing different LD outbreaks, sporadic clinical isolates, and various environmental sources using core gene<br />
SNP analysis and whole genome MLST.<br />
When sequences of isolates within a suspected outbreak cluster were examined, average nucleotide identity and<br />
distance estimation using the MinHash technique (Mash) proved to be the most rapid initial method of identifying<br />
outbreak‐associated strains. Moreover, a comparison of isolates from 10 separate LD investigations occurring in<br />
New York State revealed that WGS analysis provided superior resolution of isolates compared to pulsed‐field gel<br />
electrophoresis. Extraction of SBT loci sequences in silico was only partially successful due in large part to the<br />
presence of paralogous sequences of one allele (mompS).<br />
Ideally, the genomes of Legionella spp. present in clinical specimens or environmental sources will need to be<br />
sequenced and analyzed without the time‐consuming isolation of this organism in order to realize the full potential<br />
of WGS for rapidly informing public health decision making. As a first step toward this goal, we examined archived<br />
water samples obtained from 3 separate LD investigations. While large differences in the abundances of various<br />
bacterial taxa were observed using 16S rDNA amplicon sequencing, Legionella‐specific sequences, when present,<br />
were consistently found to be