Sequencing
SFAF2016%20Meeting%20Guide%20Final%203
SFAF2016%20Meeting%20Guide%20Final%203
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />
ANALYSIS OF THE FUNCTIONAL CONTENTS OF<br />
MICROBIAL COMMUNITIES USING A NOVEL QIAGEN<br />
BIOINFORMATICS PIPELINE<br />
Thursday, 2nd June 16:50 La Fonda Ballroom Tech Talk (TT‐2.05)<br />
Andreas Sand Pedersen, Francesco Strino, Aske Simon Christensen, Martin Bundgaard<br />
Qiagen Bioinformatics<br />
The field of microbial ecology is currently being revolutionized through next generation sequencing<br />
studies, revealing unprecedented detailed insight into environmental as well as host‐associated<br />
microbiomes. But still, very few tools offer efficient and user‐friendly analysis of the functional genomic<br />
contents of microbiomes.<br />
We have implemented a unique pipeline of tools for functional and comparative analysis of microbiomes<br />
as a component in QIAGEN Bioinformatics’ comprehensive solution for microbial genomics<br />
‐ CLC Microbial Genomics Module. First step in this pipeline produces high‐quality metagenome<br />
contigs using a novel fast and highly memory efficient metagenome de novo assembler. The resulting<br />
contigs are subsequently CDS‐annotated using the MetaGeneMark plugin for CLC Genomics Workbench,<br />
and CDSs are annotated with PFAM protein families, GO terms and/or top BLAST hits<br />
(using e.g. UniProt). Finally, the relative abundance of each identified functional genomic element<br />
is computed and presented to the user in a user‐friendly manner.<br />
The functional contents of individual samples can be studied in a tabular format or visualized as<br />
pie charts, while visual comparison of multiple samples is done using stacked bar charts, layered<br />
area charts and principal coordinate plots. Finally, statistical comparison of multiple microbiome<br />
samples can be done to back‐up our observations. All visualisations and statistical analysis can be<br />
performed in the context of user‐defined meta‐information such as patient/control data.<br />
We will present benchmarks of this pipeline, performed on both previously published mock community<br />
datasets and several recently published simulated microbiome datasets, designed for benchmarking<br />
of microbiome analysis solutions.<br />
We demonstrate that QIAGEN Bioinformatics’ new metagenome de novo assembler runs as fast as the<br />
fastest alternative (MegaHit) but uses only a fraction of the memory required by any other<br />
benchmarked de novo assembler. At the same time it produces contigs that are both longer than<br />
the contigs produced by other popular metagenome assemblers (e.g. MegaHit and IDBA_UD) and<br />
are of significantly higher quality than the contigs produced by any other benchmarked de novo<br />
assembler.<br />
We furthermore demonstrate that the high‐quality contigs produced by our metagenome de novo<br />
assembler enables taxonomic and functional profiling of very high quality.<br />
And finally, we show that the complete analysis pipeline producing functional profiles from raw<br />
microbiome read datasets in five easy steps is able to reliably detect changes in the functional<br />
content of microbiomes with statistical significance.<br />
122