01.06.2016 Views

Sequencing

SFAF2016%20Meeting%20Guide%20Final%203

SFAF2016%20Meeting%20Guide%20Final%203

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

11th Annual <strong>Sequencing</strong>, Finishing, and Analysis in the Future Meeting<br />

ANALYSIS OF THE FUNCTIONAL CONTENTS OF<br />

MICROBIAL COMMUNITIES USING A NOVEL QIAGEN<br />

BIOINFORMATICS PIPELINE<br />

Thursday, 2nd June 16:50 La Fonda Ballroom Tech Talk (TT‐2.05)<br />

Andreas Sand Pedersen, Francesco Strino, Aske Simon Christensen, Martin Bundgaard<br />

Qiagen Bioinformatics<br />

The field of microbial ecology is currently being revolutionized through next generation sequencing<br />

studies, revealing unprecedented detailed insight into environmental as well as host‐associated<br />

microbiomes. But still, very few tools offer efficient and user‐friendly analysis of the functional genomic<br />

contents of microbiomes.<br />

We have implemented a unique pipeline of tools for functional and comparative analysis of microbiomes<br />

as a component in QIAGEN Bioinformatics’ comprehensive solution for microbial genomics<br />

‐ CLC Microbial Genomics Module. First step in this pipeline produces high‐quality metagenome<br />

contigs using a novel fast and highly memory efficient metagenome de novo assembler. The resulting<br />

contigs are subsequently CDS‐annotated using the MetaGeneMark plugin for CLC Genomics Workbench,<br />

and CDSs are annotated with PFAM protein families, GO terms and/or top BLAST hits<br />

(using e.g. UniProt). Finally, the relative abundance of each identified functional genomic element<br />

is computed and presented to the user in a user‐friendly manner.<br />

The functional contents of individual samples can be studied in a tabular format or visualized as<br />

pie charts, while visual comparison of multiple samples is done using stacked bar charts, layered<br />

area charts and principal coordinate plots. Finally, statistical comparison of multiple microbiome<br />

samples can be done to back‐up our observations. All visualisations and statistical analysis can be<br />

performed in the context of user‐defined meta‐information such as patient/control data.<br />

We will present benchmarks of this pipeline, performed on both previously published mock community<br />

datasets and several recently published simulated microbiome datasets, designed for benchmarking<br />

of microbiome analysis solutions.<br />

We demonstrate that QIAGEN Bioinformatics’ new metagenome de novo assembler runs as fast as the<br />

fastest alternative (MegaHit) but uses only a fraction of the memory required by any other<br />

benchmarked de novo assembler. At the same time it produces contigs that are both longer than<br />

the contigs produced by other popular metagenome assemblers (e.g. MegaHit and IDBA_UD) and<br />

are of significantly higher quality than the contigs produced by any other benchmarked de novo<br />

assembler.<br />

We furthermore demonstrate that the high‐quality contigs produced by our metagenome de novo<br />

assembler enables taxonomic and functional profiling of very high quality.<br />

And finally, we show that the complete analysis pipeline producing functional profiles from raw<br />

microbiome read datasets in five easy steps is able to reliably detect changes in the functional<br />

content of microbiomes with statistical significance.<br />

122

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!