Data integration in microbial genomics ... - Jacobs University
CHAPTER 3
CDINFUSION
Submission-ready, on-line Integration of sequence
and contextual data
Authors: Wolfgang Hankeln, Norma Johanna Wendel, Jan Gerken,
Jost Waldmann, Pier Luigi Buttigieg, Ivaylo Kostadinov, Renzo
Kottmann, Pelin Yilmaz, Frank Oliver Glöckner
Submitted to: PLoS ONE, April 2011
Personal Contribution: Developed and implemented CDinFusion
together with Norma Johanna Wendel, Jan Gerken and Jost Waldmann.
Wrote the initial manuscript.
Relevance: To provide the life science community with a tool to<br />
enrich sequence data with contextual data prior to submission to the<br />
INSDC.<br />
3.1 Abstract<br />
State-of-the-art (DNA) sequencing methods applied in "Omics" studies
grant insight into the 'blueprints' of organisms from all domains of
life. Sequencing is carried out around the globe and the data are submitted
to the public repositories of the International Nucleotide Sequence
Database Collaboration (INSDC). However, the context in which these
studies are conducted often gets lost, because experimental data, as
well as information about the environment, are rarely submitted along
with the sequence data. If these contextual data, or metadata, are missing,
key opportunities for comparison and analysis across studies and habitats
are hampered or even lost. To address this problem, the