Data integration in microbial genomics ... - Jacobs University
Data integration in microbial genomics ... - Jacobs University
Data integration in microbial genomics ... - Jacobs University
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
52 3. CD<strong>in</strong>Fusion<br />
and upload it with the Tomcat manager. Further <strong>in</strong>stallation details<br />
can be found <strong>in</strong> the README.txt file that is <strong>in</strong>cluded <strong>in</strong> the source<br />
bundle and that is also available <strong>in</strong> the documentation section of the<br />
CD<strong>in</strong>Fusion web page. On some platforms the CATALINA HOME<br />
environment variable needs to be set, <strong>in</strong> order for CD<strong>in</strong>Fusion to write<br />
and read files. Relative to the path specified, CD<strong>in</strong>Fusion will create<br />
a “data” folder, where temporary files will be saved. The application<br />
has been tested on Debian GNU L<strong>in</strong>ux <strong>in</strong>stallations, but should be<br />
platform-<strong>in</strong>dependent and run on all platforms that support Java and<br />
Apache Tomcat <strong>in</strong>stallation such as W<strong>in</strong>dows TM or MAC OS TM .<br />
Availability and Future Directions<br />
The public <strong>in</strong>stallation of CD<strong>in</strong>Fusion is hosted and ma<strong>in</strong>ta<strong>in</strong>ed at the<br />
Microbial Genomics and Bio<strong>in</strong>formatics Group (MGG) of the Max<br />
Planck Institute of Mar<strong>in</strong>e Microbiology Bremen and accessible under:<br />
http://www.megx.net/cd<strong>in</strong>fusion. The source code is available<br />
under GNU LGPL 3 and deposited <strong>in</strong> a public repository: http:<br />
//code.google.com/p/cd<strong>in</strong>fusion.<br />
As open source software it is the <strong>in</strong>tention of the MGG to support this<br />
software well <strong>in</strong>to the future. Currently CD<strong>in</strong>Fusion supports submission<br />
of CD enriched sequence data to the INSDC us<strong>in</strong>g Sequ<strong>in</strong> and<br />
tbl2asn for large data sets. Support for <strong>in</strong>stallations outside the MPI<br />
cannot be granted. The direct submission to EMBL/ENA and DDBJ<br />
is planned. Furthermore the <strong><strong>in</strong>tegration</strong> of GCDML [Kottmann et al.,<br />
2008] as an exchange format would be advantageous. The GSC and<br />
life science community is encouraged to download the source code and<br />
to modify and extend the software to make it even more useful.<br />
Acknowledgments Thanks to S. Emil Ruff and the Geotechnologien<br />
project COMET/MUMM II (03G0608A, BMBF) for provid<strong>in</strong>g<br />
unpublished data and for the beta-test<strong>in</strong>g. Thanks to Michael Richter<br />
for genome data and beta-test<strong>in</strong>g. Thanks to Rudolf Amann, Bernhard<br />
Fuchs and Hanno Teel<strong>in</strong>g for the MIMAS data. Thanks to Jens Harder,<br />
Richard Hahnke and the M.Sc. Mar<strong>in</strong>e Microbiology (MarMic) class<br />
of 2014 for data and beta-test<strong>in</strong>g.