Computational tools and Interoperability in Comparative ... - CBS
Computational tools and Interoperability in Comparative ... - CBS
Computational tools and Interoperability in Comparative ... - CBS
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Chapter 5<br />
Conclusion <strong>and</strong> perspectives<br />
Conclusion <strong>and</strong> perspectives<br />
This thesis has presented a number comparative genomics <strong>tools</strong> that have been used<br />
throughout different research projects <strong>and</strong> peer review publications. The aim has been to<br />
provide methods that enable the scientist to keep up with the <strong>in</strong>creas<strong>in</strong>g speed by which<br />
genome sequences are published. Visualization plays a key role <strong>and</strong> f<strong>in</strong>d<strong>in</strong>g better ways<br />
to present sequence <strong>in</strong>formation <strong>in</strong> a condensed <strong>and</strong> <strong>in</strong>tuitive way is essential for deriv<strong>in</strong>g<br />
knowledge from the large number of bacterial stra<strong>in</strong>s be<strong>in</strong>g sequenced.<br />
Information content has previously been used to quantify conservation of DNA motifs,<br />
<strong>and</strong> a recent extension of this <strong>in</strong>formation framework has allowed to model complete<br />
promotors such as the P1/P2 system described <strong>in</strong> this work. The models shown here<br />
are to a large extent specific towards E. coli P1/P2 sites. However, the design of the<br />
matrix <strong>and</strong> spac<strong>in</strong>g configuration format of the iscan tool enables for a much broader<br />
application. The tool may be used to test different hypothesis of promotor configurations<br />
across a broader range of organisms by estimat<strong>in</strong>g the promotor conservation a s<strong>in</strong>gle<br />
comparable measure. There is still efforts to be made to implement benchmark<strong>in</strong>g <strong>and</strong> to<br />
exam<strong>in</strong>e other promotor systems.<br />
S<strong>in</strong>ce the start of the human genome project (HGP) <strong>in</strong> 1990 there has been large<br />
<strong>in</strong>vestments to develop <strong>and</strong> improve sequenc<strong>in</strong>g technology. The present stage, where a<br />
bacterial genome can be sequenced for a few thous<strong>and</strong> dollars with<strong>in</strong> few hours, is a result<br />
of years of competition <strong>and</strong> <strong>in</strong>vestments <strong>in</strong> genome projects. There are no signs that new<br />
achievements <strong>in</strong> sequenc<strong>in</strong>g technology stops here. The concept of sequenc<strong>in</strong>g s<strong>in</strong>gle DNA<br />
molecules real time has long been an ultimate goal with<strong>in</strong> genomics <strong>and</strong> DNA sequenc<strong>in</strong>g.<br />
It has been demonstrated how a DNA synthesis reaction can be monitored real-time, by<br />
immobiliz<strong>in</strong>g a DNA polymerase with<strong>in</strong> a small (20 zeptoliter) well (Eid et al., 2009). If the<br />
technology reaches a f<strong>in</strong>al product, it may well start a new era <strong>in</strong> comparative genomics.<br />
Once it is possible to obta<strong>in</strong> a genome sequence at the same rate as the DNA replication<br />
itself, <strong>and</strong> at superior read lengths, sophisticated software must be implemented for the<br />
downstream process<strong>in</strong>g. The technology can give a boost to the quality of metagenomic<br />
sequenc<strong>in</strong>g, <strong>and</strong> solve the current issues of proper assembly of these data sets.<br />
The BLASTatlas tool presented <strong>in</strong> this thesis <strong>in</strong>corporates a number of software to<br />
calculate different DNA properties as well as scripts for mapp<strong>in</strong>g sequence alignments to a<br />
reference genome. The number of dependencies makes it difficult to package the software<br />
<strong>and</strong> make <strong>in</strong>stallation on other computer systems. To share these more complex <strong>tools</strong><br />
among scientists Web Services plays an important role <strong>and</strong> it has been demonstrated how<br />
analysis <strong>and</strong> visualization methods can be offered us<strong>in</strong>g this technology. At first glance the<br />
traditional web <strong>in</strong>terfaces seems more user-friendly. However, implement<strong>in</strong>g <strong>in</strong>teroperable<br />
methods like that of the BLASTatlas method, forces a process <strong>in</strong> which the communication<br />
is formalized <strong>and</strong> def<strong>in</strong>ed <strong>in</strong> every detail. This allows direct <strong>in</strong>tegration <strong>in</strong>to the user’s pro-<br />
155