29.07.2013 Views

Computational tools and Interoperability in Comparative ... - CBS

Computational tools and Interoperability in Comparative ... - CBS

Computational tools and Interoperability in Comparative ... - CBS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 4<br />

Web Services <strong>and</strong> <strong>Interoperability</strong> <strong>in</strong> Genomics<br />

Web Services <strong>and</strong> <strong>Interoperability</strong><br />

<strong>in</strong> Genomics<br />

This chapter describes work done connection with the EU project EMBRACE. The deliverables<br />

def<strong>in</strong>ed for <strong>CBS</strong> have had both outreach obligations as well as implementation<br />

tasks of provid<strong>in</strong>g <strong>tools</strong> <strong>and</strong> databases through Web Services. This author’s contributions<br />

reflect this duality; there was a responsibility for develop<strong>in</strong>g the server <strong>in</strong>frastructure for<br />

host<strong>in</strong>g Web Services while also teach<strong>in</strong>g about us<strong>in</strong>g <strong>and</strong> design concepts on several occasions<br />

(see appendix A.1). <strong>CBS</strong> is now us<strong>in</strong>g this work to <strong>in</strong>tegrate all major prediction<br />

servers under the same Web Services umbrella. There are currently 17 services offered<br />

us<strong>in</strong>g this technology 1 . The work on Web Services has made the foundation for creat<strong>in</strong>g<br />

an onl<strong>in</strong>e resource like BLASTatlas (paper I). Further, the RNAmmer tool (VI) is offered<br />

both as a traditional web <strong>in</strong>terface <strong>and</strong> through Web Services <strong>and</strong> these implementations<br />

demonstrate the usefullness of programmtic access to <strong>tools</strong>.<br />

4.1 Introduction<br />

Over the past decade, the <strong>in</strong>ternet has undoubtedly revolutionized the way <strong>in</strong>formation<br />

is exchanged <strong>in</strong> the modern society. From bank transactions, digital road maps <strong>and</strong><br />

satellite images, email<strong>in</strong>g, news articles, <strong>and</strong> social networks, these services are now hard<br />

to imag<strong>in</strong>e, without a digitally connected world. Biological <strong>and</strong> bio<strong>in</strong>formatic <strong>in</strong>formation<br />

is no exception as it relies on the <strong>in</strong>ternet to provide the transport of sequence data,<br />

experimental results, scientific articles etc. Both the number <strong>and</strong> complexity of biological<br />

<strong>in</strong>formation <strong>in</strong>creases day by day. As new experimental techniques become available, new<br />

types of data as well as new ways of comb<strong>in</strong><strong>in</strong>g them, are <strong>in</strong>troduced. For decades, the<br />

exchange of biological <strong>in</strong>formation over the <strong>in</strong>ternet has been <strong>in</strong> the form of human readable<br />

HTML documents (HyperText Markup Language) - or flat files resid<strong>in</strong>g on FTP servers<br />

(File Transfer Protocol). When designed, HTML was <strong>in</strong>tended to host static <strong>in</strong>formation<br />

presented by a server to a human be<strong>in</strong>g us<strong>in</strong>g a browser. Today, computers are required<br />

to digest the huge amounts of <strong>in</strong>formation with less <strong>in</strong>volvement of humans, <strong>and</strong> more<br />

advanced technologies are now required. To successfully <strong>in</strong>tegrate the vast amounts of<br />

data provided by the life science community, <strong>in</strong>teroperability rema<strong>in</strong>s a key issue. It<br />

may seem unrealistic to reach a po<strong>in</strong>t where every biologist <strong>and</strong> bio<strong>in</strong>formatician has<br />

the world’s biological databases <strong>and</strong> <strong>tools</strong> accessible through programmatic access, from<br />

their favorite programm<strong>in</strong>g language. However, with the current technologies <strong>in</strong> Web<br />

1 BLASTatlas, EasyGene, EPipe, GeneWiz, GenomeAtlas, hERG, MaxAlign, NetChop, NetCTL, Net-<br />

Glycate, NetNGlyc, NetOGlyc, NetPhos, RNAmmer, SIDDbase, SignalP, <strong>and</strong> TMHMM<br />

145

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!