14.06.2013 Views

Databases and Systems

Databases and Systems

Databases and Systems

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

237<br />

In order to make wrapper development easier <strong>and</strong> st<strong>and</strong>ardize tool integration into<br />

the Workbench, several files containing subroutines are included in the Workbench<br />

tree. These files are generally “required” by all Perl wrappers, but the subroutines<br />

may be extracted <strong>and</strong> revised to suit individual needs. While we recommend using<br />

the st<strong>and</strong>ard subroutines wherever possible, we cannot fully predict the needs of users<br />

for other non-st<strong>and</strong>ard tool specific routines. This concept is being refined for future<br />

releases of the Workbench.<br />

The central file for tool integration is the “html.pl” file, which contains the<br />

necessary subroutines for communication with BW.cgi (the Workbench core CGI<br />

program) <strong>and</strong> for passing configuration information to the wrapper scripts. This file<br />

should be included at the beginning of all scripts using the “require” directive in Perl.<br />

Tools wrappers may, however, require only a subset of the subroutines in the html.pl<br />

file.<br />

Sessions<br />

Each time a user accesses the Workbench, a previous analysis session can be<br />

resumed or a new one initiated. The role of the sessions is to maintain the state <strong>and</strong><br />

sequence of operations performed on the Workbench during an interactive session.<br />

Sessions can be useful for both recalling previously performed operations as well as<br />

for comparison with newer analyses. Essentially, sessions provide the history of the<br />

Workbench use <strong>and</strong> can be updated, erased or permanently stored.<br />

<strong>Databases</strong><br />

Almost all of the relevant biological sequence, structure <strong>and</strong> literature databases are<br />

accessible from the Biology Workbench. The databases are federated by<br />

identification of common objects <strong>and</strong> indexing each database based on these objects.<br />

We use the SRS schema (Etzold <strong>and</strong> Argos, 1993) to store <strong>and</strong> access the databases<br />

allowing them to be queried for multiple objects as well as combinations of objects<br />

using Boolean operators on the query construction page. The Workbench also<br />

permits nested Boolean queries.<br />

To aid the integration of SRS indexed databases, the file “db_config.pl”<br />

containing routines for parsing database directories <strong>and</strong> identifying available<br />

databases is provided. These routines also extract important information on specific<br />

databases to allow sequence information to be properly imported to the Workbench.<br />

A database mirroring program has also been developed for the Biology<br />

Workbench, which fetches databases as they are updated in their primary<br />

repositories. The indexing <strong>and</strong> creation of Blastable sequence database files is done<br />

locally each time new databases are retrieved. A complete list of all databases<br />

available on the Workbench is presented in Appendix A.<br />

Analysis Modules

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!