Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
6<br />
end databases <strong>and</strong> front-end viewers <strong>and</strong> analysis tools. In contrast to OPM <strong>and</strong><br />
BioKleisli, CORBA does not so much offer a st<strong>and</strong>ard data model as provide a<br />
mechanism for isolating system components from the design constraints that might be<br />
imposed by choosing particular data models. The Biowidgets article by<br />
Crabtree et al describes an approach to modular software development for<br />
bioinformatics visualization. Biowidgets represents an attempt to apply that modular<br />
design philosophy to the problem of data visualization.<br />
The ACE database manager, described by Jean <strong>and</strong> Danielle Thierry-Mieg,<br />
is the database kernel of the ACEDB system developed by Jean Thierry-Mieg <strong>and</strong><br />
Richard Durbin for the C.elegans genomic database, <strong>and</strong> subsequently reused widely<br />
by many other projects. The tremendous success of ACEDB over the years can be<br />
attributed to a number of factors, including its biologist-friendly user interface, the<br />
ease with which data can be entered into it, the ease with which its schema can be<br />
modified, the ease with which new visualization <strong>and</strong> analysis routines can be added<br />
to it, the extensive genomics knowledge incorporated into its st<strong>and</strong>ard schema <strong>and</strong><br />
software, <strong>and</strong> its price (free!). At the core of the system is a somewhat non-st<strong>and</strong>ard<br />
object-oriented DBMS which over a reasonably wide range of database sizes can<br />
outperform most commercial relational databases at tasks such as retrieving an object<br />
detail view, pulling up annotated sequence, or displaying a genetic map. It is<br />
heartening, in an era when DBMS development has been taken over by large<br />
corporations wielding enormous programming teams, to see that a tiny team of<br />
talented individuals can still write a system which outperforms the commercial<br />
products in many respects.<br />
Laboratory information management systems (LIMS) are a species of<br />
software remote from the experience of many bioinformatics practitioners, for whom<br />
data are something that automatically appear in databases. Those of us who have had<br />
occasion to w<strong>and</strong>er into the more upstream regions of the data production process<br />
encounter a world of robots, sequencing machines, bench biologists, pipettes <strong>and</strong> 96well<br />
plates. This world turns out to be full of interesting <strong>and</strong> exotic software<br />
engineering challenges which are not well served by the current generation of<br />
commercial solutions. Goodman et al present an elegant <strong>and</strong> general approach to the<br />
problem of laboratory process workflow which they developed for use in the<br />
Whitehead Genome Sequencing Center.<br />
The articles in this book constitute a very incomplete sample of the<br />
interesting systems out there. Notable gaps include databases for such important<br />
model organisms as yeast, C.elegans <strong>and</strong> Arabidopsis as well as a number of<br />
microbial databases; databases on protein sequences, families, <strong>and</strong> 3D structures;<br />
expression profile databases, <strong>and</strong> transcription factor <strong>and</strong> promoter databases.<br />
Starting points for finding such resources include [ 17,18].<br />
One final note: in keeping with the rapid pace of change in this field, many<br />
ofthe authors are no longer at the institutions where the work was performed, <strong>and</strong> in<br />
some cases the addresses shown are out of date. During the period when this book<br />
was being written (1997 -98), a number of the authors (myself, Markowitz et al, Karp,<br />
Etzold) moved from public sector to private sector positions, <strong>and</strong> several of the