You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
60<br />
not independent of biology; finally, we have extensively trained the project members<br />
who perform knowledge entry. The advantage of domain-independent knowledge<br />
acquisition tools is that they can be reused across application domains; however, for<br />
most tasks they cannot compete with the speed <strong>and</strong> intuitiveness of domain-specific<br />
tools.<br />
Another advantage of graphical knowledge acquisition tools is that they can<br />
perform immediate checking of new information. In the first few years of our<br />
project, knowledge was entered into structured text files by one group of people, <strong>and</strong><br />
then loaded into the KB by a second group of people who ran parsers over the files.<br />
The inevitable errors <strong>and</strong> inconsistencies in the information were time consuming for<br />
the two groups to resolve.<br />
In its early phases, the EcoCyc project collected kinetics data for each metabolic<br />
enzyme. We ceased to collect this data upon discovering that the data are extremely<br />
time consuming to collect because there are large amounts of kinetics data in the<br />
literature, <strong>and</strong> upon discovering that this information is of relatively low utility<br />
because the data are collected under widely varying experimental conditions, <strong>and</strong><br />
therefore cannot easily be combined to produce a coherent simulation of a set of<br />
enzymes under a single condition.<br />
EcoCyc employs a frame knowledge representation system as its data management<br />
substrate. Its schema-evolution capabilities have simplified the development of the<br />
EcoCyc schema. Its object-oriented data model has yielded a much simpler <strong>and</strong> more<br />
comprehensible schema than would be possible under the relational model.<br />
Development of the EcoCyc GUI began before invention of the WWW; therefore,<br />
EcoCyc was developed using a Common Lisp X-windows environment. We were<br />
able to retrofit EcoCyc to run through the WWW in addition to X-windows through<br />
the use of a tool called CWEST. CWEST translates low-level data structures<br />
produced by the Common Lisp windowing system into a combination of GIF images<br />
<strong>and</strong> HTML pages, <strong>and</strong> allowed us to adapt EcoCyc to the WWW with relatively little<br />
effort. Our use of a powerful windowing system was an extremely positive decision.<br />
References<br />
1.<br />
2.<br />
3.<br />
Bairoch. The ENZYME databank in 1995. Nucl Acids Res, 24:221--222, 1996.<br />
Berlyn, K. Brooks Low, K.E. Rudd, <strong>and</strong> M. Singer. Linkage map of Escherichia<br />
coli K—12, edition 9. In Neidhardt et al. [16], pages 1715--1902.<br />
Blattner, G. Plunkett III, C.A. Bloch, N.T. Perna, V. Burl<strong>and</strong>, M. Riley, J.<br />
Collado-Vides, J.D. Glasner, C.K. Rode, G.F. Mayhew, J. Gregor, N.W. Davis,<br />
H.A. Kirkpatrick, M.A. Goeden, D.J. Rose, B. Mau, <strong>and</strong> Y. Shao. The complete<br />
genome sequence of Escherichia coli K—12. Science, 277: 1453--1462, 1997.<br />
4. Cameron <strong>and</strong> I. Tong. Cellular <strong>and</strong> metabolic engineering: An overview.<br />
Applied Biochemistry <strong>and</strong> Biotechnology, 38: 105--140, 1993.<br />
5. J.-F. Tomb et al. The complete genome sequence of the gastric pathogen<br />
Helicobacter pylori. Nature, 388539--547, 1997.