28.02.2013 Views

Bio-medical Ontologies Maintenance and Change Management

Bio-medical Ontologies Maintenance and Change Management

Bio-medical Ontologies Maintenance and Change Management

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

50 E. De Francesco et al.<br />

In Figure 7, an example of the synthetic description we adopted is illustrated.<br />

In particular, a form is exploited where both the acronym <strong>and</strong><br />

the full name of the database under analysis are showed. We consider the<br />

CryptoDB [35] (Cryptosporidium Genome Resources) database, that is, the<br />

genome database of the AIDS-related apicomplexan-parasite, as reported in<br />

the brief description below the full name in the form. Also the web-link to<br />

the considered database is provided, along with its database features. In<br />

particular, CryptoDB is a relational DBMS based on the Genomic Unified<br />

Schema (illustrated above, in Section 2.2). All features corresponding to the<br />

other analysis criteria are reported. In detail, we highlight that CryptoDB<br />

contains information on genes, clone/contig sequences, polymorphism <strong>and</strong><br />

motifs, <strong>and</strong> that it stores physical maps, pathways <strong>and</strong> bibliographic references<br />

as well. It supports simple queries <strong>and</strong> also analysis queries based on<br />

similarity search, <strong>and</strong> allows graphical interaction <strong>and</strong> sequence based query<br />

methods. Additional information are contained in the form about the search<br />

criteria supported by each database. In particular, CryptoDB can be queried<br />

by using both keywords <strong>and</strong> map positions as search parameters (map positions<br />

require the specification of the positions of chromosomic regions). The<br />

result formats supported by CryptoDB are HTML (as all the other genomic<br />

databases), FASTA <strong>and</strong> tab-separated files. It allows the user to download<br />

data stored in its repository, but not to submit new data, as specified in the<br />

last two field (“submit” <strong>and</strong> “download”) of the form.<br />

3 Concluding Remarks<br />

In this chapter we presented the main results of an analysis we have conducted<br />

on 74 genomic databases. Such an analysis has been done mainly focusing<br />

on a database technology point of view, by choosing five specific criteria<br />

under which all the considered databases have been investigated. Thus, the<br />

considered databases have been classified according to such criteria <strong>and</strong> their<br />

synthetic descriptions are given, with the aim of providing a preliminary<br />

instrument to access <strong>and</strong> manage more easily the enormous amount of data<br />

coming from the biological repositories.<br />

References<br />

1. ArkDB, http://www.thearkdb.org/arkdb//index.jsp<br />

2. AureoList, http://genolist.pasteur.fr/AureoList<br />

3. BeetleBase, http://www.bioinformatics.ksu.edu/BeetleBase<br />

4. <strong>Bio</strong>cyc website, http://biocyc.org<br />

5. Bovilist world-wide web server, http://genolist.pasteur.fr/BoviList<br />

6. C<strong>and</strong>idadb: Database dedicated to the analysis of the genome of the human<br />

fungal pathogen, c<strong>and</strong>ida albicans, http://genolist.pasteur.fr/C<strong>and</strong>idaDB<br />

7. Central aspergilus data repository, www.cadre.man.ac.uk<br />

8. Chado: The gmod database schema, http://www.gmod.org/chado

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!