You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>EMBL</strong>-EBI<br />
The GO Editorial Office<br />
Previous and current research<br />
The Gene Ontology (GO) project (www.geneontology.org) is a collaborative effort to construct and<br />
use ontologies to facilitate the biologically meaningful annotation of genes and their products in<br />
a wide variety of organisms. At the EBI, the GO Editorial Office plays a key role in managing the<br />
distributed task of developing and maintaining the GO vocabularies, and contributes to a number<br />
of other GO project efforts, including documentation, web presence, software testing and user<br />
support.<br />
The Gene Ontology Consortium (GOC) provides the scientific community with a consistent and<br />
robust infrastructure, in the form of biological ontologies, for describing, integrating, and comparing<br />
the structures of genetic elements and the functional roles of gene products within and between<br />
organisms. The GO ontologies cover three key biological domains that are shared by all<br />
organisms:<br />
• molecular function defines the tasks performed by individual gene products; examples<br />
include aminoacyl-tRNA ligase activity and translation elongation factor activity;<br />
Midori Harris<br />
PhD 1997, Cornell University,<br />
Ithaca, NY.<br />
Scientific Curator,<br />
Saccharomyces Genome<br />
Database, Stanford<br />
University, Stanford, CA.<br />
GO Editor at <strong>EMBL</strong>-EBI since<br />
2001.<br />
• biological process defines broad biological goals, such as signal transduction or ribosome<br />
assembly, that are accomplished by ordered assemblies of molecular functions;<br />
• cellular component describes subcellular structures, locations and macromolecular complexes; examples include cytoplasm, ribosome<br />
and translation release factor complex.<br />
In addition, sequence features are covered by the Sequence Ontology, which is maintained separately from the three GO ontologies (Eilbeck<br />
et al., 2005).<br />
The ontologies in GO are structured as directed acyclic graphs<br />
(DAGs), wherein any term may have one or more parents and<br />
zero, one, or more children. Within each vocabulary, terms are<br />
defined and parent–child relationships between terms are specified.<br />
A child term is a subset of its parent(s). The GO vocabularies<br />
have long defined two semantic relationships between<br />
parent and child terms: is_a and part_of. The is_a relationship<br />
means that a term is a subclass of its parent; part of may mean<br />
‘physically part of ’ (as in the cellular component ontology) or<br />
‘subprocess of ’ (as in the biological process ontology). New relationships<br />
representing biological regulation have recently<br />
been added, as described below. The figure shows a portion of<br />
the GO cellular component DAG.<br />
Future projects and goals<br />
The GO Editorial Office will continue to work closely with the<br />
rest of the GO Consortium and with biological experts to ensure<br />
that the ontologies are comprehensive, logically rigorous<br />
and biologically accurate. Improvements begun in 2008 on signal<br />
transduction, transcription, and other topics will therefore<br />
continue in 2009. The new regulates relationships will be used to<br />
create the first links between the biological process and molecular<br />
function ontologies, with additional types of links to follow.<br />
Work on recasting many complex process terms as explicit crossproducts<br />
with orthogonal ontologies such as the ChEBI ontology<br />
and the cell ontology will also continue.<br />
GO terms are organised in directed acyclic graphs (DAGs) – hierarchical<br />
structures in which any ‘child’ (more specialised term) can have many<br />
‘parents’ (less specialised terms). For example, the cellular component term<br />
chloroplast envelope has two parents, reflecting the fact that it is a part of<br />
the chloroplast and a type of envelope. Any gene that is annotated to this<br />
term is automatically annotated to both chloroplast and envelope. Some<br />
terms and relationships have been omitted for clarity.<br />
Selected references<br />
The Gene Ontology Consortium (2008). The Gene Ontology (GO)<br />
Project in 2008. Nucleic Acids Res., 36, D0-D<br />
Eilbeck, K. et al. (2005). The Sequence Ontology: a tool for the<br />
unification of genome annotations. Genome Biol., 6, R<br />
The Gene Ontology Consortium (2001). Creating the Gene Ontology<br />
resource: design and implementation. Genome Res., 11, 125-133<br />
75