21.11.2014 Views

ayout 1 - EMBL Grenoble

ayout 1 - EMBL Grenoble

ayout 1 - EMBL Grenoble

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>EMBL</strong>-EBI<br />

The GO Editorial Office<br />

Previous and current research<br />

The Gene Ontology (GO) project (www.geneontology.org) is a collaborative effort to construct and<br />

use ontologies to facilitate the biologically meaningful annotation of genes and their products in<br />

a wide variety of organisms. At the EBI, the GO Editorial Office plays a key role in managing the<br />

distributed task of developing and maintaining the GO vocabularies, and contributes to a number<br />

of other GO project efforts, including documentation, web presence, software testing and user<br />

support.<br />

The Gene Ontology Consortium (GOC) provides the scientific community with a consistent and<br />

robust infrastructure, in the form of biological ontologies, for describing, integrating, and comparing<br />

the structures of genetic elements and the functional roles of gene products within and between<br />

organisms. The GO ontologies cover three key biological domains that are shared by all<br />

organisms:<br />

• molecular function defines the tasks performed by individual gene products; examples<br />

include aminoacyl-tRNA ligase activity and translation elongation factor activity;<br />

Midori Harris<br />

PhD 1997, Cornell University,<br />

Ithaca, NY.<br />

Scientific Curator,<br />

Saccharomyces Genome<br />

Database, Stanford<br />

University, Stanford, CA.<br />

GO Editor at <strong>EMBL</strong>-EBI since<br />

2001.<br />

• biological process defines broad biological goals, such as signal transduction or ribosome<br />

assembly, that are accomplished by ordered assemblies of molecular functions;<br />

• cellular component describes subcellular structures, locations and macromolecular complexes; examples include cytoplasm, ribosome<br />

and translation release factor complex.<br />

In addition, sequence features are covered by the Sequence Ontology, which is maintained separately from the three GO ontologies (Eilbeck<br />

et al., 2005).<br />

The ontologies in GO are structured as directed acyclic graphs<br />

(DAGs), wherein any term may have one or more parents and<br />

zero, one, or more children. Within each vocabulary, terms are<br />

defined and parent–child relationships between terms are specified.<br />

A child term is a subset of its parent(s). The GO vocabularies<br />

have long defined two semantic relationships between<br />

parent and child terms: is_a and part_of. The is_a relationship<br />

means that a term is a subclass of its parent; part of may mean<br />

‘physically part of ’ (as in the cellular component ontology) or<br />

‘subprocess of ’ (as in the biological process ontology). New relationships<br />

representing biological regulation have recently<br />

been added, as described below. The figure shows a portion of<br />

the GO cellular component DAG.<br />

Future projects and goals<br />

The GO Editorial Office will continue to work closely with the<br />

rest of the GO Consortium and with biological experts to ensure<br />

that the ontologies are comprehensive, logically rigorous<br />

and biologically accurate. Improvements begun in 2008 on signal<br />

transduction, transcription, and other topics will therefore<br />

continue in 2009. The new regulates relationships will be used to<br />

create the first links between the biological process and molecular<br />

function ontologies, with additional types of links to follow.<br />

Work on recasting many complex process terms as explicit crossproducts<br />

with orthogonal ontologies such as the ChEBI ontology<br />

and the cell ontology will also continue.<br />

GO terms are organised in directed acyclic graphs (DAGs) – hierarchical<br />

structures in which any ‘child’ (more specialised term) can have many<br />

‘parents’ (less specialised terms). For example, the cellular component term<br />

chloroplast envelope has two parents, reflecting the fact that it is a part of<br />

the chloroplast and a type of envelope. Any gene that is annotated to this<br />

term is automatically annotated to both chloroplast and envelope. Some<br />

terms and relationships have been omitted for clarity.<br />

Selected references<br />

The Gene Ontology Consortium (2008). The Gene Ontology (GO)<br />

Project in 2008. Nucleic Acids Res., 36, D0-D<br />

Eilbeck, K. et al. (2005). The Sequence Ontology: a tool for the<br />

unification of genome annotations. Genome Biol., 6, R<br />

The Gene Ontology Consortium (2001). Creating the Gene Ontology<br />

resource: design and implementation. Genome Res., 11, 125-133<br />

75

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!