22.08.2016 Views

Annual Scientific Report 2015

EMBL_EBI_ASR_2015_DigitalEdition

EMBL_EBI_ASR_2015_DigitalEdition

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Vertebrate Genomics<br />

The Vertebrate Genomics team works together with the Vertebrate Annotation,<br />

Genome Analysis and Genomics Technology Infrastructure teams to create and<br />

deliver resources of the Ensembl project. It is responsible for data management<br />

for large-scale projects, including BLUEPRINT. It maintains the International<br />

Genome Sample Resource, which incorporates 1000 Genomes Project data, and<br />

the curated NHGRI-EBI GWAS Catalog. These resources are publicly available<br />

and widely used by the scientific community, including members of our team<br />

who conduct research into evolution, epigenetics and transcriptional regulation.<br />

83<br />

Vertebrate Genomics is responsible for the scientific<br />

leadership of the Ensembl project, the development<br />

of the Ensembl website and Ensembl outreach and<br />

training. We are also actively involved in the Global<br />

Alliance for Genomics and Health (GA4GH).<br />

Variation Annotation activities in the Vertebrate<br />

Genomics team, coordinated by Fiona Cunningham,<br />

include enriching our part of the GWAS Catalog, which<br />

is created jointly with the Samples, Phenotypes and<br />

Ontologies team. This resource grew substantially upon<br />

migration to EMBL-EBI in early <strong>2015</strong>.<br />

Resequencing Informatics, coordinated by Laura<br />

Clarke, provides data coordination activities for many<br />

projects and consortia, including the European Bank for<br />

induced pluripotent Stem Cells (EBiSC), BLUEPRINT<br />

and HipSci. This part of our team also leads the<br />

International Genome Sample Resource (IGSR), which<br />

is a continuation of the 1000 Genomes Project.<br />

Our research projects focus the evolution of<br />

transcriptional regulation, understanding tissue<br />

specificity and annotation of the non-coding genome.<br />

Based on comparative regulatory genomics techniques,<br />

our work provides some of the most definitive results<br />

on how transcription factor binding evolves across the<br />

vertebrate lineage. Our studies of tissue-specific gene<br />

regulation, published in <strong>2015</strong>, included clarifying<br />

the role of CTCF, cohesin and other genomic<br />

structural proteins.<br />

Major achievements<br />

Ensembl<br />

We released five comprehensive updates to Ensembl in<br />

addition to a special update of our website supporting<br />

GRCh37, the previous version of the human genome<br />

assembly. In <strong>2015</strong> we focused on improving and<br />

extending four key areas of the Ensembl website: new<br />

views and tools, performance and usability of existing<br />

views, support for track hubs and mirror sites. We<br />

also introduced a new visualisation for long-range<br />

<strong>2015</strong> EMBL-EBI <strong>Annual</strong> <strong>Scientific</strong> <strong>Report</strong><br />

connections between genomic regions such as<br />

enhancer–promoter interactions.<br />

We promoted and facilitated the use of Ensembl<br />

by researchers, clinicians and educators through<br />

97 in-person workshops, conference attendance,<br />

high-quality online training, sustained social media<br />

engagemen and usability testing of new interfaces. We<br />

conducted workshops in several developing countries,<br />

including Colombia, Malawi and Tanzania. We also<br />

held workshops at major conferences including ASHG,<br />

ESHG, PAG and PAGAsia.<br />

Data Management and Coordination<br />

In <strong>2015</strong> we updated the data from several major projects<br />

to reflect the new GRCh38 human reference assembly.<br />

This involved realignment of more than 70 Terabases of<br />

DNA sequence from the 1000 Genomes Project for the<br />

IGSR and updating all BLUEPRINT analyses.<br />

We performed a major upgrade of the HipSci website to<br />

improve the discoverability of individual cell lines and<br />

related information.<br />

We helped launch the Functional Annotation of<br />

Animal Genomes (FAANG) project, which is producing<br />

comprehensive maps of functional elements in the<br />

genomes of domesticated animal species. In this project,<br />

we led efforts to define data and metadata standards to<br />

ensure the outputs will be reusable, well into the future.<br />

Genome-Wide Association<br />

Studies Catalog<br />

We helped relocate the GWAS Catalog software<br />

infrastructure from the National Human Genome<br />

Research Institute (NHGRI) in the US to EMBL-EBI.<br />

The new NHGRI-EBI GWAS Catalog website serves<br />

as a single point of access, and includes an improved<br />

search interface using SOLR technology and supporting<br />

ontology expansion queries. Our curation team adds new<br />

studies to the resource, which now offers data from more<br />

than 2000 studies representing nearly 15 000 SNP–trait<br />

associations.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!