Annual Scientific Report 2015

Gene Expression

The Gene Expression team handles the acquisition, curation, quality control, statistical analysis and visualisation of functional genomics data at EMBL-EBI, focusing on microarray, high-throughput sequencing-based gene expression and related proteomics data. We are responsible for several core EMBL-EBI resources, including the Expression Atlas, which enables users to query for information about gene expression, and the ArrayExpress archive of functional genomics data. We contribute substantially to online and face-to-face training in transcriptomics, in particular relating to our team’s resources but also on related topics such as next-generation sequencing.

We are a centre of excellence for RNA-sequencing quality control and analysis, the results of which are used by numerous resources at EMBL-EBI and externally. We are increasingly interested in epigenetic analysis, for example methylation, and work towards placing transcriptomic data in a broader regulatory context. We are part of Open Targets (formerly the Centre for Therapeutic Target Validation, CTTV) and the Cancer Genome Atlas Pan-Cancer analysis project. Analysis and visualisation of plant data are also a major component of our work through our involvement in the Gramene project.

We collaborate closely with the Brazma, Marioni, Stegle and Teichmann research groups at EMBL-EBI and with the Choudhary group at the Wellcome Trust Sanger Institute, developing new methods and algorithms, integrating new types of data across multiple platforms, and investigating relationships between transcriptomics and proteomics data in the context of cancer genomics.

Major achievements

ArrayExpress, Expression Atlas and related projects

In 2015 we capitalised on the deployment and continual improvement of Annotare, the ArrayExpress submission tool, and focused our curation efforts on datasets in the Expression Atlas, which held 100 000 assays in December 2015 (a six-fold increase compared to 2014). These assays included 157 RNA-seq experiments, over 7000 differential comparisons across 26 organisms, and 568 plant experiments. At the end of 2015 the Baseline Expression Atlas contained 46 RNA-seq studies, including data from many high-impact studies (e.g. GTEx and FANTOM5) and its first proteomics study.

We improved the Expression Atlas interface substantially, with many enhancements to the presentation of search results (e.g. faceting). We developed new functionality that will be available to users in early 2016, for example gene co-expression and a new Bioconductor package for easy access to Atlas data in R. The Expression Atlas now contributes transcriptomic data and visualisations to many resources, including the Open Targets (formerly CTTV), Ensembl, Reactome, Plant Reactome and International Mouse Phenotyping Consortium portals.

We developed an RNA-seq pipeline and adapted it to help analyse public RNA-seq data for major species in the European Nucleotide Archive’s Sequence Read Archive. By the end of the year this pipeline had processed 148 000 sequencing runs from 85 species. Where applicable, these data are included in both the Expression Atlas and Ensembl.

Future plans

In 2016 our development efforts for the biology-centric Expression Atlas will centre on integration of baseline RNA-sequencing gene expression and proteomics data. The BioStudies database, developed by the Sarkans team, will serve as the back-end for dealing with new types of data, including molecular imaging data. We will continue to expand our analyses and develop intuitive visualisation methods both for the existing data in Expression Atlas and for novel data types, such as epigenetic (methylation), genetic (eQTL), single-cell RNA-seq and small RNA-seq. We will also complete the analysis of public RNA-seq data in major species and make the raw results available publicly. As part of the pan-cancer project of the ICGC, we will continue to investigate aberrant transcription patterns across many cancer types.

2015 EMBL-EBI Annual Scientific Report

Robert Petryszak
Gene Expression

MPhil in Computer Speech and Language Processing, University of Cambridge, UK. At EMBL-EBI since 2003. Team Leader since 2015.

Selected publications

Frankish A, et al. (2015) Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction. BMC Genomics 16 Suppl 8:S2

Kolesnikov N, Hastings E, Keays M, et al. (2015) ArrayExpress update - simplifying data submissions. Nucleic Acids Res. 43:D1113-D1116

Petryszak R, et al. (2016) Expression Atlas update - an integrated database of gene and protein expression in humans, animals and plants. Nucleic Acids Res. 44:D746-D752

Tello-Ruiz MK, et al. (2016) Gramene 2016: comparative plant genomics and pathway resources. Nucleic Acids Res. 44:D1133-D1140

Expression Atlas: baseline expression in tissues and cell lines for human gene REG1B.