Annual Scientific Report 2015
EMBL_EBI_ASR_2015_DigitalEdition
EMBL_EBI_ASR_2015_DigitalEdition
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Thornton Group<br />
Proteins: Structure,<br />
Function and Evolution<br />
The goal of our research is to understand how biology works at the molecular<br />
level, with a particular focus on protein structure and evolution and ageing.<br />
We explore how enzymes perform catalysis by gathering<br />
relevant data from the literature and developing novel<br />
software tools that allow us to characterise enzyme<br />
mechanisms and navigate the catalytic and substrate<br />
space. In parallel, we investigate the evolution of<br />
these enzymes to discover how they can evolve new<br />
mechanisms and specificities. This involves integrating<br />
heterogeneous data with phylogenetic relationships<br />
within protein families, which are based on proteinstructure<br />
classification data derived by colleagues at<br />
University College London (UCL). The practical goal<br />
of this research is to improve the prediction of function<br />
from sequence and structure and to enable the design of<br />
new proteins or small molecules with novel functions.<br />
We also explore sequence variation between individuals,<br />
especially those variants related to diseases.<br />
To understand more about the molecular basis of<br />
ageing in different organisms, we participate in a close<br />
collaboration with experimental biologists at UCL. Our<br />
role is to analyse functional genomics data from flies,<br />
worms and mice and, by developing new software tools,<br />
relate these observations to effects on lifespan.<br />
Major achievements<br />
Chemistry and evolution of isomerases<br />
Biologists are challenged with the functional<br />
interpretation of vast amounts of sequencing data<br />
derived from genomics initiatives. Among all known<br />
proteins, the function of enzymes is probably the most<br />
investigated and best described at the molecular level.<br />
Together with enzymes changing the redox state of<br />
substrates and transferring chemical groups between<br />
molecules, isomerases catalyse interconversion<br />
of isomers, molecules sharing the same atomic<br />
composition but different arrangements of chemical<br />
groups. In this study (Martinez-Cuesta et al, 2016),<br />
we catalogued the isomerisation reactions known to<br />
occur in biology using a combination of manual and<br />
computational approaches. This method provides<br />
a robust basis for comparison and clustering of the<br />
reactions into classes. Comparing our results with the<br />
Enzyme Commission (EC) classification, the standard<br />
approach to represent enzyme function on the basis of<br />
the overall chemistry of the catalysed reaction, expands<br />
our understanding of the biochemistry of isomerization.<br />
The grouping of reactions involving stereoisomerism<br />
is straightforward with two distinct types (racemases/<br />
epimerases and cis-trans isomerases), but reactions<br />
entailing structural isomerism are diverse and<br />
challenging to classify using a hierarchical approach.<br />
This study provided an overview of which isomerases<br />
occur in nature, how we should describe and classify<br />
them, and their diversity.<br />
Large-scale analysis exploring evolution<br />
of catalytic machineries and mechanisms<br />
in enzyme superfamilies<br />
Enzymes, as biological catalysts, form the basis of all<br />
forms of life, but how these proteins have evolved their<br />
functions remains a fundamental question in biology.<br />
Using a range of computational tools and resources we<br />
compiled information on all experimentally annotated<br />
changes in enzyme function within 379 structurally<br />
defined protein domain superfamilies, linking the<br />
changes observed in functions during evolution, to<br />
changes in reaction chemistry (Furnham et al, <strong>2015</strong>).<br />
Many superfamilies show changes in function at<br />
some level, although one function often dominates<br />
one superfamily. We used quantitative measures of<br />
changes in reaction chemistry to reveal the various<br />
types of chemical changes occurring during evolution<br />
and provided detailed examples. We used structural<br />
information of the enzymes’ active sites to examine how<br />
different superfamilies have changed their catalytic<br />
machinery during evolution. Some superfamilies<br />
have changed the reactions they perform without<br />
changing catalytic machinery; in others, large changes<br />
of enzyme function, both in terms of overall chemistry<br />
and substrate specificity, have been brought about by<br />
significant changes in catalytic machinery. Interestingly,<br />
the relatives of some superfamilies perform similar<br />
functions using different catalytic machineries. Our<br />
analysis highlighted characteristics of functional<br />
evolution across a wide range of superfamilies, providing<br />
insights that are useful in predicting the function of<br />
uncharacterised sequences and in the design of new<br />
synthetic enzymes.<br />
Longevity GWAS using the Drosophila<br />
Genetic Reference Panel<br />
The results of our genome-wide association study<br />
(GWAS) for Drosophila lifespan, based on 197<br />
Drosophila Genetic Reference Panel (DGRP) lines<br />
(Ivanov et al. <strong>2015</strong>), suggested that the top associated<br />
genes provide good candidates for further<br />
investigation into their relationship<br />
with lifespan and ageing. While our<br />
147<br />
<strong>2015</strong> EMBL-EBI <strong>Annual</strong> <strong>Scientific</strong> <strong>Report</strong>