22.08.2016 Views

Annual Scientific Report 2015

EMBL_EBI_ASR_2015_DigitalEdition

EMBL_EBI_ASR_2015_DigitalEdition

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Nick Goldman<br />

PhD University of Cambridge, 1992. Postdoctoral<br />

work at National Institute for Medical Research,<br />

London, 1991-1995, and University of<br />

Cambridge, 1995-2002. Wellcome Trust Senior<br />

Fellow, 1995-2006.<br />

and produced better-resolved evolutionary trees for the<br />

globeflower flies, gaining insights into the underlying<br />

causes of the flies’ incongruent trees.<br />

We continued to investigate structural and functional<br />

determinants of selective evolutionary constraint in<br />

mammals, focusing on the level of genes and domains.<br />

To facilitate the analysis of the mode of evolution, we<br />

developed a web service that integrates and displays<br />

structural information with selective constraints<br />

discovered using our Sitewise Likelihood Ratio<br />

(SLR) method.<br />

We regularly share our expertise with experimental<br />

wet-lab biologists studying specific biological problems,<br />

and in such a collaboration in <strong>2015</strong> contributed<br />

analysis of the evolutionary dynamics of DNA regions<br />

differentially methylated between different tissues.<br />

DNA in blood has more sites that become methylated<br />

compared to other tissues but, contrary to expectation,<br />

we found that this subset of sites shows the same<br />

evolutionary patterns as those that exhibit higher<br />

fractional methylation.<br />

Our work to re-purpose DNA as a medium for<br />

archiving digital information continues to be of great<br />

interest worldwide, and in <strong>2015</strong> the Biotechnology<br />

and Biological Sciences Research Council (BBSRC)<br />

supported our development of computational and<br />

laboratory DNA-handling technologies needed to bring<br />

DNA-storage closer to market. Extending state-ofthe-art<br />

methods in coding theory, we began modelling<br />

the statistical properties of both the storage medium<br />

itself and the errors induced by the “DNA synthesis<br />

> processing > storage > sequencing channel”, and<br />

developed algorithms to exploit the properties of this<br />

channel. We began developing a system that will allow<br />

mass storage of data on DNA with proven reliability and<br />

guaranteed efficiency at a level comparable to industry<br />

norms. Working closely with molecular biologists and<br />

DNA synthesis and sequencing specialists, we are<br />

developing solutions that will make DNA a viable choice<br />

for long-term, reliable and robust digital storage.<br />

Future plans<br />

We are dedicated to using mathematical modelling,<br />

statistics and computation to enable biologists to<br />

draw as much scientific value as possible from modern<br />

molecular sequence data. We will continue to improve<br />

and develop new methods for phylogenetic analysis,<br />

and new techniques to analyse incomplete datasets. We<br />

will apply of our cell lineage tree algorithm to single-cell<br />

sequencing data, and further develop the method to<br />

identify lineage divergences that are extremely difficult<br />

to pinpoint by manual analysis.<br />

At EMBL-EBI since 2002. EMBL Senior Scientist<br />

since 2009.<br />

Past work in the group was some of the first in<br />

phylogenetics to be able to relate protein sequence<br />

evolution to features of the entire evolving protein,<br />

rather than assuming mutations at different locations<br />

had independent effects. We will further develop our<br />

method to model the evolutionary forces acting on<br />

proteins involved in cellular information processing,<br />

shifting focus from the 3D structure of the evolving<br />

proteins to the interactions between binding pairs of<br />

molecules in signalling networks. The method will<br />

enable us to infer how evolutionary pressures have<br />

impacted the evolution of sequences.<br />

Clinicians are looking to genome sequencing to provide<br />

diagnostic aids and inform treatment decisions, for<br />

example in determining the correct antibiotic based<br />

on rapid determination of pathogen species and strain,<br />

or detecting mutations known to impact antibiotic<br />

resistance. We believe that state-of-the-art genomic<br />

analysis methods can assist clinicians and be further<br />

optimised to be fast and accurate. In collaboration<br />

with clinicians who have expertise in diagnostics and<br />

treatment policy, we will work on methods for informing<br />

their choices based on bacterial whole-genome<br />

sequencing. We will analyse the performance of existing<br />

methods for detecting antibiotic resistance using limited<br />

data sets, producing knowledge of value for linking the<br />

latest NGS technologies with the appropriate software<br />

for diagnostic and clinical applications.<br />

Selected publications<br />

Lowe R, Slodkowicz G, Goldman N, Rakyan VK (<strong>2015</strong>)<br />

The human blood DNA methylome displays a highly<br />

distinctive profile compared with other somatic tissues.<br />

Epigenetics 10:274–281<br />

Schwarz RF, et al. (<strong>2015</strong>) Changes in postural syntax<br />

characterize sensory modulation and natural variation<br />

of C. elegans locomotion. PLoS Comp Biol 11:e1004322<br />

Schwarz RF, et al. (<strong>2015</strong>) Spatial and temporal<br />

heterogeneity in high-grade serous ovarian cancer: a<br />

phylogenetic analysis. PLoS Medicine 12:e1001789<br />

Tan GM, et al. (<strong>2015</strong>) Current methods for automated<br />

filtering of multiple sequence alignments frequently<br />

worsen single-gene phylogenetic inference. Syst. Biol.<br />

64:778–791<br />

Truszkowski J and Goldman N (<strong>2015</strong>) Maximum<br />

likelihood phylogenetic inference is consistent on<br />

multiple sequence alignments, with or without gaps.<br />

Syst. Biol. doi: 10.1093/sysbio/syv089<br />

<strong>2015</strong> EMBL-EBI <strong>Annual</strong> <strong>Scientific</strong> <strong>Report</strong> 138

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!