03.12.2015 Views

bbc 2015

BBC2015_booklet

BBC2015_booklet

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

BeNeLux Bioinformatics Conference – Antwerp, December 7-8 <strong>2015</strong><br />

Abstract ID: O21<br />

Oral presentation<br />

10th Benelux Bioinformatics Conference <strong>bbc</strong> <strong>2015</strong><br />

O21. CLUB-MARTINI: SELECTING FAVORABLE INTERACTIONS<br />

AMONGST AVAILABLE CANDIDATES: A COARSE-GRAINED SIMULATION<br />

APPROACH TO SCORING DOCKING DECOYS<br />

Qingzhen Hou 1* , Kamil K. Belau 2 , Marc F. Lensink 3 , Jaap Heringa 1 & K. Anton Feenstra 1* .<br />

Center for Integrative Bioinformatics VU (IBIVU), VU University Amsterdam, De Boelelaan 1081A, 1081 HV<br />

Amsterdam, The Netherlands 1 ; Intercollegiate Faculty of Biotechnology, University of Gdańsk - Medical University of<br />

Gdańsk, Kładki 24, 80-822 Gdańsk, Poland 2 ; Institute for Structural and Functional Glycobiology (UGSF), CNRS<br />

UMR8576, FRABio FR3688, University Lille, 59000, Lille, France 3 .<br />

Protein-protein Interactions (PPIs) play a central role in all cellular processes. Large-scale identification of native binding<br />

orientations is essential to understand the role of particular protein-protein interactions in their biological context. We<br />

estimate the binding free energy using coarse-grained simulations with the MARTINI forcefield, and use those to rank<br />

decoys for 15 CAPRI benchmark targets. In our top 100 and top 10 ranked structures, for the 'easier' targets that have<br />

many near-native conformations, we obtain a strong enrichment of acceptable or better quality structures; for the 'hard'<br />

targets with very few near-native complexes in the decoys, our method is still able to retain structures which have native<br />

interface contacts. Moreover, CLUB-MARTINI is rather precise for some targets and able to pinpoint near-native<br />

binding modes in top 1, 5, 10 and 20 selections.<br />

INTRODUCTION<br />

Measuring binding free energy is essential to under­stand the<br />

relevance of particular protein-protein interactions in their<br />

biological context. Moreover, at the atomic scale, molecular<br />

simulations give us insight into the physically realistic details<br />

of these interactions. In our recent study, we successfully<br />

applied coarse-grained molecular dynamics simulations to<br />

estimate binding free energy with similar accuracy as and<br />

500-fold less time consuming than full atomistic simulation<br />

(May et al., 2014). The approach relied on the availability of<br />

crystal structures of the protein complex of interest. Here, we<br />

investigate the effectiveness of this approach as a scoring<br />

method to identify stable binding confor­mations out of<br />

docking decoys from protein docking.<br />

We apply our method as an evaluation method to rank more<br />

than 19 000 docked protein conformations, or ‘decoys’, for<br />

15 bench­mark targets from the Critical Assessment of<br />

PRedicted Interactions (CAPRI) (Lensink & Wodak, 2014).<br />

METHODS<br />

For each target, the binding free energy of all decoys was<br />

calculated, using the MARTINI forcefield as introduced<br />

before (May et al., 2014). In short, for a set of closely spaced<br />

separation distances, we calculate the constraint force applied<br />

to maintain the set distance. Integrating this force yields a<br />

potential of mean force (PMF), from which the binding free<br />

energy is extracted as the highest minus the lowest value.<br />

Previously, for accuracy, we used up to 20 replicate<br />

simulations for each distance in the PMF, but for efficiency,<br />

here we use only a single replicate initially. We then selected<br />

the lowest-scoring half to run an additional four replicates to<br />

obtain better sampling and more accurate estimates of the<br />

binding free energy. In total, we used approximately 800 000<br />

core-hours of compute time.<br />

RESULTS & DISCUSSION<br />

We obtained strong enrichment of acceptable and high<br />

quality structures in the TOP 100 based on our PMF free<br />

energies, as shown in Figure 1. We estimate the error of our<br />

energies to be significant. This can be approved by increasing<br />

sampling, but remains very expensive.<br />

Moreover, for several targets, we can select near-native<br />

structures in top 1, top 5 and top 10 as shown in Table 1,<br />

which means that, overall, our method is rather precise. From<br />

estimates of the error, we expect we can improve accuracy by<br />

extending the amount of sampling done at each distance. In<br />

conclusion, our approach can find favorable interactions from<br />

available candidates produced by docking programs. To the<br />

best of our knowledge, this is the first time interaction free<br />

energy from a coarse-grained force field is used as a scoring<br />

method to rank docking solutions at a large scale.<br />

FIG. 1. Enrichment in<br />

percentage of<br />

acceptable or better<br />

structures. For each of<br />

the 13 targets with<br />

acceptable or better<br />

decoys, two columns<br />

(from left to right)<br />

stand for CAPRI<br />

Score_set and top 100<br />

in our rank of binding<br />

free energy calculation. Red, orange and yellow represent the fractions of<br />

high, medium and acceptable quality structures over the number of all or<br />

selected docking decoys. The order (left to right) is based on the fraction<br />

of acceptable structures in each target (easy to difficult)<br />

Table 1. Success selections of top ranked structures<br />

Selection Target\Quality High Medium Acceptable<br />

Total<br />

(% )<br />

TOP 1<br />

T47 1 0 0 100<br />

T53 0 0 1 100<br />

T47 3 2 0 100<br />

TOP 5<br />

T41 0 0 4 80<br />

T53 0 0 3 60<br />

T37 0 2 0 40<br />

T47 7 3 0 100<br />

T41 0 1 7 80<br />

TOP 10 T53 0 1 5 60<br />

T37 0 3 0 30<br />

T50 0 0 1 10<br />

T47 14 6 0 100<br />

T41 0 4 13 85<br />

T53 0 3 9 60<br />

TOP 20 T37 0 4 2 30<br />

T50 0 0 3 15<br />

T40 1 2 0 15<br />

T46 0 0 1 5<br />

REFERENCES<br />

May, Pool, Van Dijk, Bijlard, Abeln, Heringa & Feenstra. Coarsegrained<br />

versus atomistic simulations: realistic interaction free energies<br />

for real proteins. Bioinformatics (2014) 30: 326-334.<br />

Lensink & Wodak. Score_set: A CAPRI benchmark for scoring protein<br />

complexes. Proteins (2014) 82:3163-3169.<br />

41

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!