bbc 2015
BBC2015_booklet
BBC2015_booklet
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
BeNeLux Bioinformatics Conference – Antwerp, December 7-8 <strong>2015</strong><br />
Abstract ID: O21<br />
Oral presentation<br />
10th Benelux Bioinformatics Conference <strong>bbc</strong> <strong>2015</strong><br />
O21. CLUB-MARTINI: SELECTING FAVORABLE INTERACTIONS<br />
AMONGST AVAILABLE CANDIDATES: A COARSE-GRAINED SIMULATION<br />
APPROACH TO SCORING DOCKING DECOYS<br />
Qingzhen Hou 1* , Kamil K. Belau 2 , Marc F. Lensink 3 , Jaap Heringa 1 & K. Anton Feenstra 1* .<br />
Center for Integrative Bioinformatics VU (IBIVU), VU University Amsterdam, De Boelelaan 1081A, 1081 HV<br />
Amsterdam, The Netherlands 1 ; Intercollegiate Faculty of Biotechnology, University of Gdańsk - Medical University of<br />
Gdańsk, Kładki 24, 80-822 Gdańsk, Poland 2 ; Institute for Structural and Functional Glycobiology (UGSF), CNRS<br />
UMR8576, FRABio FR3688, University Lille, 59000, Lille, France 3 .<br />
Protein-protein Interactions (PPIs) play a central role in all cellular processes. Large-scale identification of native binding<br />
orientations is essential to understand the role of particular protein-protein interactions in their biological context. We<br />
estimate the binding free energy using coarse-grained simulations with the MARTINI forcefield, and use those to rank<br />
decoys for 15 CAPRI benchmark targets. In our top 100 and top 10 ranked structures, for the 'easier' targets that have<br />
many near-native conformations, we obtain a strong enrichment of acceptable or better quality structures; for the 'hard'<br />
targets with very few near-native complexes in the decoys, our method is still able to retain structures which have native<br />
interface contacts. Moreover, CLUB-MARTINI is rather precise for some targets and able to pinpoint near-native<br />
binding modes in top 1, 5, 10 and 20 selections.<br />
INTRODUCTION<br />
Measuring binding free energy is essential to understand the<br />
relevance of particular protein-protein interactions in their<br />
biological context. Moreover, at the atomic scale, molecular<br />
simulations give us insight into the physically realistic details<br />
of these interactions. In our recent study, we successfully<br />
applied coarse-grained molecular dynamics simulations to<br />
estimate binding free energy with similar accuracy as and<br />
500-fold less time consuming than full atomistic simulation<br />
(May et al., 2014). The approach relied on the availability of<br />
crystal structures of the protein complex of interest. Here, we<br />
investigate the effectiveness of this approach as a scoring<br />
method to identify stable binding conformations out of<br />
docking decoys from protein docking.<br />
We apply our method as an evaluation method to rank more<br />
than 19 000 docked protein conformations, or ‘decoys’, for<br />
15 benchmark targets from the Critical Assessment of<br />
PRedicted Interactions (CAPRI) (Lensink & Wodak, 2014).<br />
METHODS<br />
For each target, the binding free energy of all decoys was<br />
calculated, using the MARTINI forcefield as introduced<br />
before (May et al., 2014). In short, for a set of closely spaced<br />
separation distances, we calculate the constraint force applied<br />
to maintain the set distance. Integrating this force yields a<br />
potential of mean force (PMF), from which the binding free<br />
energy is extracted as the highest minus the lowest value.<br />
Previously, for accuracy, we used up to 20 replicate<br />
simulations for each distance in the PMF, but for efficiency,<br />
here we use only a single replicate initially. We then selected<br />
the lowest-scoring half to run an additional four replicates to<br />
obtain better sampling and more accurate estimates of the<br />
binding free energy. In total, we used approximately 800 000<br />
core-hours of compute time.<br />
RESULTS & DISCUSSION<br />
We obtained strong enrichment of acceptable and high<br />
quality structures in the TOP 100 based on our PMF free<br />
energies, as shown in Figure 1. We estimate the error of our<br />
energies to be significant. This can be approved by increasing<br />
sampling, but remains very expensive.<br />
Moreover, for several targets, we can select near-native<br />
structures in top 1, top 5 and top 10 as shown in Table 1,<br />
which means that, overall, our method is rather precise. From<br />
estimates of the error, we expect we can improve accuracy by<br />
extending the amount of sampling done at each distance. In<br />
conclusion, our approach can find favorable interactions from<br />
available candidates produced by docking programs. To the<br />
best of our knowledge, this is the first time interaction free<br />
energy from a coarse-grained force field is used as a scoring<br />
method to rank docking solutions at a large scale.<br />
FIG. 1. Enrichment in<br />
percentage of<br />
acceptable or better<br />
structures. For each of<br />
the 13 targets with<br />
acceptable or better<br />
decoys, two columns<br />
(from left to right)<br />
stand for CAPRI<br />
Score_set and top 100<br />
in our rank of binding<br />
free energy calculation. Red, orange and yellow represent the fractions of<br />
high, medium and acceptable quality structures over the number of all or<br />
selected docking decoys. The order (left to right) is based on the fraction<br />
of acceptable structures in each target (easy to difficult)<br />
Table 1. Success selections of top ranked structures<br />
Selection Target\Quality High Medium Acceptable<br />
Total<br />
(% )<br />
TOP 1<br />
T47 1 0 0 100<br />
T53 0 0 1 100<br />
T47 3 2 0 100<br />
TOP 5<br />
T41 0 0 4 80<br />
T53 0 0 3 60<br />
T37 0 2 0 40<br />
T47 7 3 0 100<br />
T41 0 1 7 80<br />
TOP 10 T53 0 1 5 60<br />
T37 0 3 0 30<br />
T50 0 0 1 10<br />
T47 14 6 0 100<br />
T41 0 4 13 85<br />
T53 0 3 9 60<br />
TOP 20 T37 0 4 2 30<br />
T50 0 0 3 15<br />
T40 1 2 0 15<br />
T46 0 0 1 5<br />
REFERENCES<br />
May, Pool, Van Dijk, Bijlard, Abeln, Heringa & Feenstra. Coarsegrained<br />
versus atomistic simulations: realistic interaction free energies<br />
for real proteins. Bioinformatics (2014) 30: 326-334.<br />
Lensink & Wodak. Score_set: A CAPRI benchmark for scoring protein<br />
complexes. Proteins (2014) 82:3163-3169.<br />
41