19.02.2013 Views

Reviews in Computational Chemistry Volume 18

Reviews in Computational Chemistry Volume 18

Reviews in Computational Chemistry Volume 18

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

58 The Use of Scor<strong>in</strong>g Functions <strong>in</strong> Drug Discovery Applications<br />

volume correction term is used, and the sampl<strong>in</strong>g cutoff (the radius of the<br />

reference sphere) is set to only 6 A ˚ . The <strong>in</strong>dividual potentials have the form<br />

EijðrÞ ¼ kTðln½r ij ðrÞ=r ij Š ln½rðrÞŠÞ ½5Š<br />

The pair potentials <strong>in</strong> Eq. [5] are used <strong>in</strong> comb<strong>in</strong>ation with other potentials,<br />

depend<strong>in</strong>g on one (prote<strong>in</strong> or ligand) atom type only, that express the propensity<br />

of an atom type to be buried with<strong>in</strong> a lipophilic prote<strong>in</strong> environment upon<br />

complex formation. Contributions of these surface potentials and the pair<br />

potentials are weighted equally <strong>in</strong> the f<strong>in</strong>al scor<strong>in</strong>g function. DrugScore was<br />

developed with the aim of differentiat<strong>in</strong>g between correctly docked ligand<br />

structures versus decoy (arbitrarily placed) structures for the same prote<strong>in</strong>–<br />

ligand pair.<br />

A different type of reference state was chosen by Mitchell et al. 85 The<br />

pair <strong>in</strong>teraction energy is written as<br />

EijðrÞ ¼kT ln½1 þ m ij sŠ kT ln½1 þ m ij sr ij ðrÞ=rðrÞŠ<br />

Here, the number density r ij ðrÞ is def<strong>in</strong>ed as <strong>in</strong> Eq. [4], but it is normalized by<br />

the number density of all atom pairs at this same distance <strong>in</strong>stead of by the<br />

number of pairs ij <strong>in</strong> the whole reference volume. The variable m ij is the number<br />

of atom pairs ij found <strong>in</strong> the data set of prote<strong>in</strong>–ligand complexes, and s is<br />

an empirical factor that def<strong>in</strong>es the weight of each observation. This potential<br />

is comb<strong>in</strong>ed with a van der Waals potential as a reference state to compensate<br />

for the lack of sampl<strong>in</strong>g at short distances and for certa<strong>in</strong> underrepresented<br />

atom pairs. Apart from data on 90 prote<strong>in</strong>–ligand complexes used <strong>in</strong> the orig<strong>in</strong>al<br />

validation, no further application has been published.<br />

CRITICAL ASSESSMENT OF CURRENT<br />

SCORING FUNCTIONS<br />

Influence of the Tra<strong>in</strong><strong>in</strong>g Data<br />

All fast scor<strong>in</strong>g functions share a number of deficiencies that one should<br />

be aware of for any application. First, most scor<strong>in</strong>g functions are <strong>in</strong> some way<br />

fitted to or derived from experimental data. The result<strong>in</strong>g functions necessarily<br />

reflect the accuracy of the data that were used <strong>in</strong> their derivation. A general<br />

problem with empirical scor<strong>in</strong>g functions is the fact that the experimental<br />

b<strong>in</strong>d<strong>in</strong>g energies are compiled from many different sources and therefore<br />

form <strong>in</strong>consistent data sets conta<strong>in</strong><strong>in</strong>g systematic experimental errors. Scor<strong>in</strong>g<br />

functions not only reflect the quality, but also the type of experimental data on<br />

which they are based. Most scor<strong>in</strong>g functions are still derived from data on<br />

mostly high-aff<strong>in</strong>ity receptor–ligand complexes. Moreover, many of these

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!