02.10.2012 Views

Identification of Natural-Product-Derived ... - AnalytiCon Discovery

Identification of Natural-Product-Derived ... - AnalytiCon Discovery

Identification of Natural-Product-Derived ... - AnalytiCon Discovery

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Virtual Screening for 5-Lipoxygenase Inhibitors Journal <strong>of</strong> Medicinal Chemistry, 2007, Vol. 50, No. 11 2643<br />

Chart 3. Virtual Hits a<br />

a Retrieved by similarity searching in <strong>AnalytiCon</strong>’s NAT-5 compound collection with molecules 1 and 2 as queries.<br />

(PPP; hydrogen-bond donor, hydrogen-bond acceptor, lipophilic,<br />

positive ionizable, negative ionizable) pairs were determined for<br />

topological distances <strong>of</strong> 0-9 bonds. This resulted in a 150dimensional<br />

descriptor vector. PPP-pair counts were scaled by PPPtype<br />

occurrence. For each <strong>of</strong> the 43 reference molecules, the 10<br />

most similar molecules were retrieved from the screening pool<br />

(<strong>AnalytiCon</strong> <strong>Discovery</strong> libraries v.2004: 1298 unique pure natural<br />

products in the MEGx collection and 7839 unique natural-productderived<br />

compounds denoted as NATx; <strong>AnalytiCon</strong> <strong>Discovery</strong><br />

GmbH, Hermannswerder Haus 17, D-14473 Potsdam, Germany)<br />

by use <strong>of</strong> the Euclidian distance metric.<br />

(B) CATS-Charge. This similarity searching approach is based<br />

on the correlation vector concept <strong>of</strong> Gasteiger and co-workers, 21<br />

using estimated three-dimensional conformations and partial atom<br />

charges. 9 The three-dimensional Euclidean distances <strong>of</strong> all atompair<br />

combinations in one molecule were calculated. Distances within<br />

a certain range (0.16 Å) were allocated to the same bin. The charges<br />

<strong>of</strong> the two atoms that form a pair were multiplied to yield a single<br />

charge value per pair. Charge values that were assigned to the same<br />

bin were summed up. We used 101 bin borders in equidistant steps<br />

from 0 to 16 Å. All distances greater than 16 Å were associated<br />

with the last bin. The output was a 100-dimensional correlation<br />

vector, which characterizes the molecules by means <strong>of</strong> their partial<br />

atom charge distribution. The correlation vector representation was<br />

calculated as<br />

A<br />

CV d ) ∑ i)1<br />

A<br />

∑δij (qiqj ) d<br />

j)1<br />

where d is the atom-atom distance, qi and qj are partial atom<br />

charges, A is the number <strong>of</strong> atoms, and δ is the Kronecker delta<br />

that equates to 1 if a pair exists and 0 otherwise. The spatial<br />

structures for the COBRA database were calculated with the<br />

program CORINA (Molecular Networks GmbH, Nägelsbachstrasse<br />

25, D-91052 Erlangen, Germany), 22 and partial atom charges,<br />

including hydrogens, were assigned by use <strong>of</strong> the PETRA s<strong>of</strong>tware<br />

(Molecular Networks GmbH, Nägelsbachstrasse 25, D-91052<br />

Erlangen, Germany).<br />

(C) CATS-TripleCharge. The difference between the CATS-<br />

Charge and the CATS-TripleCharge descriptor lies in atom pair<br />

counting. While with the CATS-Charge descriptor all atom pairs<br />

within a certain distance are collected in the same bin, the CATS-<br />

TripleCharge descriptor differentiates between charge combinations<br />

(1)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!