Reviews in Computational Chemistry Volume 18
Reviews in Computational Chemistry Volume 18
Reviews in Computational Chemistry Volume 18
- No tags were found...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
34 Cluster<strong>in</strong>g Methods and Their Uses <strong>in</strong> <strong>Computational</strong> <strong>Chemistry</strong><br />
Acceptance of these methods—and <strong>in</strong>clusion of them as rout<strong>in</strong>e operations<br />
with<strong>in</strong> such applications as lead-f<strong>in</strong>d<strong>in</strong>g strategies, QSAR analyses, and compound<br />
acquisition—has been a gradual process rather than an abrupt revolution.<br />
The current decade should see this process cont<strong>in</strong>ue as the methodologies<br />
are ref<strong>in</strong>ed. The push for such advancement appears to be com<strong>in</strong>g aga<strong>in</strong> from<br />
the <strong>in</strong>formation retrieval community but also from the data m<strong>in</strong><strong>in</strong>g community,<br />
which has made significant progress. The emphasis of current research<br />
is turn<strong>in</strong>g toward the quality of the resultant clusters. It has been shown<br />
that, us<strong>in</strong>g representatives selected from clusters for lead-f<strong>in</strong>d<strong>in</strong>g can <strong>in</strong>crease<br />
the active hit rate significantly and consistently.<br />
The results so far <strong>in</strong> chemistry are promis<strong>in</strong>g, but research <strong>in</strong> other areas<br />
outside of chemistry suggests that cluster<strong>in</strong>g is still a blunt <strong>in</strong>strument that can<br />
be sharpened by ref<strong>in</strong>ements. An example of this ref<strong>in</strong>ement is to be able to<br />
handle mixed or nonnumerical data, and another example is to take more consideration<br />
of cluster sizes, shapes, and distribution. The exist<strong>in</strong>g methods and<br />
implementations used to analyze chemical data sets do an impressive job when<br />
compared with the situation a decade ago. What is excit<strong>in</strong>g is the number of<br />
new ideas that are be<strong>in</strong>g generated that should result <strong>in</strong> significant advances <strong>in</strong><br />
the next decade.<br />
REFERENCES<br />
1. P. H. A. Sneath and R. R. Sokal, Numerical Taxonomy, W. H. Freeman, San Francisco, CA,<br />
1973.<br />
2. L. Kaufman and P. J. Rousseeuw, F<strong>in</strong>d<strong>in</strong>g Groups <strong>in</strong> Data: An Introduction to Cluster<br />
Analysis, Wiley-Interscience, New York, 1990.<br />
3. B. S. Everitt, Cluster Analysis, 3rd ed., Edward Arnold, London, 1993.<br />
4. A. D. Gordon, Classification, 2nd ed., Chapman and Hall, London, 1999.<br />
5. P. Willett, Similarity and Cluster<strong>in</strong>g <strong>in</strong> Chemical Information Systems, Research Studies Press,<br />
Letchworth, UK, 1987.<br />
6. N. Bratchell, Chemom. Intell. Lab. Systems, 6, 105 (1989). Cluster Analysis.<br />
7. J. M. Barnard and G. M. Downs, J. Chem. Inf. Comput. Sci., 32 (6), 644 (1992). Cluster<strong>in</strong>g of<br />
Chemical Structures on the Basis of Two-Dimensional Similarity Measures.<br />
8. G. M. Downs and P. Willett, <strong>in</strong> Advanced Computer-Assisted Techniques <strong>in</strong> Drug Discovery,<br />
H. van de Waterbeemd, Ed., VCH Publishers, We<strong>in</strong>heim, 1994, pp. 111–130. Cluster<strong>in</strong>g of<br />
Chemical Structure Databases for Compound Selection.<br />
9. R. A. Lewis, S. D. Pickett, and D. E. Clark, <strong>in</strong> <strong>Reviews</strong> <strong>in</strong> <strong>Computational</strong> <strong>Chemistry</strong>, K. B.<br />
Lipkowitz and D. B. Boyd, Eds., Wiley-VCH, New York, 2000, Vol. 16, pp. 1–51.<br />
Computer-Aided Molecular Diversity Analysis and Comb<strong>in</strong>atorial Library Design.<br />
10. Clustan Ltd., 16 K<strong>in</strong>gsburgh Road, Ed<strong>in</strong>burgh, UK. http://www.clustan.com.<br />
11. SAS Institute Inc., SAS Campus Drive, Cary, NC 27513, USA. http://www.sas.com.<br />
12. Barnard Chemical Information Ltd., 46 Uppergate Road, Stann<strong>in</strong>gton, Sheffield S6 6BX, UK.<br />
http://www.bci.gb.com.<br />
13. Chemical Comput<strong>in</strong>g Group Inc., 1010 Sherbrooke Street West, Suite 910, Montreal, Quebec<br />
H3A 2R7, Canada. http://www.chemcomp.com.