19.02.2013 Views

Reviews in Computational Chemistry Volume 18

Reviews in Computational Chemistry Volume 18

Reviews in Computational Chemistry Volume 18

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

34 Cluster<strong>in</strong>g Methods and Their Uses <strong>in</strong> <strong>Computational</strong> <strong>Chemistry</strong><br />

Acceptance of these methods—and <strong>in</strong>clusion of them as rout<strong>in</strong>e operations<br />

with<strong>in</strong> such applications as lead-f<strong>in</strong>d<strong>in</strong>g strategies, QSAR analyses, and compound<br />

acquisition—has been a gradual process rather than an abrupt revolution.<br />

The current decade should see this process cont<strong>in</strong>ue as the methodologies<br />

are ref<strong>in</strong>ed. The push for such advancement appears to be com<strong>in</strong>g aga<strong>in</strong> from<br />

the <strong>in</strong>formation retrieval community but also from the data m<strong>in</strong><strong>in</strong>g community,<br />

which has made significant progress. The emphasis of current research<br />

is turn<strong>in</strong>g toward the quality of the resultant clusters. It has been shown<br />

that, us<strong>in</strong>g representatives selected from clusters for lead-f<strong>in</strong>d<strong>in</strong>g can <strong>in</strong>crease<br />

the active hit rate significantly and consistently.<br />

The results so far <strong>in</strong> chemistry are promis<strong>in</strong>g, but research <strong>in</strong> other areas<br />

outside of chemistry suggests that cluster<strong>in</strong>g is still a blunt <strong>in</strong>strument that can<br />

be sharpened by ref<strong>in</strong>ements. An example of this ref<strong>in</strong>ement is to be able to<br />

handle mixed or nonnumerical data, and another example is to take more consideration<br />

of cluster sizes, shapes, and distribution. The exist<strong>in</strong>g methods and<br />

implementations used to analyze chemical data sets do an impressive job when<br />

compared with the situation a decade ago. What is excit<strong>in</strong>g is the number of<br />

new ideas that are be<strong>in</strong>g generated that should result <strong>in</strong> significant advances <strong>in</strong><br />

the next decade.<br />

REFERENCES<br />

1. P. H. A. Sneath and R. R. Sokal, Numerical Taxonomy, W. H. Freeman, San Francisco, CA,<br />

1973.<br />

2. L. Kaufman and P. J. Rousseeuw, F<strong>in</strong>d<strong>in</strong>g Groups <strong>in</strong> Data: An Introduction to Cluster<br />

Analysis, Wiley-Interscience, New York, 1990.<br />

3. B. S. Everitt, Cluster Analysis, 3rd ed., Edward Arnold, London, 1993.<br />

4. A. D. Gordon, Classification, 2nd ed., Chapman and Hall, London, 1999.<br />

5. P. Willett, Similarity and Cluster<strong>in</strong>g <strong>in</strong> Chemical Information Systems, Research Studies Press,<br />

Letchworth, UK, 1987.<br />

6. N. Bratchell, Chemom. Intell. Lab. Systems, 6, 105 (1989). Cluster Analysis.<br />

7. J. M. Barnard and G. M. Downs, J. Chem. Inf. Comput. Sci., 32 (6), 644 (1992). Cluster<strong>in</strong>g of<br />

Chemical Structures on the Basis of Two-Dimensional Similarity Measures.<br />

8. G. M. Downs and P. Willett, <strong>in</strong> Advanced Computer-Assisted Techniques <strong>in</strong> Drug Discovery,<br />

H. van de Waterbeemd, Ed., VCH Publishers, We<strong>in</strong>heim, 1994, pp. 111–130. Cluster<strong>in</strong>g of<br />

Chemical Structure Databases for Compound Selection.<br />

9. R. A. Lewis, S. D. Pickett, and D. E. Clark, <strong>in</strong> <strong>Reviews</strong> <strong>in</strong> <strong>Computational</strong> <strong>Chemistry</strong>, K. B.<br />

Lipkowitz and D. B. Boyd, Eds., Wiley-VCH, New York, 2000, Vol. 16, pp. 1–51.<br />

Computer-Aided Molecular Diversity Analysis and Comb<strong>in</strong>atorial Library Design.<br />

10. Clustan Ltd., 16 K<strong>in</strong>gsburgh Road, Ed<strong>in</strong>burgh, UK. http://www.clustan.com.<br />

11. SAS Institute Inc., SAS Campus Drive, Cary, NC 27513, USA. http://www.sas.com.<br />

12. Barnard Chemical Information Ltd., 46 Uppergate Road, Stann<strong>in</strong>gton, Sheffield S6 6BX, UK.<br />

http://www.bci.gb.com.<br />

13. Chemical Comput<strong>in</strong>g Group Inc., 1010 Sherbrooke Street West, Suite 910, Montreal, Quebec<br />

H3A 2R7, Canada. http://www.chemcomp.com.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!