10.01.2015 Views

Package 'fpc' - open source solution for an Internet free, intelligent ...

Package 'fpc' - open source solution for an Internet free, intelligent ...

Package 'fpc' - open source solution for an Internet free, intelligent ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

24 cluster.stats<br />

pearsongamma correlation between dist<strong>an</strong>ces <strong>an</strong>d a 0-1-vector where 0 me<strong>an</strong>s same cluster, 1<br />

me<strong>an</strong>s different clusters. "Normalized gamma" in Halkidi et al. (2001).<br />

dunn minimum separation / maximum diameter. Dunn index, see Halkidi et al. (2002).<br />

dunn2<br />

entropy<br />

wb.ratio<br />

ch<br />

\<br />

Note<br />

cwidegap<br />

widestgap<br />

sindex<br />

minimum average dissimilarity between two cluster / maximum average within<br />

cluster dissimilarity, <strong>an</strong>other version of the family of Dunn indexes.<br />

entropy of the distribution of cluster memberships, see Meila(2007).<br />

average.within/average.between.<br />

Calinski <strong>an</strong>d Harabasz index (Calinski <strong>an</strong>d Harabasz 1974, optimal in Millig<strong>an</strong><br />

<strong>an</strong>d Cooper 1985; generalised <strong>for</strong> dissimilarites in Hennig <strong>an</strong>d Liao 2010)<br />

vector of widest within-cluster gaps.<br />

widest within-cluster gap.<br />

separation index, see argument sepindex.<br />

corrected.r<strong>an</strong>d corrected R<strong>an</strong>d index (if alt.clustering has been specified), see Gordon (1999,<br />

p. 198).<br />

vi<br />

variation of in<strong>for</strong>mation (VI) index (if alt.clustering has been specified), see<br />

Meila (2007).<br />

Because cluster.stats processes a full dissimilarity matrix, it isn’t suitable <strong>for</strong> large data sets.<br />

You may consider distcritmulti in that case.<br />

Author(s)<br />

Christi<strong>an</strong> Hennig http://www.homepages.ucl.ac.uk/~ucakche/<br />

References<br />

Calinski, T., <strong>an</strong>d Harabasz, J. (1974) A Dendrite Method <strong>for</strong> Cluster Analysis, Communications in<br />

Statistics, 3, 1-27.<br />

Gordon, A. D. (1999) Classification, 2nd ed. Chapm<strong>an</strong> <strong>an</strong>d Hall.<br />

Halkidi, M., Batistakis, Y., Vazirgi<strong>an</strong>nis, M. (2001) On Clustering Validation Techniques, Journal<br />

of Intelligent In<strong>for</strong>mation Systems, 17, 107-145.<br />

Hennig, C. <strong>an</strong>d Liao, T. (2010) Comparing latent class <strong>an</strong>d dissimilarity based clustering <strong>for</strong> mixed<br />

type variables with application to social stratification. Research report no. 308, Department of<br />

Statistical Science, UCL. http://www.ucl.ac.uk/Stats/research/reports/psfiles/rr308.<br />

pdf. Revised version accepted <strong>for</strong> publication by Journal of the Royal Statistical Society Series C.<br />

Kaufm<strong>an</strong>, L. <strong>an</strong>d Rousseeuw, P.J. (1990). "Finding Groups in Data: An Introduction to Cluster<br />

Analysis". Wiley, New York.<br />

Meila, M. (2007) Comparing clusterings<strong>an</strong> in<strong>for</strong>mation based dist<strong>an</strong>ce, Journal of Multivariate<br />

Analysis, 98, 873-895.<br />

Millig<strong>an</strong>, G. W. <strong>an</strong>d Cooper, M. C. (1985) An examination of procedures <strong>for</strong> determining the number<br />

of clusters. Psychometrika, 50, 159-179.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!