Package 'fpc' - open source solution for an Internet free, intelligent ...
Package 'fpc' - open source solution for an Internet free, intelligent ...
Package 'fpc' - open source solution for an Internet free, intelligent ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
24 cluster.stats<br />
pearsongamma correlation between dist<strong>an</strong>ces <strong>an</strong>d a 0-1-vector where 0 me<strong>an</strong>s same cluster, 1<br />
me<strong>an</strong>s different clusters. "Normalized gamma" in Halkidi et al. (2001).<br />
dunn minimum separation / maximum diameter. Dunn index, see Halkidi et al. (2002).<br />
dunn2<br />
entropy<br />
wb.ratio<br />
ch<br />
\<br />
Note<br />
cwidegap<br />
widestgap<br />
sindex<br />
minimum average dissimilarity between two cluster / maximum average within<br />
cluster dissimilarity, <strong>an</strong>other version of the family of Dunn indexes.<br />
entropy of the distribution of cluster memberships, see Meila(2007).<br />
average.within/average.between.<br />
Calinski <strong>an</strong>d Harabasz index (Calinski <strong>an</strong>d Harabasz 1974, optimal in Millig<strong>an</strong><br />
<strong>an</strong>d Cooper 1985; generalised <strong>for</strong> dissimilarites in Hennig <strong>an</strong>d Liao 2010)<br />
vector of widest within-cluster gaps.<br />
widest within-cluster gap.<br />
separation index, see argument sepindex.<br />
corrected.r<strong>an</strong>d corrected R<strong>an</strong>d index (if alt.clustering has been specified), see Gordon (1999,<br />
p. 198).<br />
vi<br />
variation of in<strong>for</strong>mation (VI) index (if alt.clustering has been specified), see<br />
Meila (2007).<br />
Because cluster.stats processes a full dissimilarity matrix, it isn’t suitable <strong>for</strong> large data sets.<br />
You may consider distcritmulti in that case.<br />
Author(s)<br />
Christi<strong>an</strong> Hennig http://www.homepages.ucl.ac.uk/~ucakche/<br />
References<br />
Calinski, T., <strong>an</strong>d Harabasz, J. (1974) A Dendrite Method <strong>for</strong> Cluster Analysis, Communications in<br />
Statistics, 3, 1-27.<br />
Gordon, A. D. (1999) Classification, 2nd ed. Chapm<strong>an</strong> <strong>an</strong>d Hall.<br />
Halkidi, M., Batistakis, Y., Vazirgi<strong>an</strong>nis, M. (2001) On Clustering Validation Techniques, Journal<br />
of Intelligent In<strong>for</strong>mation Systems, 17, 107-145.<br />
Hennig, C. <strong>an</strong>d Liao, T. (2010) Comparing latent class <strong>an</strong>d dissimilarity based clustering <strong>for</strong> mixed<br />
type variables with application to social stratification. Research report no. 308, Department of<br />
Statistical Science, UCL. http://www.ucl.ac.uk/Stats/research/reports/psfiles/rr308.<br />
pdf. Revised version accepted <strong>for</strong> publication by Journal of the Royal Statistical Society Series C.<br />
Kaufm<strong>an</strong>, L. <strong>an</strong>d Rousseeuw, P.J. (1990). "Finding Groups in Data: An Introduction to Cluster<br />
Analysis". Wiley, New York.<br />
Meila, M. (2007) Comparing clusterings<strong>an</strong> in<strong>for</strong>mation based dist<strong>an</strong>ce, Journal of Multivariate<br />
Analysis, 98, 873-895.<br />
Millig<strong>an</strong>, G. W. <strong>an</strong>d Cooper, M. C. (1985) An examination of procedures <strong>for</strong> determining the number<br />
of clusters. Psychometrika, 50, 159-179.