14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 18 Clustering Data 471<br />

K-Means Clustering<br />

Single Step enables you to step through the clustering process one iteration at a time using a Step<br />

button, or automate the process using a Go button.<br />

Use within-cluster std deviations If you do not use this option, all distances are scaled by an overall<br />

estimate of the st<strong>and</strong>ard deviation of each variable. If you use this option, distances are scaled by the<br />

st<strong>and</strong>ard deviation estimated for each cluster.<br />

Shift distances using sampling rates assumes that you have a mix of unequally sized clusters, <strong>and</strong><br />

points should give preference to being assigned to larger clusters because there is a greater prior<br />

probability that it is from a larger cluster. This option is an advanced feature. The calculations for this<br />

option are implied, but not shown for normal mixtures.<br />

K-Means Report<br />

Clicking Go in the Control Panel in Figure 18.5 produces the K-Means report, shown in Figure 18.6.<br />

Figure 18.6 K-Means Report<br />

The report gives summary statistics for each cluster:<br />

• count of number of observations<br />

• means for each variable<br />

• st<strong>and</strong>ard deviations for each variable.<br />

The Cluster Comparison report gives fit statistics to compare different numbers of clusters. For KMeans<br />

Clustering <strong>and</strong> Self Organizing Maps, the fit statistic is CCC (Cubic Clustering Criterion). For Normal<br />

Mixtures, the fit statistic is BIC or AICc. Robust Normal Mixtures does not provide a fit statistic.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!