14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 20 Performing Discriminant Analysis 497<br />

Discriminating Groups<br />

Click Apply This Model to estimate the model. After estimation <strong>and</strong> scoring are done, two reports are<br />

produced: a Canonical Plot (Figure 20.5), <strong>and</strong> a Scoring Report.<br />

Canonical Plot<br />

The Canonical Plot shows the points <strong>and</strong> multivariate means in the two dimensions that best separate the<br />

groups.<br />

Figure 20.5 Canonical Plot<br />

• Each row in the data set is a point, controlled by the Canonical Options > Show Points option.<br />

• Each multivariate mean is a labeled circle. The size of the circle corresponds to a 95% confidence limit<br />

for the mean. Groups that are significantly different tend to have non-intersecting circles. This is<br />

controlled by the Canonical Options > Show Means CL Ellipses option.<br />

• The directions of the variables in the canonical space is shown by labeled rays emanating from the gr<strong>and</strong><br />

mean. This is controlled by the Canonical Options > Show Biplot Rays option. You can drag the<br />

center of the biplot rays to other places in the graph.<br />

• The option Show Normal 50% Contours shows areas that contain roughly 50% of the points for that<br />

group if the assumptions are correct. Under linear discriminant analysis, they are all the same size <strong>and</strong><br />

shape.<br />

In order to have the points color-coded like the centroid circles, use the Color Points option or button.<br />

This is equivalent to Rows > Color or Mark by Column, coloring by the classification column.<br />

The canonical plot can also be referred to as a biplot when both the points <strong>and</strong> the variable direction rays are<br />

shown together, as in Figure 20.5. It is identical to the Centroid plot produced in the Manova personality of<br />

the Fit Model platform.<br />

Discriminant Scores<br />

The scores report shows how well each point is classified. The first five columns of the report represent the<br />

actual (observed) data values, showing row numbers, the actual classification, the distance to the mean of<br />

that classification, <strong>and</strong> the associated probability. JMP graphs -Log(Prob) to show the loss in log-likelihood<br />

when a point is predicted poorly. When the red bar is large, the point is being poorly predicted.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!