14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

452 Correlations <strong>and</strong> <strong>Multivariate</strong> Techniques Chapter 17<br />

<strong>Multivariate</strong> Platform Options<br />

Figure 17.4 Example of an Outlier<br />

Mahalanobis Distance<br />

The Mahalanobis Outlier Distance plot shows the Mahalanobis distance of each point from the multivariate<br />

mean (centroid). The st<strong>and</strong>ard Mahalanobis distance depends on estimates of the mean, st<strong>and</strong>ard deviation,<br />

<strong>and</strong> correlation for the data. The distance is plotted for each observation number. Extreme multivariate<br />

outliers can be identified by highlighting the points with the largest distance values. See “Computations <strong>and</strong><br />

Statistical Details” on page 454, for more information.<br />

Jackknife Distances<br />

The Jackknife Distances plot shows distances that are calculated using a jackknife technique. The distance<br />

for each observation is calculated with estimates of the mean, st<strong>and</strong>ard deviation, <strong>and</strong> correlation matrix<br />

that do not include the observation itself. The jack-knifed distances are useful when there is an outlier. In<br />

this case, the Mahalanobis distance is distorted <strong>and</strong> tends to disguise the outlier or make other points look<br />

more outlying than they are.<br />

T 2 Statistic<br />

The T 2 plot shows distances that are the square of the Mahalanobis distance. This plot is preferred for<br />

multivariate control charts. The plot includes the value of the calculated T 2 statistic, as well as its upper<br />

control limit. Values that fall outside this limit might be outliers.<br />

Saving Distances <strong>and</strong> Values<br />

You can save any of the distances to the data table by selecting the Save option from the red triangle menu<br />

for the plot.<br />

Note: There is no formula saved with the distance column. This means that the distance is not recomputed<br />

if you modify the data table. If you add or delete columns, or change values in the data table, you should<br />

select Analyze > <strong>Multivariate</strong> <strong>Methods</strong> > <strong>Multivariate</strong> again to compute new distances.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!