14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 3 Fitting St<strong>and</strong>ard Least Squares Models 101<br />

Row Diagnostics<br />

Durbin-Watson Test<br />

Displays the Durbin-Watson statistic to test whether the errors have<br />

first-order autocorrelation. The autocorrelation of the residuals is also<br />

shown. The Durbin-Watson table has a popup option that computes <strong>and</strong><br />

displays the exact probability associated with the statistic. This<br />

Durbin-Watson table is appropriate only for time series data when you<br />

suspect that the errors are correlated across time.<br />

Note: The computation of the Durbin-Watson exact probability can be<br />

time-intensive if there are many observations. The space <strong>and</strong> time needed for<br />

the computation increase with the square <strong>and</strong> the cube of the number of<br />

observations, respectively.<br />

Leverage Plots<br />

To graphically view the significance of the model or focus attention on whether an effect is significant, you<br />

want to display the data by focusing on the hypothesis for that effect. You might say that you want more of<br />

an X-ray picture showing the inside of the data rather than a surface view from the outside. The leverage<br />

plot gives this view of your data; it offers maximum insight into how the fit carries the data.<br />

The effect in a model is tested for significance by comparing the sum of squared residuals to the sum of<br />

squared residuals of the model with that effect removed. Residual errors that are much smaller when the<br />

effect is included in the model confirm that the effect is a significant contribution to the fit.<br />

The graphical display of an effect’s significance test is called a leverage plot. See Sall (1990). This type of plot<br />

shows for each point what the residual would be both with <strong>and</strong> without that effect in the model. Leverage<br />

plots are found in the Row Diagnostics submenu of the Fit Model report.<br />

A leverage plot is constructed as illustrated in Figure 3.38. The distance from a point to the line of fit shows<br />

the actual residual. The distance from the point to the horizontal line of the mean shows what the residual<br />

error would be without the effect in the model. In other words, the mean line in this leverage plot represents<br />

the model where the hypothesized value of the parameter (effect) is constrained to zero.<br />

Historically, leverage plots are referred to as a partial-regression residual leverage plot by Belsley, Kuh, <strong>and</strong><br />

Welsch (1980) or an added variable plot by Cook <strong>and</strong> Weisberg (1982).<br />

The term leverage is used because a point exerts more influence on the fit if it is farther away from the<br />

middle of the plot in the horizontal direction. At the extremes, the differences of the residuals before <strong>and</strong><br />

after being constrained by the hypothesis are greater <strong>and</strong> contribute a larger part of the sums of squares for<br />

that effect’s hypothesis test.<br />

The fitting platform produces a leverage plot for each effect in the model. In addition, there is one special<br />

leverage plot titled Whole Model that shows the actual values of the response plotted against the predicted<br />

values. This Whole Model leverage plot dramatizes the test that all the parameters (except intercepts) in the<br />

model are zero. The same test is reported in the Analysis of Variance report.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!