14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 4 Fitting Stepwise Regression Models 141<br />

The Stepwise Report<br />

Table 4.4 Description of Current Model Statistics (Continued)<br />

RSquare<br />

RSquare Adj<br />

Cp<br />

Proportion of the variation in the response that can be attributed to terms in<br />

the model rather than to r<strong>and</strong>om error.<br />

Adjusts R 2 to make it more comparable over models with different numbers<br />

of parameters by using the degrees of freedom in its computation. The<br />

adjusted R 2 is useful in stepwise procedure because you are looking at many<br />

different models <strong>and</strong> want to adjust for the number of terms in the model.<br />

Mallow’s C p criterion for selecting a model. It is an alternative measure of<br />

total squared error <strong>and</strong> can be defined as follows:<br />

C p<br />

=<br />

SSE p <br />

-----------<br />

<br />

s 2 – ( N – 2p)<br />

<br />

where s 2 is the MSE for the full model <strong>and</strong> SSE p is the sum-of-squares error<br />

for a model with p variables, including the intercept. Note that p is the<br />

number of x-variables+1. If C p is graphed with p, Mallows (1973)<br />

recommends choosing the model where C p first approaches p.<br />

p<br />

AICc<br />

Number of parameters in the model, including the intercept.<br />

Corrected Akaike’s Information Criterion, defined as follows:<br />

AICc = -2loglikelihood + 2k( k+<br />

1)<br />

2k + ---------------------- n – k – 1<br />

where k is the number of estimated parameters, including intercept <strong>and</strong> error<br />

terms in the model, <strong>and</strong> n is the number of observations in the data set.<br />

Burnham <strong>and</strong> Anderson (2004) discuss using AIC c for model selection. The<br />

best model has the smallest value, as discussed in Akaike (1974).<br />

BIC<br />

Bayesian Information Criterion defined as<br />

-2loglikelihood + k ln(n)<br />

where k is the number of parameters, <strong>and</strong> n is the sample size.<br />

Forward Selection Example<br />

In forward selection, terms are entered into the model <strong>and</strong> most significant terms are added until all of the<br />

terms are significant.<br />

1. Complete the steps in “Example Using Stepwise Regression” on page 135.<br />

Notice that the default selection for Direction is Forward.<br />

2. Click Step.<br />

From the top figure in Figure 4.4, you can see that after one step, the most significant term, Runtime, is<br />

entered into the model.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!