14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

146 Fitting Stepwise Regression Models Chapter 4<br />

Models with Nominal <strong>and</strong> Ordinal Terms<br />

The progress of multiterm inclusion is a balance between numerator degrees of freedom <strong>and</strong> opportunities<br />

to improve the fit. When there are significant interaction terms, often several terms enter at the same step. If<br />

the Step button is clicked once, Ct*T is entered along with its two contained effects Ct <strong>and</strong> T. However, a<br />

step back is not symmetric because a crossed term can be removed without removing its two component<br />

terms. Note that Ct now has 2 degrees of freedom because if Stepwise removes Ct, it also removes Ct*T.<br />

Models with Nominal <strong>and</strong> Ordinal Terms<br />

Traditionally, stepwise regression has not addressed the situation when there are categorical terms in the<br />

model. When nominal or ordinal terms are in regression models, they are carried as sets of dummy or<br />

indicator columns. When there are only two levels, there is no problem because they generate only a single<br />

column. However, for more than two levels, multiple columns must be h<strong>and</strong>led. The convention in JMP for<br />

nominal variables in st<strong>and</strong>ard platforms is to model these terms so that the parameter estimates average out<br />

to zero across all the levels.<br />

In the stepwise platform, categorical variables (nominal <strong>and</strong> ordinal) are coded in a hierarchical fashion,<br />

which is different from the other least squares fitting platforms. In hierarchical coding, the levels of the<br />

categorical variable are considered in some order <strong>and</strong> a split is made to make the two groups of levels that<br />

most separate the means of the response. Then, each group is further subdivided into its most separated<br />

subgroups, <strong>and</strong> so on, until all the levels are distinguished into k - 1 terms for k levels.<br />

For nominal terms, the order of levels is determined by the means of the Ys. For ordinal terms, the order is<br />

fixed.<br />

Example of a Model with a Nominal Term<br />

1. Open the Football.jmp sample data table.<br />

2. Select Analyze > Fit Model.<br />

3. Select Speed <strong>and</strong> click Y.<br />

4. Select Weight <strong>and</strong> Position2 <strong>and</strong> click Add.<br />

Notice that Position2 is a nominal variable with values representing football positions.<br />

5. For Personality, select Stepwise.<br />

6. Click Run.<br />

Figure 4.11 Position Hierarchy

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!