14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

664 Statistical Details Appendix A<br />

The Factor Models<br />

It turns out that the design columns for missing cells for any interaction will always knock out degrees of<br />

freedom for the main effect (for nominal factors). Thus, there is a direct relation between the<br />

nonestimability of least squares means <strong>and</strong> the loss of degrees of freedom for testing the effect corresponding<br />

to these least squares means.<br />

How does this compare with what GLM does? GLM <strong>and</strong> JMP do the same test when there are no missing<br />

cells. That is, they effectively test that the least squares means are equal. But when GLM encounters<br />

singularities, it focuses out these cells in different ways, depending on whether they are Type III or Type IV.<br />

For Type IV, it looks for estimable combinations that it can find. These might not be unique, <strong>and</strong> if you<br />

reorder the levels, you might get a different result. For Type III, it does some orthogonalization of the<br />

estimable functions to obtain a unique test. But the test might not be very interpretable in terms of the cell<br />

means.<br />

The JMP approach has several points in its favor, although at first it might seem distressing that you might<br />

lose more degrees of freedom than with GLM:<br />

1. The tests are philosophically linked to LSMs.<br />

2. The tests are easy computationally, using reduction sum of squares for reparameterized models.<br />

3. The tests agree with Hocking’s “Effective Hypothesis Tests”.<br />

4. The tests are whole marginal tests, meaning they always go completely across other effects in interactions.<br />

The last point needs some elaboration: Consider a graph of the expected values of the cell means in the<br />

previous example with a missing cell for A3B2.<br />

B1<br />

A3B2?<br />

B2<br />

A3B2?<br />

A1 A2 A3<br />

The graph shows expected cell means with a missing cell. The means of the A1 <strong>and</strong> A2 cells are profiled<br />

across the B levels. The JMP approach says you can’t test the B main effect with a missing A3B2 cell, because<br />

the mean of the missing cell could be anything, as allowed by the interaction term. If the mean of the<br />

missing cell were the higher value shown, the B effect would likely test significant. If it were the lower, it<br />

would likely test nonsignificant. The point is that you don’t know. That is what the least squares means are<br />

saying when they are declared nonestimable. That is what the hypotheses for the effects should be saying<br />

too—that you don’t know.<br />

If you want to test hypotheses involving margins for subsets of cells, then that is what GLM Type IV does.<br />

In JMP you would have to construct these tests yourself by partitioning the effects with a lot of calculations<br />

or by using contrasts.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!