13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

258 MODELS FOR MATCHED PAIRS<br />

from the discrepancy between n13 and n31. For that pair, the standardized residual<br />

equals (44 − 17)/ √ (44 + 17) = 3.5. Consumers of High Point changed <strong>to</strong> Sanka<br />

more often than the reverse. Otherwise, the symmetry model fits most of the table<br />

fairly well.<br />

The quasi-symmetry model has G 2 = 10.0 and X 2 = 8.5, with df = 6. Permitting<br />

the marginal distributions <strong>to</strong> differ yields a better fit than the symmetry model<br />

provides. We will next see how <strong>to</strong> use this information <strong>to</strong> construct a test of marginal<br />

homogeneity.<br />

8.4.4 Testing Marginal Homogeneity Using Symmetry and Quasi-Symmetry<br />

For the quasi-symmetry model, log(πij /πji) = βi − βj for all i and j, marginal<br />

homogeneity is the special case in which all βi = 0. This special case is the symmetry<br />

model. In other words, for the quasi-symmetry model, marginal homogeneity is<br />

equivalent <strong>to</strong> symmetry.<br />

To test marginal homogeneity, we can test the null hypothesis that the symmetry<br />

(S) model holds against the alternative hypothesis of quasi symmetry (QS). The<br />

likelihood-ratio test compares the G2 goodness-of-fit statistics,<br />

G 2 (S | QS) = G 2 (S) − G 2 (QS)<br />

For I × I tables, the test has df = I − 1.<br />

For Table 8.5, the 5 × 5 table on choice of coffee brand at two purchases,<br />

G 2 (S) = 22.5 and G 2 (QS) = 10.0. The difference G 2 (S | QS) = 12.5, based on<br />

df = 4, provides evidence of differing marginal distributions (P = 0.014).<br />

Section 8.3.1 described other tests of H0: marginal homogeneity, based on the<br />

ML fit under H0 and using pi+ − p+i, i = 1,...,I. For the coffee data, these gave<br />

similar results as using G 2 (S | QS). Those other tests do not assume that the quasisymmetry<br />

model holds. In practice, however, for nominal classifications the statistic<br />

G 2 (S | QS) usually captures most of the information about marginal heterogeneity<br />

even if the quasi-symmetry model shows lack of fit.<br />

8.4.5 An Ordinal Quasi-Symmetry Model<br />

The symmetry and quasi-symmetry models treat the classifications as nominal. A<br />

special case of quasi-symmetry often is useful when the categories are ordinal. Let<br />

u1 ≤ u2 ≤···≤uI denote ordered scores for both the row and column categories.<br />

The ordinal quasi-symmetry model is<br />

log(πij /πji) = β(uj − ui) (8.11)<br />

This is a special case of the quasi-symmetry model (8.10) in which {βi} have a linear<br />

trend. The symmetry model is the special case β = 0.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!