13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

218 LOGLINEAR MODELS FOR CONTINGENCY TABLES<br />

which translates <strong>to</strong> (0.42, 0.47) for the odds ratio. The odds of injury for passengers<br />

wearing seat belts were less than half the odds for passengers not wearing them, for<br />

each gender–location combination. The fitted odds ratios in Table 7.11 also suggest<br />

that, other fac<strong>to</strong>rs being fixed, injury was more likely in rural than urban accidents<br />

and more likely for females than males. Also, the estimated odds that males used seat<br />

belts are only 0.63 times the estimated odds for females.<br />

7.2.7 Three-Fac<strong>to</strong>r Interaction<br />

Interpretations are more complicated when a model contains three-fac<strong>to</strong>r terms. Such<br />

terms refer <strong>to</strong> interactions, the association between two variables varying across levels<br />

of the third variable. Table 7.10 shows results of adding a single three-fac<strong>to</strong>r term <strong>to</strong><br />

model (GI, GL, GS, IL, IS, LS). Of the four possible models, (GLS, GI, IL, IS) fits<br />

best. Table 7.9 also displays its fit.<br />

For model (GLS, GI, IL, IS), each pair of variables is conditionally dependent,<br />

and at each level of I the association between G and L or between G and S or<br />

between L and S varies across the levels of the remaining variable. For this model, it<br />

is inappropriate <strong>to</strong> interpret the GL, GS, and LS two-fac<strong>to</strong>r terms on their own. For<br />

example, the presence of the GLS term implies that the GS odds ratio varies across<br />

the levels of L. Because I does not occur in a three-fac<strong>to</strong>r term, the conditional odds<br />

ratio between I and each variable is the same at each combination of levels of the<br />

other two variables. The first three lines of Table 7.11 report the fitted odds ratios for<br />

the GI, IL, and IS associations.<br />

When a model has a three-fac<strong>to</strong>r term, <strong>to</strong> study the interaction, calculate fitted<br />

odds ratios between two variables at each level of the third. Do this at any levels of<br />

remaining variables not involved in the interaction. The bot<strong>to</strong>m six lines of Table 7.11<br />

illustrates this for model (GLS, GI, IL, IS). For example, the fitted GS odds ratio of<br />

0.66 for (L = urban) refers <strong>to</strong> four fitted values for urban accidents, both the four<br />

with (injury = no) and the four with (injury = yes); that is,<br />

0.66 = (7273.2)(10, 959.2)/(11, 632.6)(10, 358.9)<br />

= (1009.8)(389.8)/(713.4)(834.1)<br />

7.2.8 Large Samples and Statistical Versus Practical Significance<br />

The sample size can strongly influence results of any inferential procedure. We are<br />

more likely <strong>to</strong> detect an effect as the sample size increases. This suggests a cautionary<br />

remark. For small sample sizes, reality may be more complex than indicated by the<br />

simplest model that passes a goodness-of-fit test. By contrast, for large sample sizes,<br />

statistically significant effects can be weak and unimportant.<br />

We saw above that model (GLS, GI, IL, IS) seems <strong>to</strong> fit much better than<br />

(GI, GL, GS, IL, IS, LS): The difference in G 2 values is 23.4 − 7.5 = 15.9, based<br />

on df = 5 − 4 = 1(P = 0.0001). The fitted odds ratios in Table 7.11, however,<br />

show that the three-fac<strong>to</strong>r interaction is weak. The fitted odds ratio between any two

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!