13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

PROBLEMS 163<br />

PROBLEMS<br />

5.1 For the horseshoe crab data (available at www.stat.ufl.edu/∼aa/<br />

intro-cda/appendix.html), fit a model using weight and width as predic<strong>to</strong>rs.<br />

a. Report the prediction equation.<br />

b. Conduct a likelihood-ratio test of H0: β1 = β2 = 0. Interpret.<br />

c. Conduct separate likelihood-ratio tests for the partial effects of each variable.<br />

Why does neither test show evidence of an effect when the test in (b)<br />

shows very strong evidence?<br />

5.2 For the horseshoe crab data, use a stepwise procedure <strong>to</strong> select a model for<br />

the probability of a satellite when weight, spine condition, and color (nominal<br />

scale) are the predic<strong>to</strong>rs. Explain each step of the process.<br />

5.3 For the horseshoe crab data with width, color, and spine as predic<strong>to</strong>rs, suppose<br />

you start a backward elimination process with the most complex model possible.<br />

Denoted by C ∗ S ∗ W , it uses main effects for each term as well as the<br />

three two-fac<strong>to</strong>r interactions and the three-fac<strong>to</strong>r interaction. Table 5.9 shows<br />

the fit for this model and various simpler models.<br />

a. Conduct a likelihood-ratio test comparing this model <strong>to</strong> the simpler model<br />

that removes the three-fac<strong>to</strong>r interaction term but has all the two-fac<strong>to</strong>r<br />

interactions. Does this suggest that the three-fac<strong>to</strong>r term can be removed<br />

from the model?<br />

b. At the next stage, if we were <strong>to</strong> drop one term, explain why we would select<br />

model C ∗ S + C ∗ W .<br />

c. For the model at this stage, comparing <strong>to</strong> the model S + C ∗ W results in<br />

an increased deviance of 8.0 on df = 6(P = 0.24); comparing <strong>to</strong> the model<br />

W + C ∗ S has an increased deviance of 3.9 on df = 3(P = 0.27). Which<br />

term would you take out?<br />

Table 5.9. Logistic Regression Models for Horseshoe<br />

Crab <strong>Data</strong><br />

Model Predic<strong>to</strong>rs Deviance df AIC<br />

1 C ∗ S ∗ W 170.44 152 212.4<br />

2 C ∗ S + C ∗ W + S ∗ W 173.68 155 209.7<br />

3a C ∗ S + S ∗ W 177.34 158 207.3<br />

3b C ∗ W + S ∗ W 181.56 161 205.6<br />

3c C ∗ S + C ∗ W 173.69 157 205.7<br />

4a S + C ∗ W 181.64 163 201.6<br />

4b W + C ∗ S 177.61 160 203.6<br />

5 C + S + W 186.61 166 200.6

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!