13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.4 STATISTICAL INFERENCE AND MODEL CHECKING 85<br />

3.4.2 Example: Snoring and Heart Disease Revisited<br />

Section 3.2.2 used a linear probability model <strong>to</strong> describe the probability of heart<br />

disease π(x)as a linear function of snoring level x for data in Table 3.1. The ML model<br />

fit is ˆπ = 0.0172 + 0.0198x, where the snoring effect ˆβ = 0.0198 has SE = 0.0028.<br />

The Wald test of H0: β = 0 against Ha: β �= 0 treats<br />

z = ˆβ/SE = 0.0198/0.0028 = 7.1<br />

as standard normal, or z 2 = 50.0 as chi-squared with df = 1. This provides extremely<br />

strong evidence of a positive snoring effect on the probability of heart disease (P <<br />

0.0001). We obtain similar strong evidence from a likelihood-ratio test comparing this<br />

model <strong>to</strong> the simpler one having β = 0. That chi-squared statistic equals −2(L0 −<br />

L1) = 65.8 with df = 1 (P < 0.0001). The likelihood-ratio 95% confidence interval<br />

for β is (0.0145, 0.0255).<br />

3.4.3 The Deviance<br />

Let LM denote the maximized log-likelihood value for a model M of interest. Let<br />

LS denote the maximized log-likelihood value for the most complex model possible.<br />

This model has a separate parameter for each observation, and it provides a perfect<br />

fit <strong>to</strong> the data. The model is said <strong>to</strong> be saturated.<br />

For example, suppose M is the linear probability model, π(x) = α + βx, applied<br />

<strong>to</strong> the 4 × 2 Table 3.1 on snoring and heart disease. The model has two parameters<br />

for describing how the probability of heart disease changes for the four levels of x =<br />

snoring. The corresponding saturated model has a separate parameter for each of the<br />

four binomial observations: π(x) = π1 for never snorers, π(x) = π2 for occasional<br />

snorers, π(x) = π3 for snoring nearly every night, π(x) = π4 for snoring every night.<br />

The ML estimate for πi is simply the sample proportion having heart disease at level<br />

i of snoring.<br />

Because the saturated model has additional parameters, its maximized log<br />

likelihood LS is at least as large as the maximized log likelihood LM for a simpler<br />

model M. The deviance of a GLM is defined as<br />

Deviance =−2[LM − LS]<br />

The deviance is the likelihood-ratio statistic for comparing model M <strong>to</strong> the saturated<br />

model. It is a test statistic for the hypothesis that all parameters that are in the saturated<br />

model but not in model M equal zero. GLM software provides the deviance, so it is<br />

not necessary <strong>to</strong> calculate LM or LS.<br />

For some GLMs, the deviance has approximately a chi-squared distribution. For<br />

example, in Section 5.2.2 we will see this happens for binary GLMs with a fixed<br />

number of explana<strong>to</strong>ry levels in which each observation is a binomial variate having<br />

relatively large counts of successes and failures. For such cases, the deviance

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!