13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

9.2 MARGINAL MODELING: THE GEE APPROACH 279<br />

This model assumes that the linear time effect β3 is the same for each group. The<br />

sample proportions in Table 9.2, however, show a higher rate of improvement for the<br />

new drug. A more realistic model permits the time effect <strong>to</strong> differ by drug. We do this<br />

by including a drug-by-time interaction term,<br />

logit[P(Yt = 1)] =α + β1s + β2d + β3t + β4(d × t)<br />

Here, β3 describes the time effect for the standard drug (d = 0) and β3 + β4 describes<br />

the time effect for the new drug (d = 1).<br />

We will fit this model, interpret the estimates, and make inferences in Section 9.2.<br />

We will see that an estimated slope (on the logit scale) for the standard drug is<br />

ˆβ3 = 0.48. For the new drug the estimated slope increases by ˆβ4 = 1.02, yielding an<br />

estimated slope of ˆβ3 + ˆβ4 = 1.50.<br />

9.1.3 Conditional Models for a Repeated Response<br />

The models just considered describe how P(Yt = 1), the probability of a normal<br />

response at time t, depends on severity, drug, and time, for a randomly selected subject.<br />

By contrast, for matched pairs Section 8.2.3 presented a different type of model<br />

that describes probabilities at the subject level. That model permits heterogeneity<br />

among subjects, even at fixed levels of the explana<strong>to</strong>ry variables.<br />

Let Yit denote the response for subject i at time t. For the depression data, a<br />

subject-specific analog of the model just considered is<br />

logit[P(Yit = 1)] =αi + β1s + β2d + β3t + β4(d × t)<br />

Each subject has their own intercept (αi), reflecting variability in the probability<br />

among subjects at a particular setting (s,d,t)for the explana<strong>to</strong>ry variables.<br />

This is called a conditional model, because the effects are defined conditional on<br />

the subject. For example, the model identifies β3 as the time effect for a given subject<br />

using the standard drug. The effect is subject-specific, because it is defined at the<br />

subject level. By contrast, the effects in the marginal models specified in the previous<br />

subsection are population-averaged, because they refer <strong>to</strong> averaging over the entire<br />

population rather than <strong>to</strong> individual subjects.<br />

The remainder of this chapter focuses only on marginal models. The following<br />

chapter presents conditional models and also discusses issues relating <strong>to</strong> the choice<br />

of model.<br />

9.2 MARGINAL MODELING: THE GENERALIZED<br />

ESTIMATING EQUATIONS (GEE) APPROACH<br />

ML fitting of marginal logit models is difficult. We will not explore the technical<br />

reasons here, but basically, it is because the models refer <strong>to</strong> marginal probabilities<br />

whereas the likelihood function refers <strong>to</strong> the joint distribution of the clustered

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!