13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

CHAPTER 9<br />

Modeling Correlated, Clustered<br />

Responses<br />

Many studies observe the response variable for each subject repeatedly, at several<br />

times (such as in longitudinal studies) or under various conditions. Repeated<br />

measurement occurs commonly in health-related applications. For example, a<br />

physician might evaluate patients at weekly intervals regarding whether a new drug<br />

treatment is successful. Repeated observations on a subject are typically correlated.<br />

Correlated observations can also occur when the response variable is observed for<br />

matched sets of subjects. For example, a study of fac<strong>to</strong>rs that affect childhood obesity<br />

might sample families and then observe the children in each family. A matched set<br />

consists of children within a particular family. Children from the same family may<br />

tend <strong>to</strong> respond more similarly than children from different families. Another example<br />

is a (survival, nonsurvival) response for each fetus in a litter of a pregnant mouse, for<br />

a sample of pregnant mice exposed <strong>to</strong> various dosages of a <strong>to</strong>xin. Fetuses from the<br />

same litter are likely <strong>to</strong> be more similar than fetuses from different litters.<br />

We will refer <strong>to</strong> a matched set of observations as a cluster. For repeated<br />

measurement on subjects, the set of observations for a given subject forms a cluster.<br />

Observations within a cluster are usually positively correlated. Analyses should take<br />

the correlation in<strong>to</strong> account. Analyses that ignore the correlation can estimate model<br />

parameters well, but the standard error estima<strong>to</strong>rs can be badly biased.<br />

The next two chapters generalize methods of the previous chapter for matched pairs<br />

<strong>to</strong> matched sets and <strong>to</strong> include explana<strong>to</strong>ry variables. Section 9.1 describes a class of<br />

marginal models and contrasts them with conditional models, which Chapter 10<br />

presents. To fit marginal models, Section 9.2 discusses a method using generalized<br />

estimating equations (GEE). This is a multivariate method that, for discrete<br />

data, is computationally simpler than ML. Section 9.2 models a data set with a<br />

binary response, and Section 9.3 considers multinomial responses. The final section<br />

An <strong>Introduction</strong> <strong>to</strong> <strong>Categorical</strong> <strong>Data</strong> <strong>Analysis</strong>, Second Edition. By Alan Agresti<br />

Copyright © 2007 John Wiley & Sons, Inc.<br />

276

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!