13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

10.2 EXAMPLES OF RANDOM EFFECTS MODELS FOR BINARY DATA 309<br />

possible because the joint distribution of the responses is not specified. In addition,<br />

this approach does not explicitly include random effects and therefore does not allow<br />

these effects <strong>to</strong> be estimated.<br />

The conditional modeling approach is preferable if one wants <strong>to</strong> fully model the<br />

joint distribution. The marginal modeling approach focuses only on the marginal<br />

distribution. The conditional modeling approach is also preferable if one wants <strong>to</strong><br />

estimate cluster-specific effects or estimate their variability, or if one wants <strong>to</strong> specify<br />

a mechanism that could generate positive association among clustered observations.<br />

For example, some methodologists use conditional models whenever the main focus<br />

is on within-cluster effects. In the depression study (Section 10.2.6), the conditional<br />

model is appropriate if we want the estimate of the time effect <strong>to</strong> be “within-subject,”<br />

describing the time trend at the subject level.<br />

By contrast, if the main focus is on comparing groups that are independent samples,<br />

effects of interest are “between-cluster” rather than “within-cluster.” It may then be<br />

adequate <strong>to</strong> estimate effects with a marginal model. For example, if after a period of<br />

time we mainly want <strong>to</strong> compare the rates of depression for those taking the new drug<br />

and for those taking the standard drug, a marginal model is adequate. In many surveys<br />

or epidemiological studies, a goal is <strong>to</strong> compare the relative frequency of occurrence<br />

of some outcome for different groups in a population. Then, quantities of primary<br />

interest include between-group odds ratios comparing marginal probabilities for the<br />

different groups.<br />

When marginal effects are the main focus, it is simpler <strong>to</strong> model the margins<br />

directly. One can then parameterize the model so regression parameters have a direct<br />

marginal interpretation. Developing a more detailed model of the joint distribution<br />

that generates those margins, as a random effects model does, provides greater opportunity<br />

for misspecification. For instance, with longitudinal data the assumption that<br />

observations are independent, given the random effect, need not be realistic.<br />

Latent variable constructions used <strong>to</strong> motivate model forms (such as the probit and<br />

cumulative logit) usually apply more naturally at the cluster level than the marginal<br />

level. Given a conditional model, one can in principle recover information about<br />

marginal distributions, although this may require extra work not readily done by<br />

standard software. That is, a conditional model implies a marginal model, but a<br />

marginal model does not itself imply a conditional model. In this sense, a conditional<br />

model has more information.<br />

We have seen that parameters describing effects are usually larger in conditional<br />

models than marginal models, moreso as variance components increase. Usually,<br />

though, the significance of an effect (e.g., as determined by the ratio of estimate<br />

<strong>to</strong> SE) is similar for the two model types. If one effect seems more important than<br />

another in a conditional model, the same is usually true with a marginal model. The<br />

choice of the model is usually not crucial <strong>to</strong> inferential conclusions.<br />

10.2.8 Conditional Models: Random Effects Versus Conditional ML<br />

For the fixed effects approach with cluster-specific terms, a difficulty is that the<br />

model has a large number of parameters. To estimate the other effects in the model,

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!