13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

284 MODELING CORRELATED, CLUSTERED RESPONSES<br />

Table 9.5. Estimates and Standard Errors (in<br />

Parentheses) for Logistic Models Fitted <strong>to</strong> Tera<strong>to</strong>logy<br />

<strong>Data</strong> of Table 9.4<br />

Type of Logistic Model Fitting<br />

Parameter Binomial ML GEE<br />

Intercept 1.14 (0.13) 1.21 (0.27)<br />

Group 2 −3.32 (0.33) −3.37 (0.43)<br />

Group 3 −4.48 (0.73) −4.58 (0.62)<br />

Group 4 −4.13 (0.48) −4.25 (0.60)<br />

Overdispersion None ˆρ = 0.19<br />

Note: Binomial ML assumes no overdispersion; GEE has exchangeable<br />

working correlation. The intercept term gives result for group 1<br />

(placebo) alone.<br />

Suppose an application has positive within-cluster correlation, as often happens<br />

in practice and as seems <strong>to</strong> be the case here. Then, standard errors for betweencluster<br />

effects (such as comparisons of separate treatment groups) and standard errors<br />

of estimated means within clusters tend <strong>to</strong> be larger than when the observations<br />

are independent. We see this in Table 9.5 except for group 3 and its comparison<br />

with the placebo group. With positive within-cluster correlation, standard errors for<br />

within-cluster effects, such as a slope for a trend in the repeated measurements on a<br />

subject, tend <strong>to</strong> be smaller than when the observations are independent.<br />

9.2.5 Limitations of GEE Compared with ML<br />

Because the GEE method specifies the marginal distributions and the correlation<br />

structure but not the complete multivariate distribution, there is no likelihood function.<br />

In this sense, the GEE method is a multivariate type of quasi-likelihood method. So,<br />

its estimates are not ML estimates.<br />

For clustered data, the GEE method is much simpler computationally than ML<br />

and much more readily available in software. However, it has limitations. Because<br />

it does not have a likelihood function, likelihood-ratio methods are not available for<br />

checking fit, comparing models, and conducting inference about parameters. Instead<br />

inference uses statistics, such as Wald statistics, based on the approximate normality<br />

of the estima<strong>to</strong>rs <strong>to</strong>gether with their estimated covariance matrix. Such inference is<br />

reliable mainly for very large samples. Otherwise, the empirically based standard<br />

errors tend <strong>to</strong> underestimate the true ones.<br />

Some software attempts <strong>to</strong> improve on Wald-type inference by making tests available<br />

that mimic the way score tests are constructed, if one had a likelihood function.<br />

These generalized score tests also incorporate empirical information in forming<br />

standard error estimates, and they are preferable <strong>to</strong> Wald tests.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!