13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

5.3 EFFECTS OF SPARSE DATA 155<br />

Table 5.7. Clinical Trial Relating Treatment (X) <strong>to</strong> Response<br />

(Y ) for Five Centers (Z), with XY and YZ Marginal Tables<br />

Response (Y ) YZ Marginal<br />

Center (Z) Treatment (X) Success Failure Success Failure<br />

1 Active drug 0 5<br />

Placebo 0 9<br />

2 Active drug 1 12<br />

Placebo 0 10<br />

3 Active drug 0 7<br />

Placebo 0 5<br />

4 Active drug 6 3<br />

Placebo 2 6<br />

5 Active drug 5 9<br />

Placebo 2 12<br />

XY Active drug 12 36<br />

Marginal Placebo 4 42<br />

Source: Diane Connell, Sandoz Pharmaceuticals Corp.<br />

0 14<br />

1 22<br />

0 12<br />

8 9<br />

7 21<br />

(1 = success, 0 = failure). For these data, let Y = Response, X = Treatment (Active<br />

drug or Placebo), and Z = Center. Centers 1 and 3 had no successes. Thus, the 5 × 2<br />

marginal table relating center <strong>to</strong> response, collapsed over treatment, contains zero<br />

counts. This marginal table is shown in the last two columns of Table 5.7.<br />

For these data, consider the model<br />

logit[P(Y = 1) = α + βx + β Z k<br />

Because centers 1 and 3 had no successes, the ML estimates of the terms βZ 1 and βZ 3<br />

pertaining <strong>to</strong> their effects equal −∞. The fitted logits for those centers equal −∞,<br />

for which the fitted probability of success is 0.<br />

In practice, software notices that the likelihood function continually increases as<br />

βZ 1 and βZ 3 decrease <strong>to</strong>ward −∞, but the fitting algorithm may “converge” at large<br />

negative values. For example, SAS (PROC GENMOD) reports ˆβ Z 1 and ˆβ Z 3 <strong>to</strong> both<br />

be about −26 with standard errors of about 200,000. Since the software uses default<br />

coding that sets βZ 5 = 0, these estimates refer <strong>to</strong> contrasts of each center with the last<br />

one. If we remove α from the model, then one of {βZ k } is no longer redundant. Each<br />

center parameter then refers <strong>to</strong> that center alone rather than a contrast with another<br />

center. Most software permits fitting a model parameterized in this way by using a

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!