01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

516 Empirical methods for categorical data<br />

Table 15.5 Frequencies (and percentages) of ABO blood groups <strong>in</strong> patients with peptic ulcer, patients<br />

with gastric cancer and controls (Snedecor & Cochran, 1989, Ex. 11.10.2).<br />

Blood<br />

group<br />

Peptic<br />

ulcer (%)<br />

Gastric<br />

cancer (%) Controls (%) Total<br />

O 983 (55) 383 (43) 2892 (48) 4258<br />

A 679 (38) 416 (47) 2625 (43) 3720<br />

B 134(7) 84(10) 570 (9) 788<br />

Total 1796 (100) 883 (100) 6087 (100) 8766<br />

As noted earlier, the total of the X 2 1 statistics is a little different from the X 2 4 value of<br />

40 54for the whole table, but the discrepancy is slight. The value of X 2 1 for (c) gives<br />

P ˆ 0 021, giv<strong>in</strong>g rise to some doubt about the lack of association of blood groups and<br />

gastric cancer. The outstand<strong>in</strong>g contrast is that between the proportions of group O <strong>in</strong> the<br />

peptic ulcer patients and <strong>in</strong> the other subjects.<br />

The process of collaps<strong>in</strong>g rows and columns could have been speeded up by comb<strong>in</strong><strong>in</strong>g<br />

some of the 1 DF contrasts <strong>in</strong>to 2 DF contrasts. For example, (a) and (b) could have been<br />

comb<strong>in</strong>ed <strong>in</strong> a 2 3 table, (e). This gives an X 2 …2† of 1 01, scarcely different from the sum of<br />

0 32 and 0 68, represent<strong>in</strong>g the overall association of blood groups A and B with the two<br />

disease groups and controls. Or, to provide an overall picture of the association of blood<br />

groups with gastric cancer, (a) and (c) could have been comb<strong>in</strong>ed <strong>in</strong> a 3 2 table, (f). This<br />

gives an X 2 …2† of 5 64(very close to 0 32 ‡ 5 30), which is, of course, less significant than<br />

of 5 30 from (c).<br />

the X 2 …1†<br />

There are many ways of subdivid<strong>in</strong>g a cont<strong>in</strong>gency table. In Example 15.5,<br />

the elementary table (a) could have been chosen as any one of the 2 2 tables<br />

form<strong>in</strong>g part of the whole table. The choice, as <strong>in</strong> that example, will often be<br />

data-dependentÐthat is, made after an <strong>in</strong>itial <strong>in</strong>spection of the data. There is,<br />

therefore, the risk of data-dredg<strong>in</strong>g, and this should be recognized <strong>in</strong> any <strong>in</strong>terpretation<br />

of the analysis. In Example 15.5, of course, the association between<br />

group O and peptic ulcer is too strong to be expla<strong>in</strong>ed away by data-dredg<strong>in</strong>g.<br />

15.6 Comb<strong>in</strong>ation of 2 3 2tables<br />

Sometimes a number of 2 2 tables, all bear<strong>in</strong>g on the same question, are<br />

available, and it seems natural to comb<strong>in</strong>e the evidence for an association<br />

between the row and column factors. For example, there may be a number of<br />

retrospective studies, each provid<strong>in</strong>g evidence about a possible association<br />

between a certa<strong>in</strong> disease and a certa<strong>in</strong> environmental factor. Or, <strong>in</strong> a multicentre<br />

cl<strong>in</strong>ical trial, each centre may provide evidence about a possible difference<br />

between the proportions of patients whose condition is improved with treatment<br />

A and treatment B. These are examples of stratification. In the cl<strong>in</strong>ical trial, for

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!