01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

156 Analys<strong>in</strong>g variances<br />

Comparison of two counts<br />

Suppose that x1 is a count which can be assumed to follow a Poisson distribution<br />

with mean m 1. Similarly let x2 be a count <strong>in</strong>dependently follow<strong>in</strong>g a<br />

Poisson distribution with mean m 2. How might we test the null hypothesis that<br />

m 1 ˆ m 2?<br />

One approach would be to use the fact that the variance of x1 x2 is m 1 ‡ m 2<br />

(by virtue of (3.19) and (4.9)). The best estimate of m 1 ‡ m 2 on the basis of<br />

the available <strong>in</strong>formation is x1 ‡ x2. On the null hypothesis E…x1 x2† ˆ<br />

m 1 m 2 ˆ 0, and x1 x2 can be taken to be approximately normally distributed<br />

unless m 1 and m 2 are very small. Hence,<br />

z ˆ x1 x2<br />

p …5:7†<br />

…x1 ‡ x2†<br />

can be taken as approximately a standardized normal deviate.<br />

Asecond approach has already been <strong>in</strong>dicated <strong>in</strong> the test for the comparison<br />

of proportions <strong>in</strong> paired samples (§4.5). Of the total frequency x1 ‡ x2, a portion<br />

x1 is observed <strong>in</strong> the first sample. Writ<strong>in</strong>g r ˆ x1 and n ˆ x1 ‡ x2 <strong>in</strong> (4.17) we have<br />

1<br />

z ˆ x1 2 …x1 ‡ x2†<br />

p 1<br />

ˆ<br />

2 …x1 ‡ x2†<br />

x1 x2<br />

p<br />

…x1 ‡ x2†<br />

as <strong>in</strong> (5.7). The two approaches thus lead to exactly the same test procedure.<br />

Athird approach uses a rather different application of the x2 test from that<br />

described for the 2 2 table <strong>in</strong> §4.5, the total frequency of x1 ‡ x2 now be<strong>in</strong>g<br />

divided <strong>in</strong>to two components rather than four. Correspond<strong>in</strong>g to each observed<br />

frequency we can consider the expected frequency, on the null hypothesis, to be<br />

1<br />

2 …x1 ‡ x2†:<br />

Observed x1 x2<br />

Expected<br />

1<br />

2 …x1 ‡ x2†<br />

1<br />

2 …x1 ‡ x2†<br />

Apply<strong>in</strong>g the usual formula (4.30) for a x 2 statistic, we have<br />

X 2 ˆ ‰x1<br />

1<br />

2 …x1 ‡ x2†Š 2<br />

1<br />

2 …x1 ‡ x2†<br />

ˆ …x1 x2† 2<br />

:<br />

x1 ‡ x2<br />

As for (4.30) X 2 follows the x 2 …1†<br />

‡ ‰x2<br />

1<br />

2 …x1 ‡ x2†Š 2<br />

1<br />

2 …x1 ‡ x2†<br />

…5:8†<br />

distribution, which we already know to be<br />

the distribution of the square of a standardized normal deviate. It is therefore<br />

not surpris<strong>in</strong>g that X 2 given by (5.8) is precisely the square of z given by<br />

(5.7). The third approach is thus equivalent to the other two, and forms a<br />

particularly useful method of computation s<strong>in</strong>ce no square root is <strong>in</strong>volved<br />

<strong>in</strong> (5.8).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!