01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

dispersed reversed J-shape. With that assumption, the posterior distribution<br />

becomes Gamma (x ‡ 1<br />

2 , 1), and 2m has a x2 distribution on 2x ‡ 1 DF.<br />

Example 6.3<br />

Example 5.2 described a study <strong>in</strong> which x ˆ 33 workers died of lung cancer. The number<br />

expected at national death rates was 20 0.<br />

The Bayesian model described above, with a non-<strong>in</strong>formative prior, gives the posterior<br />

distribution Gamma (33 5, 1), and 2m has a x2 distribution on 67 DF. From computer<br />

tabulations of this distribution we f<strong>in</strong>d that P (m < 20 0) is 0 0036, very close to the<br />

frequentist one-sided mid-P significance level of 0 0037 quoted <strong>in</strong> Example 5.2.<br />

If, on the other hand, it was believed that local death rates varied around the national<br />

rate by relatively small <strong>in</strong>crements, a prior might be chosen to have a mean count of 20<br />

and a small standard deviation of, say, 2. Sett<strong>in</strong>g the mean of the prior to be ab ˆ 20 and<br />

its variance to be ab2 ˆ 4, gives a ˆ 100 and b ˆ 0 2, and the posterior distribution is<br />

Gamma (133, 0 1667). Thus, 12m has a x2 distribution on 266 DF, and computer tabulations<br />

show that P(m < 20 0) is 0 128. There is now considerably less evidence that the local<br />

death rate is excessive. The posterior estimate of the expected number of deaths is<br />

(133) (0 1667) ˆ 22 17. Note, however, that the observed count is somewhat <strong>in</strong>compatible<br />

with the prior assumptions. The difference between x and the prior mean is 33 20 ˆ 13.<br />

Its variance might be estimated as 33 ‡ 4 ˆ 37 and its standard error as 3<br />

p 7 ˆ 6 083. The<br />

difference is thus over twice its standard error, and the <strong>in</strong>vestigator might be well advised<br />

to reconsider prior assumptions.<br />

Analyses <strong>in</strong>volv<strong>in</strong>g the ratio of two counts can proceed from the approach<br />

described <strong>in</strong> §5.2 and illustrated <strong>in</strong> Examples 5.2 and 5.3. If two counts, x1 and<br />

x2, follow <strong>in</strong>dependent Poisson distributions with means m 1 and m 2, respectively,<br />

then, given the total count x1 ‡ x2, the observed count x1 is b<strong>in</strong>omially distributed<br />

with mean …x1 ‡ x2†m 1=…m 2 ‡ m 2†. The methods described earlier <strong>in</strong> this<br />

section for the Bayesian analysis of proportions may thus be applied also to<br />

this problem.<br />

6.4 Further comments on Bayesian methods<br />

Shr<strong>in</strong>kage<br />

6.4 Further comments on Bayesian methods 179<br />

The phenomenon of shr<strong>in</strong>kage was <strong>in</strong>troduced <strong>in</strong> §6.2 and illustrated <strong>in</strong> several of<br />

the situations described <strong>in</strong> that section and <strong>in</strong> §6.3. It is a common feature of<br />

parameter estimation <strong>in</strong> Bayesian analyses. The posterior distribution is determ<strong>in</strong>ed<br />

by the prior distribution and the likelihood based on the data, and its<br />

measures of location will tend to lie between those of the prior distribution and<br />

the central features of the likelihood function. The relative weights of these two<br />

determ<strong>in</strong>ants will depend on the variability of the prior and the tightness of the<br />

likelihood function, the latter be<strong>in</strong>g a function of the amount of data.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!