01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

206 Regression and correlation<br />

discussed <strong>in</strong> §12.7. In that case, <strong>in</strong>creas<strong>in</strong>g the <strong>in</strong>itial sample size may have a<br />

smaller effect on the phenomenon of regression to the mean than would otherwise<br />

be so.<br />

Regression to the mean is not a problem when compar<strong>in</strong>g treatments <strong>in</strong> a<br />

randomized controlled trial s<strong>in</strong>ce both treatment groups will be <strong>in</strong>fluenced<br />

equally so that the difference will be unbiased, even though the mean improvements<br />

of both treatments will <strong>in</strong>clude a regression to the mean effect. Another<br />

approach is that after select<strong>in</strong>g patients by a screen<strong>in</strong>g test, a second pretreatment<br />

value is obta<strong>in</strong>ed on admission <strong>in</strong>to the study. This second value, and not<br />

the screen<strong>in</strong>g value, is used as the <strong>in</strong>itial value <strong>in</strong> the analysis (McDonald et al.,<br />

1983). The extent to which this approach will be successful <strong>in</strong> elim<strong>in</strong>at<strong>in</strong>g<br />

regression to the mean is dependent on the extent to which the second basel<strong>in</strong>e<br />

value is <strong>in</strong>dependent of the screen<strong>in</strong>g value (Senn, 1988).<br />

Newell and Simpson (1990) and Beach and Baron (1998) summarize the<br />

phenomenon of regression to the mean and provide additional references.<br />

Example 7.2<br />

Irwig et al. (1991) exam<strong>in</strong>ed the effects of regression to the mean on the screen<strong>in</strong>g of<br />

<strong>in</strong>dividuals to detect those with high levels of blood cholesterol. They considered three<br />

hypothetical populations with the follow<strong>in</strong>g mean values (all values quoted here and<br />

below be<strong>in</strong>g <strong>in</strong> units of mmol/l): A, 5 2; B, 5 8; C, 6 4. These values are based on survey<br />

data, as be<strong>in</strong>g characteristic of men under 35 years and women under 45 (A); men aged<br />

35±74 and women aged 45±64 (B); and women aged 65 and older (C). The calculations are<br />

based on the model described above, but with the assumption (aga<strong>in</strong> confirmed by<br />

observations) that the log of the cholesterol level, rather than the level itself, is normally<br />

distributed between and with<strong>in</strong> subjects, with constant with<strong>in</strong>-subject variance.<br />

Standard guidel<strong>in</strong>es would classify levels less than 5 2 as `desirable', levels between 5 2<br />

and 6 1 as `borderl<strong>in</strong>e high', and those above 6 1 as `high'. Irwig et al. illustrate the effects<br />

of regression to the mean <strong>in</strong> various ways. For <strong>in</strong>stance, an <strong>in</strong>dividual <strong>in</strong> group B with a<br />

s<strong>in</strong>gle screen<strong>in</strong>g measurement of 9 0 would have an estimated true mean of 8 4 with 80%<br />

confidence limits of 7 7 and 9 2. If the value of 9 0 was based on three measurements, the<br />

estimated true level would be 8 8 (limits 8 3, 9 3)Ða less pronounced regression effect than<br />

for s<strong>in</strong>gle measurements.<br />

An important concern is that <strong>in</strong>dividuals might be classified on the wrong side of a<br />

threshold. For <strong>in</strong>stance, an <strong>in</strong>dividual <strong>in</strong> group A with a screen<strong>in</strong>g measurement of<br />

4 9 would have a probability of 0 26 of hav<strong>in</strong>g a true mean above the threshold of 5 2.<br />

For a screen<strong>in</strong>g measurement of 5 8 there is a probability of 0 10 that the true level is<br />

below 5 2.<br />

Other calculations are concerned with the assessment of an <strong>in</strong>tervention <strong>in</strong>tended to<br />

lower blood cholesterol. Suppose that an <strong>in</strong>tervention produces a mean reduction of 13%<br />

(this figure be<strong>in</strong>g based on the results of a particular study). For an <strong>in</strong>dividual <strong>in</strong> group B<br />

with three measurements before and one after the <strong>in</strong>tervention, and a pre-<strong>in</strong>tervention<br />

mean of 7 8, an observed decrease of 25% corresponds to an estimated true decrease of<br />

19%, while an observed decrease of 0% (no apparent change) corresponds to an estimated

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!