01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

The proportion of true positives among the apparent positives is sometimes<br />

called the predictive value of a positive test; the proportion of true negatives<br />

among the apparent negatives is the predictive value of a negative test. In case<br />

(a), the true prevalence is 500/1000 ˆ 0 5 and the predictive value of a positive<br />

test is high (0 9). In case (b), however, where the true prevalence is 100/1000 ˆ<br />

0 1, the same predictive value is only 0 5. The predictive values are conditional<br />

probabilities of the test results and may be calculated us<strong>in</strong>g Bayes' theorem<br />

(§3.3), with the prevalences of disease and non-disease as the prior probabilities<br />

and the likelihoods of the test results obta<strong>in</strong>ed from the sensitivity and specificity.<br />

Case (b) illustrates the position <strong>in</strong> many presymptomatic screen<strong>in</strong>g procedures<br />

where the true prevalence is low. Of the subjects found positive by the<br />

screen<strong>in</strong>g test, a rather high proportion may be false positives. To avoid this<br />

situation the test may sometimes be modified to reduce a, but such a step often<br />

results <strong>in</strong> an <strong>in</strong>creased value of b and hence a reduced value of 1 b; the number<br />

of false positives among the apparent positives will have been reduced, but so<br />

will the number of true positives detected.<br />

The sort of modification referred to <strong>in</strong> the last sentence is particularly<br />

relevant when the test, although dichotomous, is based on a cont<strong>in</strong>uous measurement.<br />

Examples are the diagnosis of diabetes by blood-sugar level or of<br />

glaucoma by <strong>in</strong>traocular pressure. Any change <strong>in</strong> the critical level of the measurement<br />

will affect a and b. One very simple model for this situation would be<br />

to assume that the variable, x, on which the test is based is normally distributed<br />

with the same variance s 2 for the normal and diseased populations, but with<br />

different means, m N and m D (Fig. 19.1). For any given a, the value of b depends<br />

solely on the standardized distance between the means,<br />

D ˆ mD :<br />

s<br />

If the critical value for the test is the mid-po<strong>in</strong>t between the means, 1<br />

2 …mN ‡ mD), a and b will both be equal to the s<strong>in</strong>gle-tail area of the normal distribution<br />

beyond a standardized deviate of 1<br />

2 D. To compare the merits of different tests one<br />

could, therefore, compare their values of D; tests with high values of D will<br />

differentiate between normal and diseased groups better than those with low<br />

values. There is a clear analogy here with the generalized distance as a measure of<br />

the effectiveness of a discrim<strong>in</strong>ant function (p. 467); the discrim<strong>in</strong>ation is performed<br />

here by the s<strong>in</strong>gle variable x.<br />

Instead of an all-or-none classification as healthy or diseased, it may sometimes<br />

be useful to express the strength of the evidence for any <strong>in</strong>dividual fall<strong>in</strong>g<br />

<strong>in</strong>to each of the two groups. For the model described above, the logarithm of the<br />

likelihood ratio is l<strong>in</strong>early related to x, as shown <strong>in</strong> the lower part of Fig. 19.1.<br />

(This is a particular case of the more general result for discrim<strong>in</strong>ant functions<br />

m N<br />

19.9 Diagnostic tests 695

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!