01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

294 Analys<strong>in</strong>g non-normal data<br />

probability to either element of a pair, with the other element be<strong>in</strong>g given<br />

the other label. Moreover, this can be repeated <strong>in</strong>dependently for each pair.<br />

In practice, this means that all possible changes of sign can be applied to<br />

each with<strong>in</strong>-pair difference, giv<strong>in</strong>g 2n possible permutations when there are n<br />

pairs.<br />

The idea beh<strong>in</strong>d permutation tests underlies the justification of several wellknown<br />

tests. This has just been demonstrated for the Wilcoxon rank sum test<br />

and it is also the case for the exact test for 2 2 tables (see §4.5). In the latter<br />

case, if there are two b<strong>in</strong>ary samples of sizes r1 and r2 then, <strong>in</strong> general, several<br />

permutations of the sample labels will give rise to the same table. The number of<br />

such permutations, divided by … r1‡r2†,<br />

will be the probability of obta<strong>in</strong><strong>in</strong>g that<br />

r1<br />

table under the null hypothesis; this approach gives a comb<strong>in</strong>atorial argument<br />

for the justification of (4.35). Fisher (1966, para. 21) also used permutation<br />

arguments to justify the use of t tests under certa<strong>in</strong> circumstances, even when<br />

the usual assumption of normality might be <strong>in</strong>appropriate. The same ideas<br />

underlie randomization tests, which are tests justified by the actual randomization<br />

of treatments to experimental units.<br />

In addition to provid<strong>in</strong>g theoretical grounds to justify certa<strong>in</strong> types of tests,<br />

the ideas beh<strong>in</strong>d permutation tests can be used directly <strong>in</strong> applications. They may<br />

be particularly helpful if there is doubt about the validity of other types of tests<br />

or when the calculation of quantities, such as standard errors, needed for other<br />

approaches, is <strong>in</strong>tractable.<br />

Example 10.8<br />

Suppose it was decided to assess whether the standard deviation of weight ga<strong>in</strong> was the<br />

same <strong>in</strong> the high- and low-prote<strong>in</strong> groups of Example 4 4. The sample standard deviations<br />

are 21 39 g <strong>in</strong> the high-prote<strong>in</strong> group and 20 62 g <strong>in</strong> the low-prote<strong>in</strong> group, giv<strong>in</strong>g a ratio<br />

of 1 037. A straightforward approach would be to apply the variance ratio test from §5 1<br />

to 1 0372 ˆ 1 076, with degrees of freedom 11 and 6. This gives a two-sided significance<br />

level of 0 979. However, as mentioned <strong>in</strong> §5 1, the variance ratio test is sensitive to the<br />

assumption that the ga<strong>in</strong>s <strong>in</strong> weight follow a normal distribution and an alternative which<br />

makes no such assumption is a permutation test.<br />

Table 10.5 shows the weight changes and the second column of the table <strong>in</strong>dicates the<br />

type of diet fed to each rat. If the null hypothesis that the standard deviations are equal is<br />

true then the observed ratio, namely 1 037, will be typical of the ratio of the standard<br />

deviations for the groups labelled L and H for any permutation of the column of Ls and<br />

Hs. With 12 observations, <strong>in</strong> one group and seven <strong>in</strong> the other there are 19<br />

12 ˆ 50 388<br />

permutations and the ratio of the standard deviations can be computed for each of these;<br />

the results are shown <strong>in</strong> the histogram <strong>in</strong> Fig. 10.1. As deviations from the null hypothesis<br />

<strong>in</strong> either direction are of equal <strong>in</strong>terest, a value should be considered `more extreme' than<br />

the observed ratio of 1 037 if it exceeds 1 037 or if it is less than 1 037 1 ˆ 0 964. There are<br />

46 669 such values, so the permutation P value is 46 669=50 388 ˆ 0 926, which is very<br />

similar to that obta<strong>in</strong>ed from the variance ratio test.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!