01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

36 Describ<strong>in</strong>g data<br />

107 01. The geometric mean, calculated from the logs and then transformed back<br />

to the orig<strong>in</strong>al scale, is 102 36. The median is 100 00, closer to the geometric<br />

mean than to the arithmetic mean, as one might expect.<br />

The log transform is one example of the way <strong>in</strong> which raw data can often<br />

usefully be transformed to a new scale of measurement before analyses are<br />

carried out. The use of transformations is described <strong>in</strong> more detail <strong>in</strong> §§10.8 and<br />

11.10. In particular, the log transform will be seen to have other uses besides that<br />

of reduc<strong>in</strong>g positive skewness.<br />

2.6 Measures of variation<br />

When the mean value of a series of measurements has been obta<strong>in</strong>ed, it is usually a<br />

matter of considerable <strong>in</strong>terest to express the degree of variation or scatter around<br />

this mean. Are the read<strong>in</strong>gs all rather close to the mean or are some of them<br />

scattered widely <strong>in</strong> each direction? This question is important for purely descriptive<br />

reasons, as we shall emphasize below. It is important also s<strong>in</strong>ce the measurement<br />

of variation plays a central part <strong>in</strong> the methods of statistical <strong>in</strong>ference which<br />

are described <strong>in</strong> this book. To take a simple example, the reliability of the mean of<br />

100 values of some variable depends on the extent to which the 100 read<strong>in</strong>gs differ<br />

among themselves; if they show little variation the mean value is more reliable,<br />

more precisely determ<strong>in</strong>ed, than if the 100 read<strong>in</strong>gs vary widely. The role of<br />

variation <strong>in</strong> statistical <strong>in</strong>ference will be clarified <strong>in</strong> later chapters of this book.<br />

At present we are concerned more with the descriptive aspects.<br />

In works of reference it is common to f<strong>in</strong>d a s<strong>in</strong>gle figure quoted for the value<br />

of some biological quantity and the reader may not always realize that the stated<br />

figure is some sort of average. In a textbook on nutrition, for example, we might<br />

f<strong>in</strong>d the vitam<strong>in</strong> A content of Cheddar cheese given as 390 micrograms per 100<br />

grams of cheese. Clearly, not all specimens of Cheddar cheese conta<strong>in</strong> precisely<br />

390 mg per 100 g; how much variation, then, is there from one piece of cheese to<br />

another? To take another example from nutrition, the daily energy requirement<br />

of a physically active man aged 25 years, of 180 cm and 73 kg, should be 12 0<br />

megajoules. This requirement must vary from one person to another; how large<br />

is the variation?<br />

There is unlikely to be a s<strong>in</strong>gle answer to questions of this sort, because the<br />

amount of variation to be found <strong>in</strong> a series of measurements will usually depend<br />

on the circumstances <strong>in</strong> which they are made and, <strong>in</strong> particular, on the way <strong>in</strong><br />

which these circumstances change from one read<strong>in</strong>g to another. Specimens of<br />

Cheddar cheese are likely to vary <strong>in</strong> their vitam<strong>in</strong> A content for a number of<br />

reasons: major differences <strong>in</strong> the place and method of manufacture; variation <strong>in</strong><br />

composition from one specimen to another even with<strong>in</strong> the same batch of<br />

manufacture; the age of the cheese, and so on. Variation <strong>in</strong> the recorded<br />

measurement may be partly due to measurement errorÐ<strong>in</strong> the method of

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!