01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

42 Describ<strong>in</strong>g data<br />

xi x will then need to be written with several significant digits and doubt will<br />

arise as to whether an adequate number of significant digits was reta<strong>in</strong>ed for x.<br />

These difficulties have led to the widespread use of an alternative method of<br />

calculat<strong>in</strong>g the sum of squares about the mean, P …xi x† 2 . It is based on the<br />

fact that<br />

P<br />

… xi x†<br />

2 ˆ P x 2 P<br />

… xi†<br />

i<br />

2<br />

: …2:3†<br />

n<br />

This is called the short-cut formula for the sum of squares about the mean.<br />

The important po<strong>in</strong>t about (2.3) is that the computation is performed without<br />

the need to calculate <strong>in</strong>dividual deviations from the mean, xi x. The sum of<br />

squares of the orig<strong>in</strong>al observations, xi, is corrected by subtraction of a quantity<br />

dependent only on the mean (or, equivalently, the total) of the xi. This second<br />

term is therefore often called the correction term, and the whole expression a<br />

corrected sum of squares.<br />

The previous example is reworked <strong>in</strong> Table 2.7. We aga<strong>in</strong> have the result<br />

P<br />

…xi x† 2 ˆ 100, and the subsequent calculations follow as <strong>in</strong> Table 2.6.<br />

The short-cut formula avoids the need to square <strong>in</strong>dividual deviations with<br />

many significant digits, but <strong>in</strong>volves the squares of the xi, which may be large<br />

numbers. This rarely causes trouble us<strong>in</strong>g a calculator, although care must be<br />

taken to carry sufficient digits <strong>in</strong> the correction term to give the required number<br />

of digits <strong>in</strong> the difference between the two terms. (For example, if P x2 i ˆ 2025<br />

and … P xi† 2 =n ˆ 2019 3825, the retention of all these decimals will give<br />

Table 2.7 Calculation of estimated variance and<br />

standard deviation: short-cut formula (same data<br />

as <strong>in</strong> Table 2.6).<br />

xi<br />

x 2<br />

i<br />

8 64<br />

5 25<br />

4 16<br />

12 144<br />

15 225<br />

5 25<br />

7 49<br />

Ð Ð<br />

56 548<br />

P<br />

…xi x† 2 ˆ P x2 i … P xi† 2 =n<br />

ˆ 548 56 2 =7<br />

ˆ 548 448<br />

ˆ 100<br />

Subsequent steps as <strong>in</strong> Table 2.6

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!