01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

12 Further regression models for a cont<strong>in</strong>uous<br />

response<br />

12.1 Polynomial regression<br />

Reference was made <strong>in</strong> §11.6 to the possibility of creat<strong>in</strong>g new predictor variables<br />

def<strong>in</strong>ed as the squares of exist<strong>in</strong>g variables, to cope with non-l<strong>in</strong>ear or curvil<strong>in</strong>ear<br />

relationships. This is an important idea, and is most easily studied <strong>in</strong> situations <strong>in</strong><br />

which there is orig<strong>in</strong>ally only one predictor variable x. Instead of the l<strong>in</strong>ear<br />

regression equation<br />

<strong>in</strong>troduced <strong>in</strong> §7.2, we consider the polynomial model<br />

E…y† ˆa ‡ bx …12:1†<br />

E…y† ˆa ‡ b 1x ‡ b 2x 2 ‡ ...‡ b px p : …12:2†<br />

The highest power of x, denoted here by p, is called the degree of the<br />

polynomial. Some typical shapes of low-degree polynomial curves are shown <strong>in</strong><br />

Fig. 12.1. The curve for p ˆ 2, when the term <strong>in</strong> x 2 is added, is called quadratic;<br />

that for p ˆ 3 cubic, and that for p ˆ 4 quartic. Clearly, a wide variety of curves<br />

can be represented by polynomials. The quadratic curve has one peak or trough;<br />

the cubic has at most two peaks or troughs; and so on. A particular set of data<br />

may be fitted well by a portion of a low-degree polynomial even though no peaks<br />

or troughs are present. In particular, data show<strong>in</strong>g a moderate amount of<br />

curvature can often be fitted adequately by a quadratic curve.<br />

The general pr<strong>in</strong>ciple of polynomial regression analysis is to regard the<br />

successive powers of x as separate predictor variables. Thus, to fit the p-degree<br />

polynomial (12.2), we could def<strong>in</strong>e x1 ˆ x, x2 ˆ x 2 , ...xp ˆ x p and apply the<br />

standard methods of §11.6. It will often be uncerta<strong>in</strong> which degree of polynomial<br />

is required. Considerations of simplicity suggest that as low an order as possible<br />

should be used; for example, we should normally use l<strong>in</strong>ear regression unless<br />

there is any particular reason to use a higher-degree polynomial. The usual<br />

approach is to use a slightly higher degree than one supposes to be necessary.<br />

The highest-degree terms can then be dropped successively so long as they<br />

contribute, separately or together, <strong>in</strong>crements to the sum of squares (SSq)<br />

which are non-significant when compared with the Residual SSq. Some problems<br />

aris<strong>in</strong>g from this approach are illustrated <strong>in</strong> Example 12.1, p. 379.<br />

378

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!