01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

There is only one conventional polynomial of a given degree but the presence<br />

of the vector p means that there is an <strong>in</strong>f<strong>in</strong>ite number of fractional polynomials<br />

of degree m. On the one hand, this enriches the family of curves available to the<br />

analyst but, on the other, it complicates the fitt<strong>in</strong>g of a fractional polynomial. As<br />

po<strong>in</strong>ted out above, the coefficients bj can be found us<strong>in</strong>g l<strong>in</strong>ear regression<br />

techniques, but this is not the case unless the elements of p are effectively taken<br />

to be fixed. Treat<strong>in</strong>g p as another collection of parameters to be estimated from<br />

the data removes the simplicity from the fitt<strong>in</strong>g procedure that is one of the most<br />

attractive features of fractional polynomials. Royston and Altman (1994) propose<br />

the follow<strong>in</strong>g simple, heuristic approach to fitt<strong>in</strong>g fractional polynomials.<br />

The elements of p are assumed to belong to a small discrete set of possible<br />

powers, which is usually taken to be P ˆf 2, 1, 1<br />

2<br />

12.1 Polynomial regression 385<br />

,0, 1<br />

2<br />

,1,2, ..., Mg<br />

where M could be 3 or possibly max {3, m}. Each m-tuple taken from P with<br />

replacement def<strong>in</strong>es a fractional polynomial and the m-tuple which gives rise to<br />

the smallest residual mean square def<strong>in</strong>es the best-fitt<strong>in</strong>g fractional polynomial<br />

of degree m. It is also suggested that fractional polynomials of degree m and<br />

m ‡ 1 can be compared us<strong>in</strong>g the usual F-ratio statistic for the ratio of residual<br />

mean squares, but it must be remembered that the fractional polynomial of<br />

higher degree has two extra parameters, a power and a coefficient, so the<br />

numerator DF for the F statistic must be 2.<br />

Figure 12.3 shows the population trend data with the cubic polynomial<br />

previously fitted and the best-fitt<strong>in</strong>g fractional polynomial of degree 3. The<br />

fractional polynomial has p ˆ…0, 0, 1<br />

2 †Ðthat is, a quadratic <strong>in</strong> log(year) together<br />

with a term p year. The residual mean square for the fractional polynomial is<br />

0 300 compared with 0 365 for the cubic. The improvement <strong>in</strong> fit for a model<br />

which at least nom<strong>in</strong>ally has the same number of terms may be very worthwhile<br />

for some purposes. Interpolation and smooth<strong>in</strong>g with<strong>in</strong> the range of the data<br />

may be two such purposes. However, some caution is also advisable.<br />

The fractional polynomial shown <strong>in</strong> Fig. 12.3 has been found as a result of<br />

consider<strong>in</strong>g 120 fractional polynomials of degree 3 and a further 44 of degree 2 or<br />

1. The number of fractional polynomials considered obviously depends on the<br />

size of the set P. IfP has P elements then there are P fractional polynomials of<br />

degree 1, P…P ‡ 1†=2 of degree 2 and P…P ‡ 1†…P ‡ 2†=6 of degree 3. Some of the<br />

statistical features of a fractional polynomial fitted us<strong>in</strong>g the method outl<strong>in</strong>ed,<br />

such as the standard errors of the coefficients bj that are reported by the<br />

regression program, will be unreliable because they will not take account of the<br />

number of models that have been considered. This is, of course, true of conventional<br />

polynomials, but the number of alternatives that are available from<br />

consideration here is so much smaller that its effect is likely to be much less<br />

important. For some purposes, the standard errors are unimportant; <strong>in</strong> other<br />

cases it may be sensible to check the reliability of standard errors us<strong>in</strong>g techniques<br />

such as the bootstrap (see §10.7). When so many models are considered, it

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!