25.11.2014 Views

Biostatistics

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

756 CHAPTER 14 SURVIVAL ANALYSIS<br />

distribution. Excellent descriptions of the various models used to represent hazard functions<br />

are provided by Allison (4) and Kleinbaum and Klein (1).<br />

14.3 THE KAPLAN–MEIER PROCEDURE<br />

Now let us show how we may use the data usually collected in follow-up studies of the type<br />

we have been discussing to estimate the probability of surviving for a specified length of<br />

time. The method we use was introduced by Kaplan and Meier (5) and for that reason is<br />

called the Kaplan–Meier procedure. Since the procedure involves the successive multiplication<br />

of individual estimated probabilities, it is sometimes referred to as the product-limit<br />

method of estimating survival probabilities.<br />

As we shall see, the calculations include the computations of proportions of subjects in a<br />

samplewho survive for various lengths of time. We use these sample proportions as estimates of<br />

the probabilities of survival that we would expect to observe in the population represented by<br />

our sample. In mathematical terms we refer to the process as the estimation of a survivorship<br />

function. Frequency distributions and probability distributions may be constructed from<br />

observed survival times, and these observed distributions may show evidence of following<br />

some theoretical distribution of known functional form. When the form of the sampled<br />

distribution is unknown, it is recommended that the estimation of a survivorship function be<br />

accomplished by means of a nonparametric technique, of which the Kaplan–Meier procedure<br />

is one. Nonparametric techniques are defined and discussed in detail in Chapter 13.<br />

Calculations for the Kaplan–Meier Procedure<br />

We let<br />

n ¼ the number of subjects whose survival times are available<br />

p 1 ¼ the proportion of subjects surviving at least the first time period<br />

(day, month, year, etc.)<br />

p 2 ¼ the proportion of subjects surviving the second time period<br />

after having survived the first time period<br />

p 3 ¼ the proportion of subjects surviving the third time period<br />

after having survived the second time period<br />

.<br />

.<br />

p k ¼ the proportion of subjects surviving the kth time period<br />

after having survived the ðk 1Þth time period<br />

We use these proportions, which we may relabel ^p 1 ; ^p 2 ; ^p 3 ; ...; ^p k as estimates of the<br />

probability that a subject from the population represented by the sample will survive time<br />

periods 1, 2, 3, . . . , k, respectively.<br />

For any time period, t, where 1 t k, we estimate the probability of surviving the<br />

tth time period, p t , as follows:<br />

number of subjects surviving at least t 1<br />

^p t ¼ ð Þtime periods who also survive the tth period<br />

number of subjects alive at end of time period ðt 1Þ<br />

(14.3.1)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!