01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

372 9 Survival Analysis<br />

the proportional hazards assumption, i.e., the assumption that the hazards can be<br />

expressed as:<br />

h1(t) = ψ h2(t), 9.36<br />

where the positive constant ψ is known as the hazard ratio, mentioned in 9.3.<br />

Let X be an indicator variable such that its value for the ith individual, xi, is 1 or<br />

0, according to the group membership of the individual. In order to impose a<br />

positive value to ψ, we rewrite formula 9.36 as:<br />

βx<br />

h ( t)<br />

= e i h0<br />

( t)<br />

. 9.37<br />

i<br />

Thus h2(t) = h0(t) <strong>and</strong> ψ = e β . This model can be generalised for p explanatory<br />

variables:<br />

η<br />

h ( t)<br />

e i<br />

i = h0<br />

( t)<br />

, with η i = β1<br />

x1i + β 2x2i<br />

+ K+<br />

β p x pi , 9.38<br />

where ηi is known as the risk score <strong>and</strong> h0(t) is the baseline hazard function, i.e.,<br />

the hazard that one would obtain if all independent explanatory variables were<br />

zero.<br />

The Cox regression model is the most general of the regression models for<br />

survival data since it does not assume any particular underlying survival<br />

distribution. The model is fitted to the data by first estimating the risk score using a<br />

log-likelihood approach <strong>and</strong> finally computing the baseline hazard by an iterative<br />

procedure. As a result of the model fitting process, one can obtain parameter<br />

estimates <strong>and</strong> plots for specific values of the explanatory variables.<br />

Example 9.10<br />

Q: Determine the Cox regression solution for the Heart Valve dataset (eventfree<br />

survival time), using Age as the explanatory variable. Compare the survivor<br />

functions <strong>and</strong> determine the estimated percentages of an event-free 10-year postoperative<br />

period for the mean age <strong>and</strong> for 20 <strong>and</strong> 60 years-old patients as well.<br />

A: <strong>STATISTICA</strong> determines the parameter βAge = 0.0214 for the Cox regression<br />

model. The chi-square test under the null hypothesis of “no Age influence” yields<br />

an observed p = 0.004. Therefore, variable Age is highly significant in the<br />

estimation of survival times, i.e., is an explanatory variable.<br />

Figure 9.9a shows the baseline survivor function. Figures 9.9b, c <strong>and</strong> d, show<br />

the survivor function plots for 20, 47.17 (mean age) <strong>and</strong> 60 years, respectively. As<br />

expected, the probability of a given post-operative event-free period decreases with<br />

age (survivor curves lower with age). From these plots, we see that the estimated<br />

percentages of patients with post-operative event-free 10-year periods are 80%,<br />

65% <strong>and</strong> 59% for 20, 47.17 (mean age) <strong>and</strong> 60 year-old patients, respectively.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!