01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

7.5 Logit <strong>and</strong> Probit Models 325<br />

sections, are no longer applicable. For the logit <strong>and</strong> probit models, some sort of the<br />

chi-square test described in Chapter 5 is usually applied in order to assess the<br />

goodness of fit of the model. <strong>SPSS</strong> <strong>and</strong> <strong>STATISTICA</strong> afford another type of chisquare<br />

test based on the log-likelihood of the model. Let L0 represent the loglikelihood<br />

for the null model, i.e., where all slope parameters are zero, <strong>and</strong> L1 the<br />

log-likelihood of the fitted model. In the test used by <strong>STATISTICA</strong>, the following<br />

quantity is computed:<br />

L = −2(L0 − L1),<br />

which, under the null hypothesis that the null model perfectly fits the data, has a<br />

chi-square distribution with p − 1 degrees of freedom. The test used by <strong>SPSS</strong> is<br />

similar, using only the quantity –2 L1, which, under the null hypothesis, has a chisquare<br />

distribution with n − p degrees of freedom.<br />

In Example 7.21, the chi-square test is significant for both the logit <strong>and</strong> probit<br />

models; therefore, we reject the null hypothesis that the null model fits the data<br />

perfectly. In other words, the estimated parameters b1 (0.23 <strong>and</strong> 0.138 for the logit<br />

<strong>and</strong> probit models, respectively) have a significant contribution for the fitted<br />

models.<br />

1.2<br />

1.0<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0.0<br />

HG<br />

-0.2<br />

-5 0 5 10 15 20 25 30 35 40 45<br />

Figure 7.24. Logistic response for the clay classification problem, using variable<br />

HG (obtained with <strong>STATISTICA</strong>). The circles represent the observed data.<br />

Table 7.18. Classification matrix for the clay dataset, using predictor HG in the<br />

logit or probit models.<br />

Predicted Age = 1 Predicted Age = 0 Error rate<br />

Observed Age = 1 65 4 94.2<br />

Observed Age = 0 10 15 60.0

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!