13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

4.2 INFERENCE FOR LOGISTIC REGRESSION 109<br />

The term ˆα + ˆβx in the exponents of the prediction equation (4.4) is the estimated<br />

linear predic<strong>to</strong>r in the logit transform of π(x). This estimated logit has large-sample<br />

SE given by the estimated square root of<br />

Var( ˆα + ˆβx) = Var( ˆα) + x 2 Var( ˆβ) + 2x Cov( ˆα, ˆβ)<br />

A 95% confidence interval for the true logit is ( ˆα + ˆβx) ± 1.96(SE). Substituting the<br />

endpoints of this interval for α + βx in the two exponents in equation (4.4) gives a<br />

corresponding interval for the probability.<br />

For example, at x = 26.5 for the horseshoe crab data, the estimated logit is<br />

−12.351 + 0.497(26.5) = 0.825. Software reports estimated covariance matrix for<br />

( ˆα, ˆβ) of<br />

Estimated Covariance Matrix<br />

Parameter Intercept width<br />

Intercept 6.9102 −0.2668<br />

width −0.2668 0.0103<br />

A covariance matrix has variances of estimates on the main diagonal and covariances<br />

off that diagonal. Here, �Var( ˆα) = 6.9102, �Var( ˆβ) = 0.0103, �Cov( ˆα, ˆβ) =−0.2668.<br />

Therefore, the estimated variance of this estimated logit equals<br />

�Var( ˆα) + x 2 �Var( ˆβ) + 2x �Cov( ˆα, ˆβ) = 6.9102 + (26.5) 2 (0.0103)<br />

+ 2(26.5)(−0.2668)<br />

or 0.038. The 95% confidence interval for the true logit equals 0.825 ± (1.96) √ 0.038,<br />

or (0.44, 1.21). From equation (4.4), this translates <strong>to</strong> the confidence interval<br />

{exp(0.44)/[1 + exp(0.44)], exp(1.21)/[1 + exp(1.21)]} = (0.61, 0.77)<br />

for the probability of satellites at width 26.5 cm.<br />

4.2.7 Standard Errors of Model Parameter Estimates ∗<br />

We have used only a single explana<strong>to</strong>ry variable so far, but the rest of the chapter<br />

allows additional predic<strong>to</strong>rs. The remarks of this subsection apply regardless of the<br />

number of predic<strong>to</strong>rs.<br />

Software fits models and provides the ML parameter estimates. The standard errors<br />

of the estimates are the square roots of the variances from the main diagonal of the<br />

covariance matrix. For example, from the estimated covariance matrix reported above<br />

in Section 4.2.6, the estimated width effect of 0.497 in the logistic regression model<br />

has SE = √ 0.0103 = 0.102.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!