13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.5 FITTING GENERALIZED LINEAR MODELS 89<br />

3.5.2 Wald, Likelihood-Ratio, and Score Inference Use<br />

the Likelihood Function<br />

Figure 3.7 shows a generic plot of a log-likelihood function L for a parameter β.<br />

This plot illustrates the Wald, likelihood-ratio, and score tests of H0: β = 0. The<br />

log-likelihood function for some GLMs, including binomial logistic regression<br />

models and Poisson loglinear models, has concave (bowl) shape. The ML estimate ˆβ is<br />

the point at which the log-likelihood takes its highest value. The Wald test is based on<br />

the behavior of the log-likelihood function at the ML estimate ˆβ, having chi-squared<br />

form ( ˆβ/SE) 2 . The SE of ˆβ depends on the curvature of the log-likelihood function<br />

at the point where it is maximized, with greater curvature giving smaller SE values.<br />

The score test is based on the behavior of the log-likelihood function at the null<br />

value for β of 0. It uses the size of the derivative (slope) of the log-likelihood function,<br />

evaluated at the null hypothesis value of the parameter. The derivative at β = 0 tends<br />

<strong>to</strong> be larger in absolute value when ˆβ is further from that null value. The score statistic<br />

also has an approximate chi-squared distribution with df = 1. We shall not present<br />

the general formula for score statistics, but many test statistics in this text are this<br />

type. An example is the Pearson statistic for testing independence. An advantage of<br />

the score statistic is that it exists even when the ML estimate ˆβ is infinite. In that case,<br />

one cannot compute the Wald statistic.<br />

The likelihood-ratio test combines information about the log-likelihood function<br />

both at ˆβ and at the null value for β of 0. It compares the log-likelihood values L1<br />

at ˆβ and L0 at β = 0 using the chi-squared statistic −2(L0 − L1). In Figure 3.7, this<br />

statistic is twice the vertical distance between values of the log-likelihood function at<br />

ˆβ and at β = 0. In a sense, this statistic uses the most information of the three types<br />

of test statistic. It is usually more reliable than the Wald statistic, especially when n<br />

is small <strong>to</strong> moderate.<br />

Figure 3.7. Information used in Wald, likelihood-ratio, and efficient score tests.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!