06.06.2013 Views

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

488 6 Statistical Inference Based on Likelihood<br />

pɛ(ɛ) = pɛ(y − Xβ), (6.59)<br />

wrt a given σ-finite measure.<br />

In the linear model (6.58), if ɛ ∼ N(0, σ 2 ), as we usually assume, we can<br />

easily identify ηi, T(yi), and ζ(ηi) in equation (6.32), and <strong>of</strong> course, h(yi) ≡ 1.<br />

This is a location-scale family.<br />

Generalized Linear Models<br />

A model as in equation (6.58) has limitations. Suppose, for example, that we<br />

are interested in modeling a response that is binary, for example, two states <strong>of</strong><br />

a medical patient, “diseased” or “disease-free”. As usual, we set up a random<br />

variable to map the sample space to IR:<br />

Y : {disease-free,diseased} ↦→ {0, 1}.<br />

The linear model X = Zβ + ɛ does not make sense. It is continuous and<br />

unbounded.<br />

A more useful model may address Pr(X = 0).<br />

To make this more concrete, consider the situation in which several groups<br />

<strong>of</strong> subjects are each administered a given dose <strong>of</strong> a drug, and the number<br />

responding in each group is recorded. The data consist <strong>of</strong> the counts yi responding<br />

in the ith group, which received a level xi <strong>of</strong> the drug.<br />

A basic model is<br />

P(Yi = 0|xi) = 1 − πi<br />

(6.60)<br />

P(Yi = 1|xi) = πi<br />

The question is how does π depend on x?<br />

A linear dependence, π = β0+β1x does not fit well in this kind <strong>of</strong> situation<br />

– unless we impose restrictions, π would not be between 0 and 1.<br />

We can try a transformation to [0, 1].<br />

Suppose we impose an invertible function on<br />

that will map it into [0, 1]:<br />

or<br />

η = β0 + β1x<br />

We call this a link function.<br />

A common model following this setup is<br />

π = h(η), (6.61)<br />

g(π) = η. (6.62)<br />

πx = Φ(β0 + β1x), (6.63)<br />

where Φ is the normal cumulative distribution function, and β0 and β1 are<br />

unknown parameters to be estimated. This is called a probit model. The link<br />

function in this case is Φ −1 .<br />

<strong>Theory</strong> <strong>of</strong> <strong>Statistics</strong> c○2000–2013 James E. Gentle

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!