05.12.2012 Views

Student Notes To Accompany MS4214: STATISTICAL INFERENCE

Student Notes To Accompany MS4214: STATISTICAL INFERENCE

Student Notes To Accompany MS4214: STATISTICAL INFERENCE

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

When θ ∈ R1 we can define the score function as the first derivative of the log-likelihood<br />

S(θ) = ∂<br />

ln L(θ).<br />

∂θ<br />

The maximum likelihood estimate (MLE) ˆ θ of θ is the solution to the score equation<br />

S(θ) = 0.<br />

At the maximum, the second partial derivative of the log-likelihood is negative, so we<br />

define the curvature at ˆ θ as I( ˆ θ) where<br />

I(θ) = − ∂2<br />

ln L(θ).<br />

∂θ2 We can check that a solution ˆ θ of the equation S(θ) = 0 is actually a maximum by<br />

checking that I( ˆ θ) > 0. A large curvature I( ˆ θ) is associated with a tight or strong<br />

peak, intuitively indicating less uncertainty about θ. In likelihood theory I(θ) is a<br />

key quantity called the observed Fisher information, and I( ˆ θ) is the observed Fisher<br />

information evaluated at the MLE ˆ θ. Although I(θ) is a function, I( ˆ θ) is a scalar.<br />

The likelihood function L(θ|x) supplies an order of preference or plausibility among<br />

possible values of θ based on the observed y. It ranks the plausibility of possible values<br />

of θ by how probable they make the observed y. If P (x|θ = θ1) > P (x|θ = θ2) then<br />

the observed x makes θ = θ1 more plausible than θ = θ2, and consequently from<br />

(2.2), L(θ1|x) > L(θ2|x). The likelihood ratio L(θ1|x)/L(θ2|x) = f(θ1|x)/f(θ2|x) is<br />

a measure of the plausibility of θ1 relative to θ2 based on the observed fact y. The<br />

relative likelihood L(θ1|x)/L(θ2|x) = k means that the observed value x will occur k<br />

times more frequently in repeated samples from the population defined by the value θ1<br />

than from the population defined by θ2. Since only ratios of likelihoods are meaningful,<br />

it is convenient to standardize the likelihood with respect to its maximum. Define the<br />

relative likelihood as R(θ|x) = L(θ|x)/L( ˆ θ|x). The relative likelihood varies between 0<br />

and 1. The MLE ˆ θ is most plausible value of θ in that it makes the observed sample<br />

most probable. The relative likelihood measures the plausibility of any particular value<br />

of θ relative to that of ˆ θ.<br />

When the random variables X1, . . . , Xn are mutually independent we can write the<br />

joint density as<br />

fX(x) =<br />

j=1<br />

where x = (x1, . . . , xn) ′ is a realization of the random vector X = (X1, . . . , Xn) ′ , and<br />

the likelihood function becomes<br />

LX(θ|x) =<br />

n�<br />

n�<br />

j=1<br />

fXj (xj)<br />

fXj (xj|θ).<br />

When the densities fXj (xj) are identical, we unambiguously write f(xj).<br />

19

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!