06.06.2013 Views

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

588 8 Nonparametric and Robust Inference<br />

<br />

E(pK(y)) ≈<br />

IRd <br />

κ(u) p(y) − (V u) T ∇p(y) + 1<br />

2 (V u)T <br />

Hp(y)V u du<br />

= p(y) − 0 + 1<br />

2 trace V T Hp(y)V . (8.63)<br />

To second order in the elements <strong>of</strong> V (that is, to terms in O(|V | 2 )), the bias<br />

at the point y is therefore<br />

1<br />

2 trace V V T Hp(y) . (8.64)<br />

Using the same kinds <strong>of</strong> expansions and approximations as in equations<br />

(8.62) and (8.63) to evaluate E (pK(y)) 2 to get an expression <strong>of</strong> order<br />

O(|V |/n), and subtracting the square <strong>of</strong> the expectation in equation (8.63),<br />

we get the approximate variance at y as<br />

or<br />

V(pK(y)) ≈ p(y)<br />

n|V |<br />

<br />

IR d<br />

Integrating this, because p is a density, we have<br />

(κ(u)) 2 du,<br />

V(pK(y)) ≈ p(y)<br />

S(κ). (8.65)<br />

n|V |<br />

AIV pK<br />

S(κ)<br />

= , (8.66)<br />

n|V |<br />

and integrating the square <strong>of</strong> the asymptotic bias in expression (8.64), we have<br />

AISB <br />

1<br />

pK =<br />

4<br />

<br />

T<br />

trace V Hp(y)V 2<br />

dy. (8.67)<br />

IR d<br />

These expressions are much simpler in the univariate case, where the<br />

smoothing matrix V is the smoothing parameter or window width h. We have<br />

a simpler approximation for E(pK(y)) than that given in equation (8.63),<br />

E(pK(y)) ≈ p(y) + 1<br />

2 h2p ′′ <br />

(y) u 2 κ(u)du,<br />

and from this we get a simpler expression for the AISB. After likewise simplifying<br />

the AIV, we have<br />

IR<br />

AMISE S(κ) 1<br />

pK = +<br />

nh 4 σ4 κh4R(p), (8.68)<br />

where we have left the kernel unscaled (that is, u 2 κ(u)du = σ 2 K ).<br />

Minimizing this with respect to h, we have the optimal value <strong>of</strong> the smoothing<br />

parameter<br />

<strong>Theory</strong> <strong>of</strong> <strong>Statistics</strong> c○2000–2013 James E. Gentle

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!