06.06.2013 Views

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

298 3 Basic Statistical <strong>Theory</strong><br />

which the sample was drawn. In the jackknife method, we compute the statistic<br />

T using only a subset <strong>of</strong> size n − d <strong>of</strong> the given dataset; that is, we delete<br />

a set <strong>of</strong> size d.<br />

There are <strong>of</strong> course<br />

C n d =<br />

<br />

n<br />

d<br />

such sets.<br />

Let T(−j) denote the estimator computed from the sample with the j th<br />

set <strong>of</strong> observations removed; that is, T(−j) is based on a sample <strong>of</strong> size n − d.<br />

The estimator T(−j) has properties similar to those <strong>of</strong> T. For example, if T is<br />

unbiased, so is T(−j). If T is not unbiased, neither is T(−j); its bias, however,<br />

is likely to be different.<br />

The mean <strong>of</strong> the T(−j),<br />

T (•) = 1<br />

C n d<br />

C n d<br />

<br />

T(−j), (3.161)<br />

can be used as an estimator <strong>of</strong> θ. The T(−j) may also provide some information<br />

about the estimator T from the full sample.<br />

For the case in which T is a linear functional <strong>of</strong> the ECDF, then T (•) =<br />

T, so the systematic partitioning <strong>of</strong> a random sample will not provide any<br />

additional information.<br />

Consider the weighted differences in the estimate for the full sample and<br />

the reduced samples:<br />

T ∗ j = nT − (n − d)T(−j). (3.162)<br />

The T ∗ j are called “pseudovalues”. (If T is a linear functional <strong>of</strong> the ECDF<br />

and d = 1, then T ∗ j = T(xj); that is, it is the estimator computed from the<br />

single observation, xj.)<br />

We call the mean <strong>of</strong> the pseudovalues the “jackknifed” T and denote it as<br />

J(T):<br />

J(T) = 1<br />

C n d<br />

j=1<br />

C n<br />

d<br />

<br />

j=1<br />

T ∗ j<br />

= T ∗ . (3.163)<br />

In most applications <strong>of</strong> the jackknife, it is common to take d = 1, in which<br />

case Cn d = n. The term “jackknife” is <strong>of</strong>ten reserved to refer to the case <strong>of</strong><br />

d = 1, and if d > 1, the term “delete d jackknife” is used. In the case <strong>of</strong> d = 1,<br />

we can also write J(T) as<br />

J(T) = T + (n − 1) <br />

T − T(•)<br />

or<br />

<strong>Theory</strong> <strong>of</strong> <strong>Statistics</strong> c○2000–2013 James E. Gentle

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!