ST3239: Survey Methodology - The Department of Statistics and ...
ST3239: Survey Methodology - The Department of Statistics and ...
ST3239: Survey Methodology - The Department of Statistics and ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>The</strong>orem 2.2.2<br />
E(ȳ) = µ,<br />
V ar(ȳ) = σ2<br />
n<br />
( ) N − n<br />
.<br />
N − 1<br />
Pro<strong>of</strong>.<br />
Note ȳ = 1 n (y 1 + ... + y n ). So<br />
E(ȳ) = 1 n (Ey 1 + ... + Ey n ) = 1 (nµ) = µ.<br />
n<br />
Now<br />
V ar(ȳ) = 1 n Cov( ∑ n n∑<br />
y 2 i , y j ) = 1 ∑ n n∑<br />
Cov(y<br />
i=1 j=1<br />
n 2<br />
i , y j )<br />
i=1 j=1<br />
⎛<br />
⎞<br />
= 1 ⎝ ∑ Cov(y<br />
n 2 i , y j ) + ∑ Cov(y i , y j ) ⎠<br />
i≠j<br />
i=j<br />
⎛<br />
⎞<br />
= 1 ⎝ ∑ (− σ2 n<br />
n 2 i≠j<br />
N − 1 ) + ∑<br />
V ar(y i ) ⎠<br />
i=1<br />
= 1 (<br />
)<br />
n(n − 1)(−<br />
σ2<br />
n 2 N − 1 ) + nσ2<br />
(<br />
(n − 1)(− 1<br />
= σ2<br />
n<br />
= σ2<br />
n<br />
( ) N − n<br />
N − 1<br />
N − 1 ) + 1 )<br />
Remark: From <strong>The</strong>orem 2.2.2, we see that ȳ is an unbiased estimator for µ. Also as n gets large<br />
(but n ≤ N), V ar(ȳ) tends to 0. This implies that ȳ will be a more accurate estimator for µ as n gets<br />
larger (but less than N). In particular, when n = N, we have a census <strong>and</strong> V ar(ȳ) = 0.<br />
Remark: In our previous statistics course, we usually sample {y 1 , y 2 , · · · , y n } from the population<br />
with replacement. <strong>The</strong>refore, {y 1 , y 2 , · · · , y n } are independent <strong>and</strong> identically distributed (i.i.d.).<br />
And recall we have results like<br />
E iid (ȳ) = µ,<br />
V ar iid (ȳ) = σ2<br />
n .<br />
Notice that V ar iid (ȳ) is different from V ar(ȳ) in <strong>The</strong>orem 2.2.2. In fact, for n > 1,<br />
V ar(ȳ) = σ2<br />
n<br />
( ) N − n<br />
< σ2<br />
N − 1 n = V ar iid(ȳ).<br />
Thus, for the same sample size n, sampling without replacement produces a less variable estimator <strong>of</strong><br />
µ. Why?<br />
7