TUTORIAL 2 SOLUTIONS #7.7.11 Consider a population of size four ...

TUTORIAL 2 SOLUTIONS 

#7.7.11 Consider a population of size four, 

the members of which have values x 1 ,x 2 ,x 3 ,x 4 . 

a. If simple random sampling were used, how 

many samples of size two are there? 

b. Suppose that rather than simple random 

sampling, the following sampling scheme 

is used. The possible samples of size two 

are 

{x 1 ,x 2 }, {x 2 ,x 3 }, {x 3 ,x 4 }, {x 1 ,x 4 } 

and the sampling is done in such a way 

that each of these four possible samples 

is equally likely. Is the sample mean unbiased? 

Solution 

a. The number of simple random samples 

of size 2 = ( 4 

2 

) 

=6. 

1

. Let µ denote the population mean. I.e. 

µ = x 1 + x 2 + x 3 + x 4 

. 

4 

We observe that 

E( ¯X) = 1 4 [x 1 + x 2 

+ x 2 + x 3 

2 2 

+ x 3 + x 4 

+ x 1 + x 4 

] 

2 2 

= x 1 + x 2 + x 3 + x 4 

4 

= µ. 

This implies that the sample mean is an 

unbiased estimate of the population mean 

(even though the sample is not a s.r.s.). 

2

#7.7.12 Consider simple random sampling 

with replacement. 

a. Show that 

s 2 = 1 

n − 1 

n∑ 

(X i − ¯X) 2 

i=1 

is an unbiased estimate of σ 2 . 

b. Is s an unbiased estimate of σ? 

c. Show that n −1 s 2 is an unbiased estimate 

of σ 2¯X. 

d. Show that n −1 N 2 s 2 is an unbiased estimate 

of σ 2 T . 

e. Show that ˆp(1− ˆp)/(n−1) is an unbiased 

estimate of σ 2ˆp . 

Solution First we observe that the sample 

X 1 ,...,X n is an i.i.d. sequence of random 

variables. 

3

a. Let µ = E(X 1 ) and Y i = X i −µ. Then 

E(Y i )=0, 

Var(Y i )=Var(X i )=σ 2 , 

E(s 2 )= 1 n∑ 

E[X 

n − 1 i − µ − ( ¯X − µ)] 2 

i=1 

= 1 n∑ 

E(Y 

n − 1 i − Ȳ )2 

i=1 

= 1 n∑ 

E(Y 2 

n − 1 

i − 2Y iȲ + Ȳ 2 ) 

i=1 

= nσ2 

n − 1 − 

+ n 

n − 1 E(Ȳ 2 ) 

= nσ2 

n − 1 − 

2n 

n − 1 E(Ȳ 2 ) 

n 

n − 1 E(Ȳ 2 ). 

4

Since E(Ȳ 2 ) = Var(Ȳ )=σ2 /n, 

E(s 2 )= 

nσ2 

n − 1 − 

= σ 2 . 

σ2 

n − 1 

This implies that s 2 is an unbiased estimate 

of σ 2 . 

b. No, s is not an unbiased estimate of σ 

since in general 

√ 

E(s) =E( 

√ 

s 2 ) 

< E(s 2 ) 

√ 

= σ 2 = σ. 

This follows from Jensen’s inequality and 

E(s) = √ E(s 2 ) only if s is a constant. 

5

c. From a. we have 

E(n −1 s 2 )= 1 n E(s2 ) 

= σ2 

n 

= σ 2¯X . 

That is n −1 s 2 is an unbiased estimate of the 

variance of ¯X. 

d. Recall that T = N ¯X and Var(T )is 

given by 

Consequently, 

σ 2 T 

=Var(N ¯X) 

= N 2 Var( ¯X) 

= N 2σ2 

n . 

E(n −1 N 2 s 2 )=n −1 N 2 E(s 2 ) 

= n −1 N 2 σ 2 

= σ 2 T . 

6

This shows that n −1 N 2 s 2 is an unbiased estimate 

of σ 2 T . 

e. Recall that ˆp = ¯X when the X i ’s only 

take values 0 or 1. Observe that 

σ 

2ˆp = σ2¯X 

= σ2 

n . 

Consequently, 

E 

ˆp(1 − ˆp) 

n − 1 

= 1 

n − 1 E[1 n 

= 

1 

n(n − 1) 

n∑ 

Xi 2 − ¯X 2 ] 

i=1 

n∑ 

E(X i − ¯X) 2 

i=1 

= 1 n E(s2 )= σ2 

n . 

This shows that ˆp(1 − ˆp)/(n − 1) is an unbiased 

estimate of σ 2ˆp . 7

#7.7.23 

a. Show that the standard error of an estimated 

proportion is largest when p = 

1/2. 

b. Use this result and Corollary B of Section 

7.3.2 to conclude that the quantity 

√ 

1 N − n 

2 N(n − 1) 

is a conservative estimate of the standard 

error of ˆp no matter what the value of p 

may be. 

c. Use the central limit theorem to conclude 

that the interval 

√ 

N − n 

ˆp ± 

N(n − 1) 

contains p with probability at least 0.95. 

8

Solution 

a. Recall that the sample is s.r.s. From 

page 214 of the text, the standard error of ˆp 

is given by 

√ 

σˆp = ( N − n − p) 

)p(1 . 

N − 1 n 

Treating σˆp as a function of p, we note that 

f(p) =p(1 − p), 0 ≤ p ≤ 1, is maximized 

when p =1/2. 

b. Using a. and Corollary B of Chapter 

7.3.2, we have 

ˆp(1 − ˆp) 

s 

2ˆp = 

n − 1 (1 − n N ) 

≤ 1 2 (1 − 1 2 )( 1 

n − 1 )(1 − n N ) 

= N − n 

4N(n − 1) . 

9

Taking square root, we have 

√ 

sˆp ≤ 1 N − n 

2 N(n − 1) . 

The r.h.s. is a conservative estimate of the 

standard error of ˆp. 

c. Using the CLT from s.r.s., a 95% CI for 

p is 

ˆp ± z 0.025 sˆp . 

Since z 0.025 =1.96 ≈ 2, it follows from b. 

that the interval 

ˆp ± 

√ 

N − n 

N(n − 1) . 

contains p with probability at least 0.95. 

10

#7.7.24 For a random sample of size n 

from a population of size N, consider the 

following as an estimate of µ: 

n∑ 

¯X c = c i X i 

i=1 

where the c i are fixed numbers and X 1 ,...,X n 

is the sample. 

a. Find a condition of the c i such that the 

estimate is unbiased. 

b. Show that the choice of c i that minimizes 

the variance of the estimate subject to 

this condition is c i = 1/n, where i = 

1,...,n. 

11

Solution Note that in this problem, a 

random sample is meant an i.i.d. sample. 

a. ¯Xc is an unbiased estimate of the population 

mean µ iff 

µ = E( ¯X 

n∑ 

c )= c i E(X i ) 

=( 

i=1 

n∑ 

c i )µ. 

i=1 

This implies that the condition for unbiasness 

is 

n∑ 

c i =1. 

i=1 

12

. Let Var(X 1 )=σ 2 . Then by the independence 

of X 1 ,...,X n , we have 

Var( ¯X 

n∑ 

c )=Var( c i X i ) 

= 

i=1 

n∑ 

c 2 i Var(X i) 

i=1 

n 

= σ 2 ∑ 

c 2 i . 

i=1 

We want to minimize ∑ n 

∑ i=1 c 2 i 

subject to 

ni=1 

c i =1. 

To do that we minimize the following Lagrangian 

function: 

n∑ 

f(c 1 ,...,c n ,λ)= c 2 n i + λ(1 − ∑ 

c i ). 

i=1 

i=1 

Here λ is the Lagrangian multiplier. 

13

We partial differentiate f wrt c 1 ,...,c n ,λ, 

∂f 

=2c 

∂c i − λ, i =1,...,n, 

i 

∂f 

n 

∂λ =1− ∑ 

c i . 

i=1 

Equating these partial derivatives to 0, we 

have 

c i = λ 2 , i =1,...,n, 

n∑ 

1= 

i=1 

= λn 2 . 

c i 

Solving for c i and λ, we have 

λ = 2 n , 

c i = 1 n , i =1,...,n. 

14

By taking 2nd partial derivatives of f, it 

can be shown that this gives the minimum 

(and not maximum) of f. 

15

TUTORIAL 2 SOLUTIONS #7.7.11 Consider a population of size four ...

Create successful ePaper yourself

Delete template?

Save as template?