09.01.2013 Views

IEOR 263B, Homework 8 Solution

IEOR 263B, Homework 8 Solution

IEOR 263B, Homework 8 Solution

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

(Adapted from Yifen Chen’s solution)<br />

1 Problem 1<br />

<strong>IEOR</strong> <strong>263B</strong>, <strong>Homework</strong> 8 <strong>Solution</strong><br />

Due April 16, 2009<br />

(a) Assuming that Yi is exponential with mean 8 with T = 10, compute αt and α<br />

αt = 1<br />

�<br />

� �<br />

∞<br />

αt+1F (αt+1) + ydF (y) =<br />

1 + r<br />

1<br />

�<br />

� ∞ � �<br />

αt+1<br />

αt+1F (αt+1) + ydF (y) − ydF (y)<br />

1 + r<br />

0<br />

0<br />

Then we have<br />

αt+1<br />

α ′ = [α1, α2, α3, α4, α5, α6, α7, α8, α9, α10]<br />

= [17.8835, 17.3236, 16.6750, 15.9140, 15.0065, 13.8988, 12.4998, 10.6318, 7.8431, 0.0000]<br />

By setting the total period as 500, we can find that the threshold α converge to 22.8871.<br />

(b) Repeat part (a) when the mean is 14.<br />

When mean=14, we have<br />

α ′ = [α1, α2, α3, α4, α5, α6, α7, α8, α9, α10]<br />

= [31.2961, 30.3162, 29.1812, 27.8497, 26.2614, 24.3228, 21.8747, 18.6057, 13.7255, 0.0000]<br />

By setting the total period as 500, we can find that the threshold α converge to 40.0525.<br />

(c) A number of you had problems in this part.<br />

1. The prior probability of the mean is 8 (λ = 1/8)is ρ0 By Bayes’ rule, the posterior probability<br />

ρt+1 = P [mean = 8|Y1, Y2, . . . , Yt+1] = 1 − P [mean = 14|Y1, Y2, . . . , Yt+1]<br />

=<br />

f(yt+1|λ = 1/8)ρt<br />

f(yt+1|λ = 1/8)ρt + f(yt+1|λ = 1/14)(1 − ρt)<br />

2. Using the approximation method, run the simulation N times. The number N should be large<br />

enough to ensure that your calculation of E(Profit, ρ0 = x), x = 0.1, . . . , 1.0 has converged. For<br />

example, compute the expected values for N = 1000, N = 5000, N = 10000, and see the percentage<br />

change in the values. You could set a criterion such as “choose N such that increasing N changes<br />

the expected values by less than 1%”. I don’t think any of you did a convergence study.<br />

3. For sensitivity, you want to look at something like<br />

E(Profit, ρ0 = 1.0) − E(Profit, ρ0 = 0.1)<br />

,<br />

E(Profit, ρ0 = 0.5)<br />

i.e. the relative range size. A statement like “the profit is insensitive to the prior because the total<br />

range is 2” is incorrect; as a counter-example, if the E(Profit, ρ0 = 0.5) = 0.1, then this range is<br />

actually huge. A number of you plotted E(Profit, ρ0 = x), x = 0.1, . . . , 1.0 against ρ0, which is<br />

good, but you really want to rescale the y-axis by E(Profit, ρ0 = 0.5) to get the relative comparison.<br />

1


2 Problem 2<br />

In each period, the amount of money the blackmailer already have is fixed and not relevant for decision<br />

making. Hence, in each period, the blackmailer only need consider the future return. In the last period, the<br />

value function of the blackmailer is<br />

�<br />

VN (uN ) = max R, max<br />

uN ∈[0,1] {uN(1<br />

�<br />

− uN)} = max<br />

�<br />

� �<br />

R, max − uN −<br />

uN ∈[0,1]<br />

1<br />

�2 2<br />

+ 1<br />

��<br />

4<br />

Hence, we can see that in period N, the blackmailer will offer a lump sum R to the victim if R > 1/4, otherwise,<br />

the blackmailer will demand a payment of u = 1/2 and the expected return is 1/4.<br />

In period t < N, if the blackmailer decide to demand a payment of ut, the probability that he can keep<br />

blackmailing is 1 − u and he can get u plus the future return, Vt+1(ut+1). So, the value function is<br />

�<br />

�<br />

Vt(u) = max R, max {(1 − u)[u + Vt+1(ut+1)]}<br />

(2)<br />

= max<br />

= max<br />

�<br />

u∈[0,1]<br />

�<br />

R, max<br />

u∈[0,1]<br />

� �<br />

R, max<br />

u∈[0,1]<br />

−<br />

−<br />

�<br />

u −<br />

�<br />

u −<br />

We can analyze this problem according to the value of R.<br />

First, observe that in period N − 1, if R ≥ 1/4, we have<br />

� ��<br />

2<br />

2<br />

(1 − Vt+1)<br />

(1 − Vt+1)<br />

+ Vt+1 +<br />

2<br />

4<br />

�2 (1 − Vt+1)<br />

+<br />

2<br />

(1 + Vt+1) 2<br />

��<br />

4<br />

VN−1(u) = max{R, max {(1 − u)(u + R)}}<br />

u∈[0,1]<br />

� � �<br />

� ��<br />

2<br />

(1 − R) (1 − R)2<br />

= max R, max − u − + R +<br />

u∈[0,1]<br />

2<br />

4<br />

= R +<br />

(1 − R)2<br />

4<br />

In this case, if R < 1, demanding a payment u is better than accepting a lump sum and the optimal uN−1<br />

will be (1 − R)/2. Also, we have VN−1 = (1+R)2<br />

< 1. Then in the previous period, from equation (2), u will be<br />

(1−VN−1)<br />

2<br />

(1+VN−1) 2<br />

4<br />

and VN−2 = 4 < 1. Hence, we know that even though the Value function is nondecreasing<br />

in −t, it is bounded above by one. This tell us that the blackmailer will always choose to demand a payment<br />

.<br />

ut = (1+VN−1)<br />

2<br />

However, if R ≥ 1, the optimal u is 0 because u ≥ 0 and the last term in the maximization is concave in u.<br />

The blackmailer gets the same return no matter what his decision is. However, we know that waiting for one<br />

period doesn’t give the blackmailer any benefit. Hence, we know if R > 1, the blackmailer will accept a lump<br />

sum R in the first period. If R < 1/4, we have<br />

� � �<br />

VN−1(u) = max R, max (1 − u) u +<br />

u∈[0,1]<br />

1<br />

���<br />

4<br />

� � �<br />

= max R, max − u −<br />

u∈[0,1]<br />

3<br />

�2 +<br />

8<br />

1<br />

��<br />

9<br />

+<br />

4 64<br />

= 1 9<br />

+<br />

4 64<br />

Again, demanding a payment u is better than accepting a lump sum. The optimal uN−1 will be 3/8. This is<br />

just the same as the case that R = 1/4.<br />

In conclusion, we have<br />

1. If R ≥ 1, the optimal policy is accept a lump sum R in the first period.<br />

2<br />

(1)<br />

(3)<br />

(4)


2. If 1 > R ≤ 1/4, the optimal policy is demanding a payment uk = (1+Vk+1)<br />

2 when k < N. And, in the last<br />

period, accept R instead of require a payment u.<br />

3. If R < 1/4, the optimal policy is always demanding a payment uk = (1+Vk+1)<br />

2 while the last u should be<br />

1/2 to maximize the whole profit.<br />

3

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!