29.07.2014 Views

CHAPTER 2: Markov Chains (part 3)

CHAPTER 2: Markov Chains (part 3)

CHAPTER 2: Markov Chains (part 3)

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

we have<br />

v = 1/(1 − β)<br />

Actually, in this example we can calculate v and u directly.<br />

Example A Maze A white rat is put into the maze<br />

1 2 3<br />

shock<br />

4 5 6<br />

7 8 9<br />

shock<br />

food<br />

In the absence of learning, one might hypothesize that the rat would move through the maze<br />

at random, i.e. if there are k ways to leave a com<strong>part</strong>ment, then the rat would choose each<br />

of them with equal probability 1/k. Assume that the rat makes one changes to some adjacent<br />

com<strong>part</strong>ment at each unit of time. Let X n be the com<strong>part</strong>ment occupied at stage n. Suppose<br />

that com<strong>part</strong>ment 9 contains food and 3 and 7 contain electrical shocking mechanisms.<br />

1. what is the transition probability matrix<br />

P =<br />

1 2 3 4 5 6 7 8 9<br />

1 0 1/2 0 1/2 0 0 0 0 0<br />

2 1/3 0 1/3 0 1/3 0 0 0 0<br />

3 0 0 1 0 0 0 0 0 0<br />

4 1/3 0 0 0 1/3 0 1/3 0 0<br />

5 0 1/4 0 1/4 0 1/4 0 1/4 0<br />

6 0 0 1/3 0 1/3 0 0 0 1/3<br />

7 0 0 0 0 0 0 1 0 0<br />

8 0 0 0 0 1/3 0 1/3 0 1/3<br />

9 0 0 0 0 0 0 0 0 1<br />

2. If the rat starts in com<strong>part</strong>ment 1, what is the probability that the rate encounters food<br />

before being shocked.<br />

Let T = min{n : X n = 3 or X n = 7 or X n = 9}. Let u i = E(X T = 9|X 0 = i), i,e. the<br />

probability that the rat absorbed by food. Note that there are 3 absorbing states 3, 7 and<br />

9. It is easy to see that<br />

u 3 = 0, u 7 = 0, u 9 = 1<br />

2

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!