CHAPTER 2: Markov Chains (part 3)
CHAPTER 2: Markov Chains (part 3)
CHAPTER 2: Markov Chains (part 3)
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
we have<br />
v = 1/(1 − β)<br />
Actually, in this example we can calculate v and u directly.<br />
Example A Maze A white rat is put into the maze<br />
1 2 3<br />
shock<br />
4 5 6<br />
7 8 9<br />
shock<br />
food<br />
In the absence of learning, one might hypothesize that the rat would move through the maze<br />
at random, i.e. if there are k ways to leave a com<strong>part</strong>ment, then the rat would choose each<br />
of them with equal probability 1/k. Assume that the rat makes one changes to some adjacent<br />
com<strong>part</strong>ment at each unit of time. Let X n be the com<strong>part</strong>ment occupied at stage n. Suppose<br />
that com<strong>part</strong>ment 9 contains food and 3 and 7 contain electrical shocking mechanisms.<br />
1. what is the transition probability matrix<br />
P =<br />
1 2 3 4 5 6 7 8 9<br />
1 0 1/2 0 1/2 0 0 0 0 0<br />
2 1/3 0 1/3 0 1/3 0 0 0 0<br />
3 0 0 1 0 0 0 0 0 0<br />
4 1/3 0 0 0 1/3 0 1/3 0 0<br />
5 0 1/4 0 1/4 0 1/4 0 1/4 0<br />
6 0 0 1/3 0 1/3 0 0 0 1/3<br />
7 0 0 0 0 0 0 1 0 0<br />
8 0 0 0 0 1/3 0 1/3 0 1/3<br />
9 0 0 0 0 0 0 0 0 1<br />
2. If the rat starts in com<strong>part</strong>ment 1, what is the probability that the rate encounters food<br />
before being shocked.<br />
Let T = min{n : X n = 3 or X n = 7 or X n = 9}. Let u i = E(X T = 9|X 0 = i), i,e. the<br />
probability that the rat absorbed by food. Note that there are 3 absorbing states 3, 7 and<br />
9. It is easy to see that<br />
u 3 = 0, u 7 = 0, u 9 = 1<br />
2