06.06.2013 Views

STOCHASTIC

STOCHASTIC

STOCHASTIC

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

INTRODUCTION TO DYNAMIC PROGRAMMING<br />

such that T"*(x) = 0. Naturally nx may vary with x. The termination condition<br />

implies that for each x e Q, nx is the smallest positive integer such that<br />

r»(x) = 0foralln^«x.<br />

For m 2: 1, let Sm be the subset of Q containing those points from which<br />

termination always occurs in m transitions or less but does not always occur<br />

in fewer than m transitions. That is, let Sm = {x e Q. | nx=m}. The termination<br />

assumption implies that Q= \J%=1Sm. Similarly, let S m be the set of all<br />

states in Q. from which termination always occurs in m transitions or less;<br />

that is, let 5"" = (Ji"=1 St. Should there exist an integer N such that nx ^ N<br />

for every xeQ and nx = N for at least one x, then the process is called an<br />

N-stage sequential decision process. N-stage processes cover a wide variety<br />

of important applications.<br />

The term "regeneration point" arises since the process is Markovian.<br />

Hence the probability law governing transitions from a given state is independent<br />

of what states preceded it. Here, the return vd(x) does not depend on<br />

how state x was attained. Note that {x} u Qx contains x and all states which<br />

are accessible from x. The following lemma demonstrates that v5 (x) depends<br />

only on those decisions in

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!