06.06.2013 Views

STOCHASTIC

STOCHASTIC

STOCHASTIC

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

W. T. ZIEMBA<br />

Prob(Xt+i£z\(X,,t) = x,dx) is called the probability transition function of<br />

the process and governs its evolution. The transition from stage t to stage<br />

t +1 may be viewed as shown in Fig. 1. The mechanism inside the box may be<br />

Current state (a>, r)<br />

Decision c/x<br />

New state (o/, /+l)<br />

Fig. 1. Transition from stage / to /+1.<br />

thought of as that which chooses a new state from £2(+t via the inputs and the<br />

probability distribution over Clt+l. Implicit in this process is the assumption<br />

that given x = (X,,t) and dx, the random variable Xt+1 is independent of<br />

the random variables Xs,s < t, and their corresponding decisions. Note that<br />

the deterministic case is that special instance when all probability mass in<br />

each stage t falls upon one member of CI,.<br />

A policy 5 is an ordered collection of decisions containing one decision<br />

for each state in Q. The policy space A is the collection of all such policies.<br />

Thus, a policy 5 e A prescribes a particular decision for each and every<br />

state x e £2 and the policy space consists of all possible combinations of<br />

decisions at the various states. The policy space is the Cartesian product of<br />

all the decision sets; that is, A = Ylx e n At • The symbol 5X will frequently<br />

be used to denote the decision in d that applies to state x. Given a particular<br />

policy (5, the process {(X,,t\te T} is Markovian. That is, given the present<br />

state, the futuie evolution of the process is independent of the past. Each<br />

policy d is assumed to have a real-valued return function Vd{x) which reflects<br />

the net reward that would accrue if the process were started in state x and<br />

the specified decisions in

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!