beamer - Vrije Universiteit Amsterdam

More documents

Recommendations

Info

§3.3.3 Markov Chains with Rewards ▶ Let f : I → be a reward or cost function; ▶ ∑ n k=1 f (Xk) is the total reward up to time n; ▶ limn→∞ 1 n ∑ n k=1 f (Xk) is the long-run average reward per unit of time; ▶ We wish to have an ergodic (or Markov-reward) property 1 lim n→∞ n n∑ f (Xk) = ∑ πjf (j) (w.p. 1) k=1 j∈I c⃝ Ad Ridder (VU) SOR– Fall 2012 28 / 36
Finite Unichain Case . Ergodic Theorem Finite Case . Suppose that the Markov chain satisfies the unichain condition with a finite set of transient states (|T| < ∞) and with a finite irreducible set of recurrent states (|R| < ∞). Then . 1 lim n→∞ n n∑ f (Xk) = ∑ πjf (j) (w.p. 1) k=1 Proof: let r be an arbitrary initial state 1 n n∑ k=1 f (Xk) = 1 n k=1 j∈I j∈I n∑ ∑ {Xk = j|X0 = r}f (j) = ∑ ( 1 n j∈I n∑ k=1 ) {Xk = j|X0 = r} f (j). Take n → ∞; interchange limit and finite sum allowed; apply empirical state average property (slide 12). c⃝ Ad Ridder (VU) SOR– Fall 2012 29 / 36
Page 1 and 2: Stochastic Operations Research Lect
Page 3 and 4: Example ⎛ ⎜ P = ⎜ ⎝ 0.6 0.4
Page 5 and 6: Recap ▶ State j transient ⇔ fjj
Page 7 and 8: Proper First-Passage Times . Coroll
Page 9 and 10: Compute Powers of P ⎛ P 129 ⎜ =
Page 11 and 12: Observations We might conclude 1. l
Page 13 and 14: Mean Return Times and Probabilities
Page 15 and 16: Probabilistic Averages II . Theorem
Page 17 and 18: Probabilistic Averages IV . Corolla
Page 19 and 20: Infinite Recurrent Sets In the situ
Page 21 and 22: Infinite Transient or Nonrecurrent
Page 23 and 24: Equilibrium Distribution . Definiti
Page 25 and 26: Limiting Probabilities . Equation (
Page 27: Summary Suppose Assumption 3.3.1 (e
Page 31 and 32: Expected Reward ▶ See Remark 3.3.
Page 33 and 34: Rewrite ▶ Rewrite to classic form
Page 35 and 36: Gauss-Seidel Method ▶ Construct a

beamer - Vrije Universiteit Amsterdam

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?