Slides in PDF - of Marcus Hutter

More documents

Recommendations

Info

Marcus Hutter - 98 - Universal Induction & Intelligence Feature Reinforcement Learning (FRL) Goal: Develop efficient general purpose intelligent agent. State-of-the-art: (a) AIXI: Incomputable theoretical solution. (b) MDP: Efficient limited problem class. (c) POMDP: Notoriously difficult. (d) PSRs: Underdeveloped. Idea: ΦMDP reduces real problem to MDP automatically by learning. Accomplishments so far: (i) Criterion for evaluating quality of reduction. (ii) Integration of the various parts into one learning algorithm. (iii) Generalization to structured MDPs (DBNs) ΦMDP is promising path towards the grand goal & alternative to (a)-(d) Problem: Find reduction Φ efficiently (generic optimization problem)
Marcus Hutter - 99 - Universal Induction & Intelligence Markov Decision Processes (MDPs) a computationally tractable class of problems • MDP Assumption: State s t := o t and r t are probabilistic functions of o t−1 and a t−1 only. • Further Assumption: State=observation space § is finite and small. Example MDP ✞ ✎☞ ✎☞ ✝✲ ✍✌ s 1 ✲ ✍✌ s 3 r 2 r 1 ✻ ✒ ✎☞ ✠ ✎☞ ❄ ✛ r 4 ☎ r 3 ✍✌ s 4 ✛ ✍✌ s 2 ✆ • Goal: Maximize long-term expected reward. • Learning: Probability distribution is unknown but can be learned. • Exploration: Optimal exploration is intractable but there are polynomial approximations. • Problem: Real problems are not of this simple form.
Page 1 and 2:
Foundations of Universal Induction
Page 3 and 4:
Marcus Hutter - 3 - Universal Induc
Page 5 and 6:
Page 7 and 8:
Page 9 and 10:
Page 11 and 12:
Marcus Hutter - 11 - Universal Indu
Page 13 and 14:
Page 15 and 16:
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
Page 47 and 48: Marcus Hutter - 47 - Universal Indu
Page 91 and 92: Language Tree (Re)construction base
Page 97: Marcus Hutter - 97 - Universal Indu
Page 101 and 102: Marcus Hutter - 101 - Universal Ind
Page 111: Marcus Hutter - 111 - Universal Ind
show all

Slides in PDF - of Marcus Hutter

Create successful ePaper yourself

Delete template?

Save as template?