Slides in PDF - of Marcus Hutter
Slides in PDF - of Marcus Hutter
Slides in PDF - of Marcus Hutter
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Marcus</strong> <strong>Hutter</strong> - 34 - Universal Induction & Intelligence<br />
Sequential Decisions<br />
A prediction is very <strong>of</strong>ten the basis for some decision. The decision<br />
results <strong>in</strong> an action, which itself leads to some reward or loss.<br />
Let Loss(x t , y t ) ∈ [0, 1] be the received loss when tak<strong>in</strong>g action y t ∈Y<br />
and x t ∈X is the t th symbol <strong>of</strong> the sequence.<br />
For <strong>in</strong>stance, decision Y ={umbrella, sunglasses} based on weather<br />
forecasts X ={sunny, ra<strong>in</strong>y}. Loss sunny ra<strong>in</strong>y<br />
umbrella 0.1 0.3<br />
sunglasses 0.0 1.0<br />
The goal is to m<strong>in</strong>imize the µ-expected loss. More generally we def<strong>in</strong>e<br />
the Λ σ prediction scheme, which m<strong>in</strong>imizes the σ-expected loss:<br />
∑<br />
y Λ σ<br />
t := arg m<strong>in</strong> σ(x t |x