28.01.2015 Views

Slides in PDF - of Marcus Hutter

Slides in PDF - of Marcus Hutter

Slides in PDF - of Marcus Hutter

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Marcus</strong> <strong>Hutter</strong> - 34 - Universal Induction & Intelligence<br />

Sequential Decisions<br />

A prediction is very <strong>of</strong>ten the basis for some decision. The decision<br />

results <strong>in</strong> an action, which itself leads to some reward or loss.<br />

Let Loss(x t , y t ) ∈ [0, 1] be the received loss when tak<strong>in</strong>g action y t ∈Y<br />

and x t ∈X is the t th symbol <strong>of</strong> the sequence.<br />

For <strong>in</strong>stance, decision Y ={umbrella, sunglasses} based on weather<br />

forecasts X ={sunny, ra<strong>in</strong>y}. Loss sunny ra<strong>in</strong>y<br />

umbrella 0.1 0.3<br />

sunglasses 0.0 1.0<br />

The goal is to m<strong>in</strong>imize the µ-expected loss. More generally we def<strong>in</strong>e<br />

the Λ σ prediction scheme, which m<strong>in</strong>imizes the σ-expected loss:<br />

∑<br />

y Λ σ<br />

t := arg m<strong>in</strong> σ(x t |x

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!