20.03.2015 Views

The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits

The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits

The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Creating a Hypotheses<br />

• Simple two <strong>armed</strong> case<br />

• Remember binary thresholds<br />

• Want to learn the threshold value<br />

ε<br />

t<br />

ε<br />

If x < t : pick arm 1<br />

x > t : pick arm 2

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!