The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits
Creating a Hypothesis
• Simple two-armed case
• Remember binary thresholds
• Want to learn the threshold value
[Figure: threshold t, with ε marked on either side]
If x < t: pick arm 1
If x > t: pick arm 2
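
The threshold rule above can be sketched in code as a function from a context x to an arm. This is a minimal illustration, not the presenter's implementation: the names `make_threshold_hypothesis` and `make_hypothesis_class` are hypothetical, the contexts are assumed to lie in [0, 1], and the case x = t, which the slide leaves unspecified, is sent to arm 2 here.

```python
from typing import Callable, List

# A hypothesis maps a context x to an arm index (1 or 2).
Hypothesis = Callable[[float], int]


def make_threshold_hypothesis(t: float) -> Hypothesis:
    """Return the policy that picks arm 1 when x < t and arm 2 otherwise."""
    def choose_arm(x: float) -> int:
        # Assumption: the slide does not say what happens at x == t;
        # ties go to arm 2 in this sketch.
        return 1 if x < t else 2
    return choose_arm


def make_hypothesis_class(num_thresholds: int) -> List[Hypothesis]:
    """Build a finite class of threshold policies on an even grid over [0, 1]."""
    # Assumption: contexts lie in [0, 1] and num_thresholds is at least 2.
    step = 1.0 / (num_thresholds - 1)
    return [make_threshold_hypothesis(i * step) for i in range(num_thresholds)]


if __name__ == "__main__":
    h = make_threshold_hypothesis(0.5)
    print(h(0.2), h(0.8))  # prints: 1 2
```

Learning the threshold value then amounts to choosing, from a class like the one returned by `make_hypothesis_class`, the hypothesis whose arm choices earn the most reward on the observed data.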