10.07.2015 Views

Information Theory, Inference, and Learning ... - Inference Group

Information Theory, Inference, and Learning ... - Inference Group

Information Theory, Inference, and Learning ... - Inference Group

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links.36.3: Further exercises 455(c) Assuming you wish very much to marry the most desirable person,<strong>and</strong> that your strategy will be ‘strategy M’:Strategy M – Meet the first M partners <strong>and</strong> say no to allof them. Memorize the maximum desirability d max amongthem. Then meet the others in sequence, waiting until apartner with d n > d max comes along, <strong>and</strong> marry them.If none more desirable comes along, marry the final Nthpartner (<strong>and</strong> feel miserable).– what is the optimal value of M?Exercise 36.7. [3 ] Regret as an objective function?The preceding exercise (parts b <strong>and</strong> c) involved a utility function basedon regret. If one married the tenth most desirable c<strong>and</strong>idate, the utilityfunction asserts that one would feel regret for having not chosen themost desirable.Many people working in learning theory <strong>and</strong> decision theory use ‘minimizingthe maximal possible regret’ as an objective function, but doesthis make sense?Imagine that Fred has bought a lottery ticket, <strong>and</strong> offers to sell it to youbefore it’s known whether the ticket is a winner. For simplicity say theprobability that the ticket is a winner is 1/100, <strong>and</strong> if it is a winner, itis worth £10. Fred offers to sell you the ticket for £1. Do you buy it?The possible actions are ‘buy’ <strong>and</strong> ‘don’t buy’. The utilities of the fourpossible action–outcome pairs are shown in table 36.2. I have assumedthat the utility of small amounts of money for you is linear. If you don’tbuy the ticket then the utility is zero regardless of whether the ticketproves to be a winner. If you do buy the ticket you end up either losingone pound (with probability 99/100) or gaining nine (with probability1/100). In the minimax regret community, actions are chosen to minimizethe maximum possible regret. The four possible regret outcomesare shown in table 36.3. If you buy the ticket <strong>and</strong> it doesn’t win, youhave a regret of £1, because if you had not bought it you would havebeen £1 better off. If you do not buy the ticket <strong>and</strong> it wins, you havea regret of £9, because if you had bought it you would have been £9better off. The action that minimizes the maximum possible regret isthus to buy the ticket.Discuss whether this use of regret to choose actions can be philosophicallyjustified.The above problem can be turned into an investment portfolio decisionproblem by imagining that you have been given one pound to invest intwo possible funds for one day: Fred’s lottery fund, <strong>and</strong> the cash fund. Ifyou put £f 1 into Fred’s lottery fund, Fred promises to return £9f 1 to youif the lottery ticket is a winner, <strong>and</strong> otherwise nothing. The remaining£f 0 (with f 0 = 1 − f 1 ) is kept as cash. What is the best investment?Show that the minimax regret community will invest f 1 = 9/10 of theirmoney in the high risk, high return lottery fund, <strong>and</strong> only f 0 = 1/10 incash. Can this investment method be justified?OutcomeActionBuy Don’tbuyNo win −1 0Wins +9 0Table 36.2. Utility in the lotteryticket problem.OutcomeActionBuy Don’tbuyNo win 1 0Wins 0 9Table 36.3. Regret in the lotteryticket problem.Exercise 36.8. [3 ] Gambling oddities (from Cover <strong>and</strong> Thomas (1991)). A horserace involving I horses occurs repeatedly, <strong>and</strong> you are obliged to betall your money each time. Your bet at time t can be represented by

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!