10.07.2015 Views

Information Theory, Inference, and Learning ... - Inference Group

Information Theory, Inference, and Learning ... - Inference Group

Information Theory, Inference, and Learning ... - Inference Group

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links.446 35 — R<strong>and</strong>om <strong>Inference</strong> TopicsNow, 2 10 = 1024 ≃ 10 3 = 1000, so without needing a calculator, we have10 log 2 ≃ 3 log 10 <strong>and</strong>p 1 ≃ 3 10 . (35.2)More generally, the probability that the first digit is d is(log(d + 1) − log(d))/(log 10 − log 1) = log 10 (1 + 1/d). (35.3)This observation about initial digits is known as Benford’s law.does not correspond to a uniform probability distribution over d.Ignorance✷⊲ Exercise 35.2. [2 ] A pin is thrown tumbling in the air. What is the probabilitydistribution of the angle θ 1 between the pin <strong>and</strong> the vertical at a momentwhile it is in the air? The tumbling pin is photographed. What is theprobability distribution of the angle θ 3 between the pin <strong>and</strong> the verticalas imaged in the photograph?P (9)10✻❄ 987654✻P (3)❄ 32✻P (1)❄ 1⊲ Exercise 35.3. [2 ] Record breaking. Consider keeping track of the world recordfor some quantity x, say earthquake magnitude, or longjump distancesjumped at world championships. If we assume that attempts to breakthe record take place at a steady rate, <strong>and</strong> if we assume that the underlyingprobability distribution of the outcome x, P (x), is not changing –an assumption that I think is unlikely to be true in the case of sportsendeavours, but an interesting assumption to consider nonetheless – <strong>and</strong>assuming no knowledge at all about P (x), what can be predicted aboutsuccessive intervals between the dates when records are broken?35.2 The Luria–Delbrück distribution[3C, p.449]Exercise 35.4. In their l<strong>and</strong>mark paper demonstrating that bacteriacould mutate from virus sensitivity to virus resistance, Luria <strong>and</strong> Delbrück(1943) wanted to estimate the mutation rate in an exponentially-growing populationfrom the total number of mutants found at the end of the experiment.This problem is difficult because the quantity measured (the numberof mutated bacteria) has a heavy-tailed probability distribution: a mutationoccuring early in the experiment can give rise to a huge number of mutants.Unfortunately, Luria <strong>and</strong> Delbrück didn’t know Bayes’ theorem, <strong>and</strong> their wayof coping with the heavy-tailed distribution involves arbitrary hacks leading totwo different estimators of the mutation rate. One of these estimators (basedon the mean number of mutated bacteria, averaging over several experiments)has appallingly large variance, yet sampling theorists continue to use it <strong>and</strong>base confidence intervals around it (Kepler <strong>and</strong> Oprea, 2001). In this exerciseyou’ll do the inference right.In each culture, a single bacterium that is not resistant gives rise, after ggenerations, to N = 2 g descendants, all clones except for differences arisingfrom mutations. The final culture is then exposed to a virus, <strong>and</strong> the numberof resistant bacteria n is measured. According to the now accepted mutationhypothesis, these resistant bacteria got their resistance from r<strong>and</strong>om mutationsthat took place during the growth of the colony. The mutation rate (per cellper generation), a, is about one in a hundred million. The total number ofopportunities to mutate is N, since ∑ g−1i=0 2i ≃ 2 g = N. If a bacterium mutatesat the ith generation, its descendants all inherit the mutation, <strong>and</strong> the finalnumber of resistant bacteria contributed by that one ancestor is 2 g−i .

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!