
Information Theory, Inference, and Learning ... - Inference Group



Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981
You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links.

Appendix A. Notation

What does $P(A \mid B, C)$ mean? $P(A \mid B, C)$ is pronounced ‘the probability that A is true given that B is true and C is true’. Or, more briefly, ‘the probability of A given B and C’. (See Chapter 2, p.22.)

What do log and ln mean? In this book, $\log x$ means the base-two logarithm, $\log_2 x$; $\ln x$ means the natural logarithm, $\log_e x$.

What does ŝ mean? Usually, a ‘hat’ over a variable denotes a guess or estimator. So ŝ is a guess at the value of s.

Integrals. There is no difference between $\int f(u)\, du$ and $\int du\, f(u)$. The integrand is $f(u)$ in both cases.

What does $\prod_{n=1}^{N}$ mean? This is like the summation $\sum_{n=1}^{N}$ but it denotes a product. It’s pronounced ‘product over n from 1 to N’. So, for example,

\[
\prod_{n=1}^{N} n = 1 \times 2 \times 3 \times \cdots \times N = N! = \exp\left[ \sum_{n=1}^{N} \ln n \right]. \tag{A.1}
\]

I like to choose the name of the free variable in a sum or a product (here, n) to be the lower case version of the range of the sum. So n usually runs from 1 to N, and m usually runs from 1 to M. This is a habit I learnt from Yaser Abu-Mostafa, and I think it makes formulae easier to understand.

What does $\binom{N}{n}$ mean? This is pronounced ‘N choose n’, and it is the number of ways of selecting an unordered set of n objects from a set of size N.

\[
\binom{N}{n} = \frac{N!}{(N-n)!\, n!}. \tag{A.2}
\]

This function is known as the combination function.

What is $\Gamma(x)$? The gamma function is defined by $\Gamma(x) \equiv \int_0^{\infty} du\, u^{x-1} e^{-u}$, for $x > 0$. The gamma function is an extension of the factorial function to real number arguments. In general, $\Gamma(x+1) = x\,\Gamma(x)$, and for integer arguments, $\Gamma(x+1) = x!$.
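The identities in (A.1) and (A.2), and the gamma-function recurrence, can be checked numerically. The following is a minimal Python sketch and is not part of the book; the helper names `product_one_to_N`, `factorial_via_logs` and `choose` are hypothetical names introduced for illustration, and only the standard `math` module is assumed.

```python
import math


def product_one_to_N(N: int) -> int:
    """Left-hand side of (A.1): the product of n for n = 1..N."""
    result = 1
    for n in range(1, N + 1):
        result *= n
    return result


def factorial_via_logs(N: int) -> float:
    """Right-hand side of (A.1): exp of the sum of ln n for n = 1..N."""
    return math.exp(sum(math.log(n) for n in range(1, N + 1)))


def choose(N: int, n: int) -> int:
    """'N choose n' as written in (A.2): N! / ((N - n)! n!)."""
    return math.factorial(N) // (math.factorial(N - n) * math.factorial(n))


if __name__ == "__main__":
    N = 10
    print("product      :", product_one_to_N(N))
    print("exp(sum ln n):", factorial_via_logs(N))
    print("5 choose 2   :", choose(5, 2))
    # Gamma(x + 1) = x! for integer x, and Gamma(x + 1) = x Gamma(x) in general.
    print("Gamma(5)     :", math.gamma(5), "vs 4! =", math.factorial(4))
    print("Gamma(3.5)   :", math.gamma(3.5), "vs 2.5 * Gamma(2.5) =", 2.5 * math.gamma(2.5))
```

The log-sum form returns a floating-point value, so it agrees with the exact integer product only up to rounding; it is useful mainly when N is large enough that working with ln N! is more convenient than N! itself.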
The digamma function is defined by $\Psi(x) \equiv \frac{d}{dx} \ln \Gamma(x)$.

For large x (for practical purposes, $0.1 \leq x \leq \infty$),

\[
\ln \Gamma(x) \simeq \left( x - \tfrac{1}{2} \right) \ln(x) - x + \tfrac{1}{2} \ln 2\pi + O(1/x); \tag{A.3}
\]
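As a rough numerical illustration of (A.3) and of the digamma definition, the sketch below compares the leading terms of the approximation with `math.lgamma` from the Python standard library, and estimates the derivative of ln Γ(x) by a central difference. It is not taken from the book: the function names `ln_gamma_stirling` and `digamma_numeric` and the step size `h` are assumptions made for this example.

```python
import math


def ln_gamma_stirling(x: float) -> float:
    """Leading terms of (A.3): (x - 1/2) ln x - x + (1/2) ln(2 pi)."""
    return (x - 0.5) * math.log(x) - x + 0.5 * math.log(2 * math.pi)


def digamma_numeric(x: float, h: float = 1e-5) -> float:
    """Central-difference estimate of d/dx ln Gamma(x); requires x - h > 0."""
    return (math.lgamma(x + h) - math.lgamma(x - h)) / (2 * h)


if __name__ == "__main__":
    # The gap between the two columns should shrink roughly like 1/x.
    for x in (0.1, 1.0, 10.0, 100.0):
        exact = math.lgamma(x)
        approx = ln_gamma_stirling(x)
        print(f"x={x:7.1f}  ln Gamma={exact:12.6f}  approx={approx:12.6f}  "
              f"error={exact - approx:.6f}")
    print("digamma estimate at x = 1:", digamma_numeric(1.0))
```

The finite-difference digamma is only a quick check; a dedicated routine (for example in a numerical library) would be preferable where accuracy matters.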
