Information Theory, Inference, and Learning ... - Inference Group

More documents

Recommendations

Info

Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links.312 23 — Useful Probability Distributions23.2 Distributions over unbounded real numbersGaussian, Student, Cauchy, biexponential, inverse-cosh.The Gaussian distribution or normal distribution with mean µ and standarddeviation σ isP (x | µ, σ) = 1 ( )Z exp (x − µ)2−2σ 2 x ∈ (−∞, ∞), (23.5)whereZ = √ 2πσ 2 . (23.6)It is sometimes useful to work with the quantity τ ≡ 1/σ 2 , which is called theprecision parameter of the Gaussian.A sample z from a standard univariate Gaussian can be generated bycomputingz = cos(2πu 1 ) √ 2 ln(1/u 2 ), (23.7)where u 1 and u 2 are uniformly distributed in (0, 1). A second sample z 2 =sin(2πu 1 ) √ 2 ln(1/u 2 ), independent of the first, can then be obtained for free.The Gaussian distribution is widely used and often asserted to be a verycommon distribution in the real world, but I am sceptical about this assertion.Yes, unimodal distributions may be common; but a Gaussian is a special,rather extreme, unimodal distribution. It has very light tails: the logprobability-densitydecreases quadratically. The typical deviation of x from µis σ, but the respective probabilities that x deviates from µ by more than 2σ,3σ, 4σ, and 5σ, are 0.046, 0.003, 6 × 10 −5 , and 6 × 10 −7 . In my experience,deviations from a mean four or five times greater than the typical deviationmay be rare, but not as rare as 6 × 10 −5 ! I therefore urge caution in the use ofGaussian distributions: if a variable that is modelled with a Gaussian actuallyhas a heavier-tailed distribution, the rest of the model will contort itself toreduce the deviations of the outliers, like a sheet of paper being crushed by arubber band.⊲ Exercise 23.1. [1 ] Pick a variable that is supposedly bell-shaped in probabilitydistribution, gather data, and make a plot of the variable’s empiricaldistribution. Show the distribution as a histogram on a log scale andinvestigate whether the tails are well-modelled by a Gaussian distribution.[One example of a variable to study is the amplitude of an audiosignal.]One distribution with heavier tails than a Gaussian is a mixture of Gaussians.A mixture of two Gaussians, for example, is defined by two means,two standard deviations, and two mixing coefficients π 1 and π 2 , satisfyingπ 1 + π 2 = 1, π i ≥ 0.P (x | µ 1 , σ 1 , π 1 , µ 2 , σ 2 , π 2 ) = π )1√ exp(− (x−µ1)2 + π )2√ exp(− (x−µ2)2 .2πσ1 2πσ2If we take an appropriately weighted mixture of an infinite number ofGaussians, all having mean µ, we obtain a Student-t distribution,whereP (x | µ, s, n) = 1 Z2σ 2 12σ 2 21(1 + (x − µ) 2 /(ns 2 , (23.8)))(n+1)/2 Z = √ πns 2 Γ(n/2)Γ((n + 1)/2)(23.9)0.50.40.30.20.10-2 0 2 4 6 80.10.010.0010.0001-2 0 2 4 6 8Figure 23.3. Three unimodaldistributions. Two Studentdistributions, with parameters(m, s) = (1, 1) (heavy line) (aCauchy distribution) and (2, 4)(light line), and a Gaussiandistribution with mean µ = 3 andstandard deviation σ = 3 (dashedline), shown on linear verticalscales (top) and logarithmicvertical scales (bottom). Noticethat the heavy tails of the Cauchydistribution are scarcely evidentin the upper ‘bell-shaped curve’.
Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links.23.3: Distributions over positive real numbers 313and n is called the number of degrees of freedom and Γ is the gamma function.If n > 1 then the Student distribution (23.8) has a mean and that mean isµ. If n > 2 the distribution also has a finite variance, σ 2 = ns 2 /(n − 2).As n → ∞, the Student distribution approaches the normal distribution withmean µ and standard deviation s. The Student distribution arises both inclassical statistics (as the sampling-theoretic distribution of certain statistics)and in Bayesian inference (as the probability distribution of a variable comingfrom a Gaussian distribution whose standard deviation we aren’t sure of).In the special case n = 1, the Student distribution is called the Cauchydistribution.A distribution whose tails are intermediate in heaviness between Studentand Gaussian is the biexponential distribution,P (x | µ, s) = 1 ( )Z exp |x − µ|− x ∈ (−∞, ∞) (23.10)swhereZ = 2s. (23.11)The inverse-cosh distributionP (x | β) ∝1[cosh(βx)] 1/β (23.12)is a popular model in independent component analysis. In the limit of large β,the probability distribution P (x | β) becomes a biexponential distribution. Inthe limit β → 0 P (x | β) approaches a Gaussian with mean zero and variance1/β.23.3 Distributions over positive real numbersExponential, gamma, inverse-gamma, and log-normal.The exponential distribution,P (x | s) = 1 (Z exp − x )x ∈ (0, ∞), (23.13)swhereZ = s, (23.14)arises in waiting problems. How long will you have to wait for a bus in Poissonville,given that buses arrive independently at random with one every sminutes on average? Answer: the probability distribution of your wait, x, isexponential with mean s.The gamma distribution is like a Gaussian distribution, except whereas theGaussian goes from −∞ to ∞, gamma distributions go from 0 to ∞. Just asthe Gaussian distribution has two parameters µ and σ which control the meanand width of the distribution, the gamma distribution has two parameters. Itis the product of the one-parameter exponential distribution (23.13) with apolynomial, x c−1 . The exponent c in the polynomial is the second parameter.P (x | s, c) = Γ(x; s, c) =1 Z( x) c−1exp (− x ), 0 ≤ x < ∞ (23.15)s swhereZ = Γ(c)s. (23.16)
Page 1 and 2:
Copyright Cambridge University Pres
Page 3 and 4:
Page 5 and 6:
Page 7 and 8:
Page 9 and 10:
Page 11 and 12:
Page 13 and 14:
Page 15 and 16:
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
Page 47 and 48:
Page 49 and 50:
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
Page 61 and 62:
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76:
Page 77 and 78:
Page 80 and 81:
Page 82 and 83:
Page 84 and 85:
Page 86 and 87:
Page 88 and 89:
Page 90 and 91:
Page 92 and 93:
Page 94 and 95:
Page 96 and 97:
Page 98 and 99:
Page 100 and 101:
Page 102 and 103:
Page 104 and 105:
Page 106 and 107:
Page 108 and 109:
¥¡¥¡¥¡¥¡¥¡¥¡¥¡¥¡¥
Page 110 and 111:
Page 112 and 113:
Page 114 and 115:
Page 116 and 117:
Page 118 and 119:
Page 120 and 121:
Page 122 and 123:
Page 124 and 125:
Page 126 and 127:
Page 128 and 129:
Page 130 and 131:
Page 132 and 133:
Page 134 and 135:
Page 136 and 137:
Page 138 and 139:
Page 140 and 141:
Page 142 and 143:
Page 144 and 145:
Page 146 and 147:
Page 148 and 149:
Page 150 and 151:
Page 152 and 153:
Page 154 and 155:
Page 156 and 157:
Page 158 and 159:
Page 160 and 161:
Page 162 and 163:
Page 164 and 165:
Page 166 and 167:
Page 168 and 169:
Page 170 and 171:
Page 172 and 173:
Page 174 and 175:
Page 176 and 177:
Page 178 and 179:
Page 180 and 181:
Page 182 and 183:
Page 184 and 185:
Page 186 and 187:
Page 188 and 189:
Page 190 and 191:
Page 192 and 193:
Page 194 and 195:
Page 196 and 197:
Page 198 and 199:
Page 200 and 201:
Page 202 and 203:
Page 204 and 205:
Page 206 and 207:
Page 208 and 209:
Page 210 and 211:
Page 212 and 213:
Page 214 and 215:
Page 216 and 217:
Page 218 and 219:
Page 220 and 221:
Page 222 and 223:
Page 224 and 225:
Page 226 and 227:
Page 228 and 229:
Page 230 and 231:
Page 232 and 233:
Page 234 and 235:
Page 236 and 237:
Page 238 and 239:
Page 240 and 241:
Page 242 and 243:
Page 244 and 245:
Page 246 and 247:
Page 248 and 249:
Page 250 and 251:
Page 252 and 253:
Page 254 and 255:
Page 256 and 257:
Page 258 and 259:
Page 260 and 261:
Page 262 and 263:
Page 264 and 265:
Page 266 and 267:
Page 268 and 269:
Page 270 and 271:
Page 272 and 273:
Page 274 and 275: Copyright Cambridge University Pres
Page 374 and 375:
Page 376 and 377:
Page 378 and 379:
Page 380 and 381:
Page 382 and 383:
Page 384 and 385:
Page 386 and 387:
Page 388 and 389:
Page 390 and 391:
Page 392 and 393:
Page 394 and 395:
Page 396 and 397:
Page 398 and 399:
Page 400 and 401:
Page 402 and 403:
Page 404 and 405:
Page 406 and 407:
Page 408 and 409:
Page 410 and 411:
Page 412 and 413:
Page 414 and 415:
Page 416 and 417:
Page 418 and 419:
Page 420 and 421:
Page 422 and 423:
Page 424 and 425:
Page 426 and 427:
Page 428 and 429:
Page 430 and 431:
Page 432 and 433:
Page 434 and 435:
Page 436 and 437:
Page 438 and 439:
Page 440 and 441:
Page 442 and 443:
Page 444 and 445:
Page 446 and 447:
Page 448 and 449:
Page 450 and 451:
Page 452 and 453:
Page 454 and 455:
Page 456 and 457:
Page 458 and 459:
Page 460 and 461:
Page 462 and 463:
Page 464 and 465:
Page 466 and 467:
Page 468 and 469:
Page 470 and 471:
Page 472 and 473:
Page 474 and 475:
Page 476 and 477:
Page 478 and 479:
Page 480 and 481:
Page 482 and 483:
Page 484 and 485:
Page 486 and 487:
Page 488 and 489:
Page 490 and 491:
Page 492 and 493:
Page 494 and 495:
Page 496 and 497:
Page 498 and 499:
Page 500 and 501:
Page 502 and 503:
Page 504 and 505:
Page 506 and 507:
Page 508 and 509:
Page 510 and 511:
Page 512 and 513:
Page 514 and 515:
¡¤¢¢¤¨©¢£¡¢£¨¢£ ©Co
Page 516 and 517:
Page 518 and 519:
Page 520 and 521:
Page 522 and 523:
Page 524 and 525:
Page 526 and 527:
Page 528 and 529:
Page 530 and 531:
Page 532 and 533:
Page 534 and 535:
Page 536 and 537:
Page 538 and 539:
Page 540 and 541:
Page 542 and 543:
Page 544 and 545:
Page 546 and 547:
Page 548 and 549:
Page 550 and 551:
Page 552 and 553:
Page 554 and 555:
ttttCopyright Cambridge University
Page 556 and 557:
Page 558 and 559:
Page 560 and 561:
Page 562 and 563:
Page 564 and 565:
Page 566 and 567:
Page 568 and 569:
Page 570 and 571:
Page 572 and 573:
Page 574 and 575:
Page 576 and 577:
Page 578 and 579:
Page 580 and 581:
Page 582 and 583:
Page 584 and 585:
Page 586 and 587:
Page 588 and 589:
Page 590 and 591:
Page 592 and 593:
Page 594 and 595:
Page 596 and 597:
Page 598 and 599:
Page 600 and 601:
Page 602 and 603:
Page 604 and 605:
Page 606 and 607:
Page 608 and 609:
Page 610 and 611:
Page 612 and 613:
Page 614 and 615:
Page 616 and 617:
Page 618 and 619:
Page 620 and 621:
Page 622 and 623:
Page 624 and 625:
Page 626 and 627:
Page 628 and 629:
Page 630 and 631:
Page 632 and 633:
Page 634 and 635:
Page 636 and 637:
Page 638 and 639:
Page 640:
show all

Information Theory, Inference, and Learning ... - Inference Group

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?