Information Theory, Inference, and Learning ... - MAELabs UCSD

More documents

Recommendations

Info

Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981 You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links. 112 6 — Stream Codes predictive probability distribution over ai given the sequence that has occurred thus far, P (xn = ai | x1, . . . , xn−1). The receiver has an identical program that produces the same predictive probability distribution P (xn = ai | x1, . . . , xn−1). 0.00 0.25 0.50 0.75 1.00 ✻ ❄ 01 Concepts for understanding arithmetic coding ✛ ✻ 0 ❄ ✻ 1 ❄ 01101 Notation for intervals. The interval [0.01, 0.10) is all numbers between 0.01 and 0.10, including 0.01˙0 ≡ 0.01000 . . . but not 0.10˙0 ≡ 0.10000 . . . . A binary transmission defines an interval within the real line from 0 to 1. For example, the string 01 is interpreted as a binary real number 0.01. . . , which corresponds to the interval [0.01, 0.10) in binary, i.e., the interval [0.25, 0.50) in base ten. The longer string 01101 corresponds to a smaller interval [0.01101, 0.01110). Because 01101 has the first string, 01, as a prefix, the new interval is a sub-interval of the interval [0.01, 0.10). A one-megabyte binary file (2 23 bits) is thus viewed as specifying a number between 0 and 1 to a precision of about two million decimal places – two million decimal digits, because each byte translates into a little more than two decimal digits. Now, we can also divide the real line [0,1) into I intervals of lengths equal to the probabilities P (x1 = ai), as shown in figure 6.2. 0.00 P (x1 = a1) P (x1 = a1) + P (x1 = a2) P (x1 = a1) + . . . + P (x1 = aI−1) 1.0 ✛ ✛ . ✻❄ a1 ✻ a2 ❄ . ✻❄ aI a2a1 a2a5 We may then take each interval ai and subdivide it into intervals denoted aia1, aia2, . . . , aiaI, such that the length of aiaj is proportional to P (x2 = aj | x1 = ai). Indeed the length of the interval aiaj will be precisely the joint probability P (x1 = ai, x2 = aj) = P (x1 = ai)P (x2 = aj | x1 = ai). (6.1) Iterating this procedure, the interval [0, 1) can be divided into a sequence of intervals corresponding to all possible finite length strings x1x2 . . . xN, such that the length of an interval is equal to the probability of the string given our model. Figure 6.1. Binary strings define real intervals within the real line [0,1). We first encountered a picture like this when we discussed the symbol-code supermarket in Chapter 5. Figure 6.2. A probabilistic model defines real intervals within the real line [0,1).
Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981 You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links. 6.2: Arithmetic codes 113 u := 0.0 v := 1.0 p := v − u for n = 1 to N { Compute the cumulative probabilities Qn and Rn (6.2, 6.3) v := u + pRn(xn | x1, . . . , xn−1) u := u + pQn(xn | x1, . . . , xn−1) p := v − u } Formulae describing arithmetic coding The process depicted in figure 6.2 can be written explicitly as follows. The intervals are defined in terms of the lower and upper cumulative probabilities Qn(ai | x1, . . . , xn−1) ≡ Rn(ai | x1, . . . , xn−1) ≡ i−1 P (xn = ai ′ | x1, . . . , xn−1), (6.2) i ′ = 1 i P (xn = ai ′ | x1, . . . , xn−1). (6.3) i ′ = 1 As the nth symbol arrives, we subdivide the n−1th interval at the points defined by Qn and Rn. For example, starting with the first symbol, the intervals ‘a1’, ‘a2’, and ‘aI’ are and a1 ↔ [Q1(a1), R1(a1)) = [0, P (x1 = a1)), (6.4) a2 ↔ [Q1(a2), R1(a2)) = [P (x = a1), P (x = a1) + P (x = a2)) , (6.5) aI ↔ [Q1(aI), R1(aI)) = [P (x1 = a1) + . . . + P (x1 = aI−1), 1.0) . (6.6) Algorithm 6.3 describes the general procedure. To encode a string x1x2 . . . xN, we locate the interval corresponding to x1x2 . . . xN , and send a binary string whose interval lies within that interval. This encoding can be performed on the fly, as we now illustrate. Example: compressing the tosses of a bent coin Imagine that we watch as a bent coin is tossed some number of times (cf. example 2.7 (p.30) and section 3.2 (p.51)). The two outcomes when the coin is tossed are denoted a and b. A third possibility is that the experiment is halted, an event denoted by the ‘end of file’ symbol, ‘✷’. Because the coin is bent, we expect that the probabilities of the outcomes a and b are not equal, though beforehand we don’t know which is the more probable outcome. Encoding Let the source string be ‘bbba✷’. We pass along the string one symbol at a time and use our model to compute the probability distribution of the next Algorithm 6.3. Arithmetic coding. Iterative procedure to find the interval [u, v) for the string x1x2 . . . xN .
Page 1 and 2:
Copyright Cambridge University Pres
Page 3 and 4:
Page 5 and 6:
Page 7 and 8:
Page 9 and 10:
Page 11 and 12:
Page 13 and 14:
Page 15 and 16:
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
Page 39 and 40:
Page 41 and 42:
Page 43 and 44:
Page 45 and 46:
Page 47 and 48:
Page 49 and 50:
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
Page 61 and 62:
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
Page 71 and 72:
Page 73 and 74: Copyright Cambridge University Pres
Page 123: Copyright Cambridge University Pres
Page 175 and 176:
Page 177 and 178:
Page 179 and 180:
Page 181 and 182:
Page 183 and 184:
Page 185 and 186:
Page 187 and 188:
Page 189 and 190:
Page 191 and 192:
Page 193 and 194:
Page 195 and 196:
Page 197 and 198:
Page 199 and 200:
Page 201 and 202:
Page 203 and 204:
Page 205 and 206:
Page 207 and 208:
Page 209 and 210:
Page 211 and 212:
Page 213 and 214:
Page 215 and 216:
Page 217 and 218:
Page 219 and 220:
Page 221 and 222:
Page 223 and 224:
Page 225 and 226:
Page 227 and 228:
Page 229 and 230:
Page 231 and 232:
Page 233 and 234:
Page 235 and 236:
Page 237 and 238:
Page 239 and 240:
Page 241 and 242:
Page 243 and 244:
Page 245 and 246:
Page 247 and 248:
Page 249 and 250:
Page 251 and 252:
Page 253 and 254:
Page 255 and 256:
Page 257 and 258:
Page 259 and 260:
Page 261 and 262:
Page 263 and 264:
Page 265 and 266:
Page 267 and 268:
Page 269 and 270:
Page 271 and 272:
Page 273 and 274:
Page 275 and 276:
Page 277 and 278:
Page 279 and 280:
Page 281 and 282:
Page 283 and 284:
Page 285 and 286:
Page 287 and 288:
Page 289 and 290:
Page 291 and 292:
Page 293 and 294:
Page 295 and 296:
Page 297 and 298:
Page 299 and 300:
Page 301 and 302:
Page 303 and 304:
Page 305 and 306:
Page 307 and 308:
Page 309 and 310:
Page 311 and 312:
Page 313 and 314:
Page 315 and 316:
Page 317 and 318:
Page 319 and 320:
Page 321 and 322:
Page 323 and 324:
Page 325 and 326:
Page 327 and 328:
Page 329 and 330:
Page 331 and 332:
Page 333 and 334:
Page 335 and 336:
Page 337 and 338:
Page 339 and 340:
Page 341 and 342:
Page 343 and 344:
Page 345 and 346:
Page 347 and 348:
Page 349 and 350:
Page 351 and 352:
Page 353 and 354:
Page 355 and 356:
Page 357 and 358:
Page 359 and 360:
Page 361 and 362:
Page 363 and 364:
Page 365 and 366:
Page 367 and 368:
Page 369 and 370:
Page 371 and 372:
Page 373 and 374:
Page 375 and 376:
Page 377 and 378:
Page 379 and 380:
Page 381 and 382:
Page 383 and 384:
Page 385 and 386:
Page 387 and 388:
Page 389 and 390:
Page 391 and 392:
Page 393 and 394:
Page 395 and 396:
Page 397 and 398:
Page 399 and 400:
Page 401 and 402:
Page 403 and 404:
Page 405 and 406:
Page 407 and 408:
Page 409 and 410:
Page 411 and 412:
Page 413 and 414:
Page 415 and 416:
Page 417 and 418:
Page 419 and 420:
Page 421 and 422:
Page 423 and 424:
Page 425 and 426:
Page 427 and 428:
Page 429 and 430:
Page 431 and 432:
Page 433 and 434:
Page 435 and 436:
Page 437 and 438:
Page 439 and 440:
Page 441 and 442:
Page 443 and 444:
Page 445 and 446:
Page 447 and 448:
Page 449 and 450:
Page 451 and 452:
Page 453 and 454:
Page 455 and 456:
Page 457 and 458:
Page 459 and 460:
Page 461 and 462:
Page 463 and 464:
Page 465 and 466:
Page 467 and 468:
Page 469 and 470:
Page 471 and 472:
Page 473 and 474:
Page 475 and 476:
Page 477 and 478:
Page 479 and 480:
Page 481 and 482:
Page 483 and 484:
Page 485 and 486:
Page 487 and 488:
Page 489 and 490:
Page 491 and 492:
Page 493 and 494:
Page 495 and 496:
Page 497 and 498:
Page 499 and 500:
Page 501 and 502:
Page 503 and 504:
Page 505 and 506:
Page 507 and 508:
Page 509 and 510:
Page 511 and 512:
Page 513 and 514:
Page 515 and 516:
Page 517 and 518:
Page 519 and 520:
Page 521 and 522:
Page 523 and 524:
Page 525 and 526:
Page 527 and 528:
Page 529 and 530:
Page 531 and 532:
Page 533 and 534:
Page 535 and 536:
Page 537 and 538:
Page 539 and 540:
Page 541 and 542:
Page 543 and 544:
Page 545 and 546:
Page 547 and 548:
Page 549 and 550:
Page 551 and 552:
Page 553 and 554:
Page 555 and 556:
Page 557 and 558:
Page 559 and 560:
Page 561 and 562:
Page 563 and 564:
Page 565 and 566:
Page 567 and 568:
Page 569 and 570:
Page 571 and 572:
Page 573 and 574:
Page 575 and 576:
Page 577 and 578:
Page 579 and 580:
Page 581 and 582:
Page 583 and 584:
Page 585 and 586:
Page 587 and 588:
Page 589 and 590:
Page 591 and 592:
Page 593 and 594:
Page 595 and 596:
Page 597 and 598:
Page 599 and 600:
Page 601 and 602:
Page 603 and 604:
Page 605 and 606:
Page 607 and 608:
Page 609 and 610:
Page 611 and 612:
Page 613 and 614:
Page 615 and 616:
Page 617 and 618:
Page 619 and 620:
Page 621 and 622:
Page 623 and 624:
Page 625 and 626:
Page 627 and 628:
Page 629 and 630:
Page 631 and 632:
Page 633 and 634:
Page 635 and 636:
Page 637 and 638:
Page 639 and 640:
show all

Information Theory, Inference, and Learning ... - MAELabs UCSD

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?