08.01.2013 Views

DigitalVideoAndHDTVAlgorithmsAndInterfaces.pdf

DigitalVideoAndHDTVAlgorithmsAndInterfaces.pdf

DigitalVideoAndHDTVAlgorithmsAndInterfaces.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Figure 38.3 JPEG block<br />

diagram shows the encoder<br />

(at the top), which performs<br />

the discrete cosine transform<br />

(DCT), quantization (Q), and<br />

variable-length encoding<br />

(VLE). The decoder (at the<br />

bottom) performs the<br />

inverse of each of these<br />

operations, in reverse order.<br />

Inverse quantization (IQ) has<br />

no relation to IQ color difference<br />

components.<br />

DISCRETE<br />

COSINE<br />

TRANSFORM QUANTIZE<br />

DCT Q VLE<br />

INVERSE<br />

DCT<br />

DCT -1<br />

DCT -1<br />

INVERSE<br />

QUANTIZE<br />

Q -1<br />

Q -1<br />

VARIABLE-<br />

LENGTH<br />

ENCODE<br />

VARIABLE-<br />

LENGTH<br />

DECODE<br />

VLE -1<br />

VLE -1<br />

In principle, the DCT algorithm could be applied to any<br />

block size, from 2×2 up to the size of the whole image,<br />

perhaps 512×512. (DCT is most efficient when applied<br />

to a matrix whose dimensions are powers of two.) The<br />

choice of 8×8 blocks of luma for the application of DCT<br />

in video represents a compromise between a block size<br />

small enough to minimize storage and processing overheads,<br />

but large enough to effectively exploit image<br />

redundancy.<br />

The DCT operation discards picture information to<br />

which vision is insensitive. Surprisingly, though, the<br />

JPEG standard itself makes no reference to perceptual<br />

uniformity. Because JPEG’s goal is to represent visually<br />

important information, it is important that so-called<br />

RGB values presented to the JPEG algorithm are first<br />

subject to a nonlinear transform such as that outlined in<br />

Nonlinear image coding, on page 12, that mimics vision.<br />

JPEG block diagram<br />

The JPEG block diagram in Figure 38.3 shows, at the<br />

top, the three main blocks of a JPEG encoder: the<br />

discrete cosine transform (DCT) computation (sometimes<br />

called forward DCT, FDCT), quantization (Q), and<br />

variable-length encoding (VLE). The decoder (at the<br />

bottom of Figure 38.3) performs the inverse of each of<br />

these operations, in reverse order. The inverse DCT is<br />

sometimes denoted IDCT; inverse quantization is sometimes<br />

denoted IQ.<br />

450 DIGITAL VIDEO AND HDTV ALGORITHMS AND INTERFACES

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!