10.01.2013 Views

DVD Demystified

DVD Demystified

DVD Demystified

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

102<br />

Birds Revisited: Understanding<br />

Audio Compression<br />

Chapter 3<br />

Audio takes up much less space than video, but uncompressed audio coupled<br />

with compressed video uses up a large percentage of the available<br />

bandwidth. Compressing the audio can result in a small loss of quality, but<br />

if the resulting space is used instead for video, it may improve the video<br />

quality significantly. In essence, reducing both the audio and the video is<br />

more effective. Usually video is compressed more than audio, since the ear<br />

is more sensitive to detail loss than the eye.<br />

Just as MPEG compression takes advantage of characteristics of the<br />

human eye, modern audio compression relies on detailed understanding of<br />

the human ear. This is called psychoacoustic or perceptual coding.<br />

Picture again your telephone conversation with a friend. Imagine that<br />

your friend lives near an airport, so that when a plane takes off, your friend<br />

cannot hear you over the sound of the airplane. In a situation like this, you<br />

quickly learn to stop talking when a plane is taking off, since your friend<br />

will not hear you. The airplane has masked the sound of your voice. At the<br />

opposite end of the loudness spectrum from airplane noise is background<br />

noise, such as a ticking clock. While you are speaking, your friend cannot<br />

hear the clock, but if you stop, then the background noise is no longer<br />

masked.<br />

The hairs in your inner ear are sensitive to sound pressure at different<br />

frequencies (pitches). When stimulated by a loud sound, they are incapable<br />

of sensing softer sounds at the same pitch. Because the hairs for similar frequencies<br />

are near each other, a stimulated audio receptor nerve will interfere<br />

with nearby receptors and cause them to be less sensitive. This is called<br />

frequency masking.<br />

Human hearing ranges roughly from low frequencies of 20 Hz to high<br />

frequencies of 20,000 Hz (20 kHz). The ear is most sensitive to the frequency<br />

range from about 2 to 5 kHz, which corresponds to the range of the<br />

human voice. Because aural sensitivity varies in a nonlinear fashion,<br />

sounds at some frequencies mask more neighboring sounds than at other<br />

frequencies. Experiments have established certain critical bands of varying<br />

size that correspond to the masking function of human hearing (Figure 3.6).<br />

Another characteristic of the human audio sensory system is that sounds<br />

cannot be sensed when they fall below a certain loudness (or amplitude).<br />

This sensitivity threshold, as with everything else, is not linear. In other<br />

words, the threshold is at louder or softer points at different frequencies.<br />

The overall threshold varies a little from person to person—some people<br />

have better hearing than others. The threshold of hearing is adaptive; the

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!