28.02.2013 Views

Introduction to Acoustics

Introduction to Acoustics

Introduction to Acoustics

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

and ‘s’ in English text – may be assigned special,<br />

short transmission symbols <strong>to</strong> reduce the average<br />

data rate. Representative techniques which employ<br />

this include arithmetic coding and Huffman coding.<br />

6. Interchannel difference coding: if two or more<br />

channels are used and are sufficiently similar, datarate<br />

savings may be attained by sending sum and<br />

difference information, or a variant thereof. The expectation<br />

is that the difference values will be small,<br />

and can be coded in fewer bits using technique (4),<br />

above. In the case of stereo content, this approach is<br />

sometimes referred <strong>to</strong> as mid/side (M/S) coding.<br />

In addition <strong>to</strong> the techniques described above (and<br />

others), a lossless coder may either directly code timedomain<br />

PCM samples, or may instead use running<br />

transforms and operate in the frequency domain. In the<br />

latter case, a specialized bit-exact transform may be used<br />

<strong>to</strong> guarantee that the final output is an exact clone of the<br />

input.<br />

A fully realized lossless coder is likely <strong>to</strong> use<br />

a combination of techniques like those described above,<br />

requiring a flexible and perhaps elaborate bit stream<br />

syntax. The process of selecting which coding options<br />

will be brought <strong>to</strong> bear on a given block of samples<br />

is often not readily apparent from inspection or simple<br />

calculation, so the encoder may be required <strong>to</strong> try an exhaustive<br />

search of all possibilities for each block in order<br />

<strong>to</strong> select the best option. Needless <strong>to</strong> say, this can make<br />

for a rather slow-running encoder, but usually does not<br />

affect the speed of the decoder.<br />

As may be evident, the effectiveness of a lossless<br />

coder is very much signal dependent: some signals can<br />

be compressed considerably more than others, and some,<br />

such as full-level white noise, basically cannot be losslessly<br />

compressed at all. Typical compression ratios run<br />

in the range of 2:1 <strong>to</strong> 3:1. Because lossless compression<br />

is inherently a variable bitrate (VBR) coding technology,<br />

it is not well suited <strong>to</strong> real-time streaming applications,<br />

such as broadcast or fixed-speed digital tape.<br />

One weakness of lossless coding, and indeed of lossy<br />

coding as well, is that an error of even a single bit (in,<br />

say, one of the control codes specifying which method<br />

<strong>to</strong> use <strong>to</strong> unpack a block of data) can cause the loss of<br />

significant audio data. Some care must be used in the design<br />

of the bitstream <strong>to</strong> ensure that re-synchronization<br />

of the decoder in the face of data errors is reliably possible<br />

within an acceptably short period. More generally,<br />

the complexity and low error <strong>to</strong>lerance of almost any<br />

data-reduction algorithm raises the regrettable possibility<br />

that digital records, audio and otherwise, may be lost<br />

Audio and Electroacoustics 18.5 Digital Audio 773<br />

<strong>to</strong> future generations unless care is taken <strong>to</strong> avoid data<br />

errors, and <strong>to</strong> ensure that decoding algorithms are carefully<br />

documented and preserved in full detail. This is<br />

especially critical in the case of proprietary algorithms<br />

owned and maintained by commercial entities, whose<br />

long-term future operation may not be assured.<br />

Perceptual/Lossy Audio Coding<br />

While one might wish that lossless coding be used<br />

exclusively for digital audio compression, its limited<br />

performance and variable bitrate render it inappropriate<br />

for a large number of coding applications. For moreaggressive<br />

compression and acceptable performance<br />

within the constraint of constant bit rate, one must resort<br />

<strong>to</strong> the use of perceptual audio coding.<br />

The goal of perceptual audio coding is <strong>to</strong> preserve<br />

and convey all of the perceptual aspects of audio content,<br />

without regard <strong>to</strong> preserving the signal literally, usually<br />

employing the fewest bits possible. Recall that the data<br />

rate for a standard audio CD was about 1.4 Mbit/s. As<br />

this is written, the lowest data rate in commercial use for<br />

conveying high-quality stereo audio is no higher than 48<br />

Kbits/s, used by satellite radio and for Internet streaming,<br />

a data-compression ratio on the order of 30:1 or<br />

better.<br />

How is this accomplished? For starters, the use<br />

of lossless coding is not retained and incorporated in<br />

overall lossy algorithms. Every packet of information<br />

generated and reduced <strong>to</strong> its ultimate compactness by<br />

any part of a perceptual coder is a candidate for further<br />

reduction in size if one or more lossless techniques can<br />

be brought <strong>to</strong> bear effectively.<br />

Beyond that, the keys <strong>to</strong> high-efficiency perceptual<br />

coding are <strong>to</strong>:<br />

1. suppress any audio components not contributing <strong>to</strong><br />

the ultimate perception,<br />

2. ensure that all the audible components are conveyed<br />

with no alteration in perceived quality, while using<br />

the minimum possible number of bits, and<br />

3. employ, as much as possible, high-level abstractions<br />

<strong>to</strong> describe audio signals in purely perceptual terms,<br />

while retaining enough precision and detail <strong>to</strong> allow<br />

the decoder <strong>to</strong> accurately reconstitute a signal<br />

indistinguishable from the original encoder input<br />

signal.<br />

One fundamental building block that has almost always<br />

been part of the foundation of perceptual coders is a digital<br />

filter bank. This, of course, mirrors the functionality<br />

of the basilar membrane; and the frequency-domain out-<br />

Part E 18.5

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!