28.02.2013 Views

Introduction to Acoustics

Introduction to Acoustics

Introduction to Acoustics

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

with equalization, without incurring unduly high-power<br />

requirements.<br />

In general, most musical and speech energy is concentrated<br />

in a subset of the audio band, say from 100 Hz<br />

<strong>to</strong> 10 kHz, allowing the majority of audio signals <strong>to</strong> be<br />

satisfac<strong>to</strong>rily reproduced with modestly compromised<br />

system bandwidth, <strong>to</strong> the benefit of size, cost, and<br />

bulk.<br />

18.3.2 Amplitude Response Variation<br />

From the basic amplitude acuity of the ear, it is logical<br />

<strong>to</strong> require that the pass-band deviation in frequency<br />

response be less than an amplitude JND, say within<br />

± 0.25 dB of flat. Of course, this is the desired net response<br />

of an entire signal chain, and if deviations are<br />

known <strong>to</strong> exist in one part of the chain, it may be possible<br />

<strong>to</strong> apply compensating equalization elsewhere in the<br />

chain.<br />

As with the quest for full audio bandwidth, early<br />

audio systems for many years had trouble meeting this<br />

requirement, in part from mechanical limitations. Amplifiers<br />

again have traditionally had the easiest time<br />

meeting this specification, and modern-day DSP is<br />

limited only by the precision of the analog/digital<br />

converters, which is generally quite good in this regard,<br />

plus any deliberate signal-processing spectral<br />

modifications.<br />

The worst offender is many cases is the listening<br />

room, where echoes and resonances are likely <strong>to</strong><br />

make nearly any sine-wave frequency response plot<br />

a jumble of peaks and dips covering many dB. The threedimensionality<br />

of the room and the fact that echoes arrive<br />

from diverse directions work <strong>to</strong> reduce the audibility of<br />

many of these measured deviations.<br />

After rooms, loudspeakers have the hardest time<br />

meeting this flat-response requirement, being mechanical<br />

devices and having <strong>to</strong> handle a lot of power and<br />

move a lot of air. In order <strong>to</strong> cover the bulk of the audio<br />

band, loudspeaker designs typically employ multiple<br />

drivers, each optimized for a portion of the audio band,<br />

but this in turn introduces potential response deviations<br />

from required crossover networks.<br />

In considering reasonable values for most audio<br />

specifications based on psychophysical performance, it<br />

is prudent <strong>to</strong> consider both monotic and dichotic performance;<br />

that is, each ear considered in isolation, and then<br />

the binaural performance of the two ears <strong>to</strong>gether. In the<br />

case of amplitude response variation, similar JNDs apply,<br />

so there is no compelling reason <strong>to</strong> differentiate the<br />

situations in this case.<br />

Audio and Electroacoustics 18.3 Audio Specifications 753<br />

18.3.3 Phase Response<br />

In the course of passing through an audio component, an<br />

audio signal may encounter delays, which may be frequency<br />

dependent, and may therefore be categorized as<br />

either frequency-dependent time delays, group delays,<br />

or phase shifts. It is therefore prudent <strong>to</strong> establish preferred<br />

perceptual limits on the allowable deviations of<br />

these quantities.<br />

Of course, a pure wide-band time delay will be manifested<br />

as a linear change in phase shift proportional <strong>to</strong><br />

frequency, so <strong>to</strong> separate the effects of time delay and<br />

phase shift, so the latter (“frequency-differential singlechannel<br />

phase shift”) may commonly be unders<strong>to</strong>od<br />

<strong>to</strong> represent the deviation from a linear-with-frequency<br />

phase characteristic.<br />

It is a little difficult <strong>to</strong> be completely definitive<br />

about a phase-shift specification, in part because there<br />

are multiple neural timing mechanisms involved. For<br />

a sound channel considered in isolation, phase perception<br />

is rather weak. There does not seem <strong>to</strong> be an explicit<br />

means of comparing the fine-grain phase or timing of<br />

signals at significantly different frequencies (more than<br />

a critical bandwidth), at least for steady-state signals.<br />

This is sometimes used as the basis of a claim that the<br />

ear is virtually phase deaf, although that is probably<br />

an overstatement. Still, if one constructs a steady-state<br />

signal with widely space spectral components that are<br />

in slow precession, say a combination of 100 Hz and<br />

501 Hz (360 ◦ per second relative phase precession), the<br />

resulting percept is almost entirely constant.<br />

So under what conditions are phase shifts audible?<br />

There seem <strong>to</strong> be at least three such mechanisms: fusion<br />

time, transient fusion time, and critical band envelopes.<br />

As noted earlier, fusion time refers <strong>to</strong> the audio event<br />

integration time of the ear, generally in the range of<br />

20–50 ms. Separate audio events that occur farther apart<br />

than the fusion time will generally be heard as separate.<br />

If a spectrally complex signal is put through a system<br />

where some frequencies are phase- or time-shifted by<br />

more than the fusion time, the signal may temporally<br />

defuse, rendering the phase shift quite audible. (A 20 ms<br />

phase delay at 30 Hz corresponds <strong>to</strong> 216 degrees of phase<br />

shift, not an unrealistically large value.) This can occur,<br />

for example, if a sharp pulse is passed through a thirdorder,<br />

1/3-octave filter bank tuned for flat magnitude<br />

response. What starts out as a click will emerge as a<br />

“boing”, quite audibly distinct from the source.<br />

Transient fusion time refers <strong>to</strong> the shortest interval<br />

the ear can differentiate temporally, as exemplified by<br />

the premasking interval of about 1–2 ms. If a sharp tran-<br />

Part E 18.3

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!