05.01.2013 Views

Perceptual Coherence : Hearing and Seeing

Perceptual Coherence : Hearing and Seeing

Perceptual Coherence : Hearing and Seeing

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

(termed the amplitude envelopes) are the critical acoustic information.<br />

(Sometimes the term amplitude envelope refers to amplitude of the entire<br />

sound <strong>and</strong> not the amplitudes of individual frequencies.)<br />

However, before summarizing the evolving spectral envelope, it is worth<br />

considering the usefulness of pitch, loudness, <strong>and</strong> noisiness as perceptual<br />

cues. All give information about the size <strong>and</strong> physical construction of the<br />

object. Yet although pitch, loudness, <strong>and</strong> noisiness may be useful for the<br />

categorization of sounds, they seem restricted in their ability to allow<br />

the identification of specific objects. For example, the fundamental frequency<br />

of the vocal fold vibration is the most important cue for identifying<br />

male, female, <strong>and</strong> child speakers, <strong>and</strong> the pitch range <strong>and</strong> variation in loudness<br />

can provide additional information about the age <strong>and</strong> size of the<br />

source. But neither pitch nor loudness uniquely specifies an individual.<br />

Helmholtz (1877) describes the seamless transition between noise <strong>and</strong><br />

tones (<strong>and</strong> that was part of the argument in chapter 1). Sound sources can<br />

be understood to fall somewhere on that continuum. At one extreme, the<br />

sound is all noise, for example snare drums, fan noise, gurgling of fluids,<br />

<strong>and</strong> computer noise; at the other extreme, the sound is the superposition<br />

only of sinusoidal waves. Most sources fall between the ends: Flutes are<br />

characterized by a steady noise due to blowing across the mouthpiece<br />

opening; voices may be characterized by breathiness due to incomplete closure<br />

of the vocal folds; <strong>and</strong> other sources may be characterized by an initial<br />

noisy sound that evolves into a stronger pitch sound such as the initial<br />

metallic sounds for struck triangles <strong>and</strong> the initial scratchy bowing sounds<br />

before the bow catches the string <strong>and</strong> generates a musical note. But, again,<br />

noisiness seems restricted to identifying types of sounds.<br />

Although the amplitude envelope changes smoothly, we can operationally<br />

define three parts: (1) the onset, (2) the steady state, <strong>and</strong> (3) the decay.<br />

For each part, the goal is to derive acoustic properties that correlate<br />

with the perceived qualities of the sounds.<br />

The Onset of the Sound<br />

The Perception of Quality: Auditory Timbre 349<br />

The rise or attack time from the initiation of excitation until the sound<br />

reaches its maximum amplitude is determined by the manner of excitation<br />

<strong>and</strong> the damping of the vibration modes. If the damping is similar for all<br />

the modes, then the attack time for all the frequency components is similar<br />

<strong>and</strong> the amplitudes increase synchronously. If the damping differs, then<br />

the attack times of the components will differ, <strong>and</strong> the amplitudes of the<br />

components can follow significantly different trajectories. In spite of the<br />

possible differences in onset among the frequency components, the onset<br />

duration usually is measured by the time it takes for the entire sound to<br />

increase from its initiation to its maximum amplitude.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!