28.02.2013 Views

Introduction to Acoustics

Introduction to Acoustics

Introduction to Acoustics

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

676 Part E Music, Speech, Electroacoustics<br />

Part E 16.2<br />

p. 36]). An example is presented in Fig. 16.10 which<br />

shows mean Ps for five dramatic and five lyric sopranos<br />

when they sang pp and ff <strong>to</strong>nes throughout their<br />

range [16.21]. The bars represent one standard deviation.<br />

The dashed curve is the threshold pressure, i. e. the<br />

lowest pressure that produced vocal fold vibration, for<br />

a female voice according <strong>to</strong> Titze [16.25, 26].<br />

The singers mostly used more than twice as high Ps<br />

values when they were singing their loudest as compared<br />

with their softest <strong>to</strong>nes. The subjects reach up <strong>to</strong> 40 cm<br />

H2O. This can be compared with values typical of normal<br />

adult conversational speech, which tend <strong>to</strong> occur in<br />

the range of 5–10 cm H2O. However, loud speech and<br />

particularly stage speech may occasionally approach the<br />

higher values for singing.<br />

Also interesting is the fact that lyric sopranos use significantly<br />

lower Ps values than dramatic sopranos both<br />

in soft and loud phonation. It seems likely that this difference<br />

reflects difference in the mechanical properties<br />

of the vocal folds.<br />

Figure 16.11 presents another example of the close<br />

link between Ps and fundamental frequency. From <strong>to</strong>p <strong>to</strong><br />

bot<strong>to</strong>m it plots the time functions for sound level (SPL<br />

in dB), pressure (cm H2O), fundamental frequency (Hz)<br />

for a professional bari<strong>to</strong>ne singing an ascending triad<br />

followed by a descending dominant-seventh triad (ac-<br />

16.2 The Glottal Sound Source<br />

In speech and singing the general method <strong>to</strong> generate<br />

sound is <strong>to</strong> make a constriction and <strong>to</strong> let a strong flow<br />

of air pass through it. The respira<strong>to</strong>ry component serves<br />

as the power source providing the energy necessary for<br />

sound production. At the glottis the steady flow of air<br />

generated by the respira<strong>to</strong>ry component is transformed<br />

in<strong>to</strong> a quasiperiodic series of glottal pulses. In the vocal<br />

tract, the glottally modified breath stream undergoes<br />

further modifications by the resonance characteristics of<br />

the oral, pharyngeal and nasal cavities.<br />

Constrictions are formed at the glottis - by adjusting<br />

the separation of the vocal folds – and above the glottis<br />

– by positioning the articula<strong>to</strong>rs of the vocal tract.<br />

As the folds are brought close <strong>to</strong>gether, they respond<br />

<strong>to</strong> the air rushing through by initiating an open–close<br />

vibration and thereby imposing a quasiperiodic modulation<br />

of airflow. Thus, in a manner of speaking, the<br />

glottal structures operate as a device that imposes an<br />

AC modulation on a DC flow. This is basically the way<br />

that voicing, the sound source of voiced vowels and con-<br />

cording <strong>to</strong> the score at the bot<strong>to</strong>m). The pressure record<br />

was obtained by recording the oral pressure as the subject<br />

repeated the syllable [Ô�] on each note. During<br />

the occlusion of the s<strong>to</strong>p the peak oral pressure becomes<br />

equal <strong>to</strong> the Ps (as discussed in connection with<br />

Fig. 16.7). That means that the peak values shown in the<br />

middle diagram are good estimates of the actual Ps. It<br />

is important <strong>to</strong> note that, when the trace repeatedly returns<br />

<strong>to</strong> a value near zero, what we see is not the Ps,<br />

but the oral pressure reading for the [�] vowel, which is<br />

approximately zero.<br />

The exercise in Fig. 16.11 is often sung stacca<strong>to</strong>, i. e.<br />

with short pauses rather than with a /p/ consonant between<br />

the <strong>to</strong>nes. During these pauses the singer has <strong>to</strong><br />

get ready for the next fundamental frequency value and<br />

must therefore avoid exhaling the pressurized air being<br />

built up in the lungs. Singers do this by opening the glottis<br />

between the <strong>to</strong>nes and simultaneously reducing their<br />

Ps <strong>to</strong> zero, so that no air will be exhaled during the silent<br />

intervals between the <strong>to</strong>nes. A remarkable fact demonstrated<br />

here is that, particularly when sung loudly – so<br />

that high Ps values are used – this exercise requires nothing<br />

less than a virtuoso mastering of both the timing and<br />

tuning of the breathing apparatus and the pitch control<br />

process. A failure <strong>to</strong> reach a target pressure is likely <strong>to</strong><br />

result in a failure <strong>to</strong> reach the target F0.<br />

sonants and the carrier of in<strong>to</strong>nation and melody, gets<br />

made.<br />

A second mechanism is found in the production of<br />

noise, the acoustic raw materials for voiceless sounds<br />

(e.g., [f], [×], [Ô], [�]). The term refers <strong>to</strong> irregular turbulent<br />

fluctuations in airflow which arise when air comes<br />

out from a constriction at a high speed. This process<br />

can occur at the glottis – e.g., in [��] sounds, whisper<br />

and breathy voice qualities – or at various places of<br />

articulation in the vocal tract.<br />

The framework for describing both singing and<br />

speech is that of the source-filter theory of speech production<br />

[16.27, 28]. The goal of this section is <strong>to</strong> put<br />

speech and singing side by side within that framework<br />

and <strong>to</strong> describe how the speaker and the singer<br />

coordinate respiration, phonation and articulation <strong>to</strong><br />

shape the final product: the acoustic wave <strong>to</strong> be perceived<br />

by the listener.<br />

Figure 16.12 [16.29] is an attempt <strong>to</strong> capture a few<br />

key aspects of vocal fold vibrations. At the center a sin-

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!