04.04.2013 Views

Acoustic Waveforms Acoustic Waveforms Simple Harmonic Motion ...

Acoustic Waveforms Acoustic Waveforms Simple Harmonic Motion ...

Acoustic Waveforms Acoustic Waveforms Simple Harmonic Motion ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Acoustic</strong> <strong>Waveforms</strong><br />

Robert Mannell<br />

Macquarie University<br />

<strong>Simple</strong> <strong>Harmonic</strong> <strong>Motion</strong> (1)<br />

• A single cycle of a sine wave can be<br />

depicted as if it were a point on a circle<br />

moving anti-clockwise (they are<br />

mathematically equivalent).<br />

• As we move anti-clockwise from the 3<br />

o’clock position (0°), we pass through 90°<br />

(12 o'clock) 180° (9 o'clock), 270° (6<br />

o'clock) and 360° same as 0°.<br />

<strong>Simple</strong> <strong>Harmonic</strong> <strong>Motion</strong> (3)<br />

• Note that simple harmonic motion (SHM)<br />

is a characteristic of many physical<br />

systems, including a pendulum, a child’s<br />

swing or a tuning fork.<br />

• Ideal systems oscillate forever.<br />

• Natural systems displaying SHM gradually<br />

or quickly reduce in amplitude. This is<br />

known as “damping”.<br />

1<br />

3<br />

5<br />

<strong>Acoustic</strong> <strong>Waveforms</strong><br />

In this topic we will examine:-<br />

1. <strong>Simple</strong> <strong>Harmonic</strong> <strong>Motion</strong><br />

2. Continuous <strong>Waveforms</strong> and Damping<br />

3. Adding <strong>Waveforms</strong> and Phase<br />

4. Speech <strong>Waveforms</strong><br />

<strong>Simple</strong> <strong>Harmonic</strong> <strong>Motion</strong> (2)<br />

• Note that as we rotate anticlockwise from 0° we rise to a<br />

maximum value, back to the starting value (zero) and<br />

then to the minimum value and back to zero again.<br />

• Note that the angle can also be expressed in radians,<br />

where 2π (2 x pi) radians equals 360°.<br />

• One whole rotation (2π or 360°) is equal to the period (T)<br />

<strong>Waveforms</strong> and Damping (1)<br />

• A sine wave is a waveform exhibiting ideal<br />

simple harmonic motion. Such a system<br />

loses no energy (or has its energy<br />

replenished from outside the system).<br />

• Such a waveform can also be called a<br />

continuous waveform as it continues<br />

forever without eventually reducing to zero<br />

intensity.<br />

2<br />

4<br />

6<br />

1


<strong>Waveforms</strong> and Damping (2)<br />

• A sine wave is a waveform generated by a<br />

system that is characterised by ideal<br />

simple harmonic motion.<br />

• A sound wave exhibiting these<br />

characteristics would be a pure tone.<br />

<strong>Waveforms</strong> and Damping (4)<br />

• This loss of energy in an oscillating system<br />

is know as damping.<br />

• A damped waveform is also know as a<br />

non-continuous waveform.<br />

<strong>Waveforms</strong> and Damping (6)<br />

• The terms weakly damped and strongly<br />

damped are relative terms.<br />

• When compared to speech sounds, a<br />

tuning fork is said to be weakly damped as<br />

it dies down very slowly when compared to<br />

each of the glottal pulses generated by the<br />

vocal folds during phonation (glottal pulses<br />

die out in a very small fraction of a<br />

second).<br />

7<br />

9<br />

11<br />

<strong>Waveforms</strong> and Damping (3)<br />

• Real systems are never ideal. All naturallyoccurring<br />

systems lose energy (eg. as heat due<br />

to friction).<br />

• A tuning fork lasts a while, but doesn't ring<br />

forever. Its intensity decreases over a relatively<br />

long period (say about half a minute).<br />

• A tuning fork looses energy as heat (both<br />

internally as a consequence of heat loss during<br />

physical deformation and externally as a<br />

consequence of friction with air).<br />

<strong>Waveforms</strong> and Damping (5)<br />

• A damped waveform can die out quickly or<br />

slowly.<br />

• A waveform that dies out quickly is said to<br />

be strongly damped, as it loses energy<br />

quickly.<br />

• A waveform that dies out slowly is said to<br />

be weakly damped, as it loses energy<br />

slowly.<br />

<strong>Waveforms</strong> and Damping (7)<br />

• Damping is also a characteristic of systems that<br />

produce sounds with very complex spectral<br />

patterns.<br />

• If you hit your fist on a wooden table it will<br />

produce a "bang" that dies out very rapidly. The<br />

"bang" possesses a very complex aperiodic or<br />

random spectrum that dies out in less than a<br />

second and so the sound and the system<br />

generating it are much more highly damped than<br />

a tuning fork and the sound it produces.<br />

8<br />

10<br />

12<br />

2


<strong>Waveforms</strong> and Damping (8)<br />

• Note that it is common to refer to both a<br />

sound and the system that generates it as<br />

possessing some relative degree of<br />

damping.<br />

Adding <strong>Waveforms</strong> & Phase (2)<br />

Here we can see the effect<br />

of adding two pure tones,<br />

one of 100 Hz and the other<br />

of 500 Hz. The 500 Hz tone<br />

has half the sound pressure<br />

level of the 100 Hz tone. In<br />

the bottom part of the<br />

diagram we can see the<br />

two pure tones as dashed<br />

lines. A simple addition of<br />

the dashed lines results in<br />

the unbroken line. The<br />

unbroken line clearly has a<br />

more complex pattern than<br />

either of the two pure tones.<br />

Adding <strong>Waveforms</strong> & Phase (4)<br />

• In the next slide we add together three<br />

waveforms with frequencies of 100, 200<br />

and 300 Hz. (Highest common factor 100)<br />

• They differ in the positions of the start of<br />

each wave cycle.<br />

• In the left image they all start at 0°.<br />

• In the right image they start at 0°, 90° and<br />

180° (going from the lowest to highest<br />

frequency)<br />

13<br />

15<br />

17<br />

Adding <strong>Waveforms</strong> & Phase (1)<br />

• So far, we have examined simple<br />

waveforms that resemble sine waves.<br />

When we plot a pure tone we can easily<br />

see its period (or frequency) and<br />

amplitude (sound pressure level).<br />

• The vast majority of natural sounds are not<br />

pure tones but are complex sounds that<br />

can be thought of as the combination of<br />

two or more pure tones.<br />

Adding <strong>Waveforms</strong> & Phase (3)<br />

• Note, in the previous slide, that the complex<br />

pattern repeats with the same period as the<br />

100 Hz tone.<br />

• 100 Hz is the highest common integer factor of the<br />

frequencies of the two tones (100 and 500 can<br />

both be divided by 100 to give an integer result).<br />

• The frequency of a complex wave is always equal<br />

to the frequency of the highest common factor of<br />

the sine waves being added to produce it.<br />

• The repetition frequency of the complex pattern is<br />

called its fundamental frequency (F 0).<br />

Adding <strong>Waveforms</strong> & Phase (5)<br />

14<br />

16<br />

18<br />

3


Adding <strong>Waveforms</strong> & Phase (6)<br />

• As you have seen, differences in the<br />

relative starting point of the three waves<br />

results in very different shapes to the<br />

resulting complex waves.<br />

• These differences in starting point are<br />

known as phase differences and these<br />

differences can be expressed as a phase<br />

angle (in degrees).<br />

Adding <strong>Waveforms</strong> & Phase (8)<br />

When we add together two waves that differ<br />

slightly in frequency (less than 20Hz), the<br />

resultant waveform has a very low F0 which can<br />

be heard as an intensity fluctuation, which is know<br />

as beating.<br />

Speech <strong>Waveforms</strong> (1)<br />

• A waveform is a two dimensional representation of<br />

a sound. The two dimensions in a waveform<br />

display are time and intensity. The vertical<br />

dimension is intensity and the horizontal<br />

dimension is time.<br />

• <strong>Waveforms</strong> are known as time domain<br />

representations of sound as they display changes<br />

in intensity over time.<br />

• The intensity dimension actually displays sound<br />

pressure. Sound pressure is a measure of the tiny<br />

variations in air pressure that we are able to<br />

perceive as sound. Intensity in these waveforms is<br />

a simple linear scaling of sound pressure (not dB).<br />

19<br />

21<br />

23<br />

Adding <strong>Waveforms</strong> & Phase (7)<br />

Here we can see 2<br />

waves (dotted lines)<br />

added together with<br />

different phase angles.<br />

At 0° and 360° (the two<br />

dotted lines are in the<br />

same place) we get full<br />

reinforcement.<br />

At 180° we get complete<br />

phase cancellation.<br />

In between we have<br />

different degrees of<br />

reinforcement or<br />

cancellation.<br />

Adding <strong>Waveforms</strong> & Phase (9)<br />

• If you examine the topic web page at:http://www.ling.mq.edu.au/speech/acoustics/waveforms/adding_waveforms.html<br />

you can see some examples of how<br />

adding different waves with different phase<br />

angles (or phase relationships) can result<br />

in some very different wave shapes.<br />

Speech <strong>Waveforms</strong> (2)<br />

• On the following pages, all of the diagrams<br />

represent the waveforms of the speech of a<br />

single male speaker of Australian English.<br />

• Time Scales: For figures 5 to 13 the horizontal<br />

time scale is indicated by the vertical dashed<br />

lines which are 100 milliseconds (ms) or 1/10<br />

second apart and the waveforms on these<br />

diagrams represent 800 ms (0.8 secs) of<br />

speech. For figures 1 to 4 the waveforms are 40<br />

ms (0.04 secs) long and there are no vertical<br />

time scale markers.<br />

20<br />

22<br />

24<br />

4


Speech <strong>Waveforms</strong> (3)<br />

• A much more detailed discussion of the<br />

following speech waveforms can be found<br />

on the topic web site at:-<br />

http://www.ling.mq.edu.au/speech/acoustics/waveforms/speech_waveforms.html<br />

Speech <strong>Waveforms</strong> (5)<br />

25<br />

2. Here we compare<br />

the weak murmur in<br />

a voiced stop /d/<br />

occlusion with the<br />

waveforms of /n/<br />

and /l/ and /j/, two<br />

approximates. Note<br />

the fairly simple<br />

waveforms for /d/<br />

and /n/ and the more<br />

complex waveforms<br />

for /l/ and /j/.<br />

Speech <strong>Waveforms</strong> (4)<br />

40 ms<br />

Speech <strong>Waveforms</strong> (6)<br />

1. One fricative /h/<br />

and three vowels<br />

with the start of<br />

each pitch cycle<br />

indicated by a<br />

vertical bar.<br />

The different<br />

vowels have<br />

different<br />

waveform<br />

patterns<br />

40 ms 27<br />

40 ms<br />

28<br />

Speech <strong>Waveforms</strong> (7)<br />

4. Here we can<br />

see different parts<br />

of the same /z/.<br />

Initially the<br />

amount of noise<br />

(randomness) is<br />

slight but it<br />

gradually<br />

strengthens to<br />

compete in<br />

strength with the<br />

regular (voiced)<br />

component.<br />

Speech <strong>Waveforms</strong> (8)<br />

26<br />

3. /f/ greatly amplified<br />

(is actually about the<br />

same amplitude as<br />

the low intensity part<br />

of /s/ marked with α).<br />

Note the random<br />

pattern for /f/ and /s/<br />

and the slight<br />

amounts of random<br />

pattern superimposed<br />

on a regular pattern<br />

for /v/ and /z/.<br />

5. Here we can<br />

see three CVC<br />

syllables each<br />

starting with /h/<br />

and ending with<br />

/d/. Its difficult to<br />

differentiate the<br />

three vowels<br />

from their pattern<br />

(which looks a<br />

little like a fish<br />

skeleton).<br />

40 ms 29<br />

30<br />

5


Speech <strong>Waveforms</strong> (9)<br />

6. In this figure we<br />

can compare the<br />

three voiceless<br />

stops of English<br />

(followed by the<br />

same vowel).<br />

Notice that the<br />

burst is strongest<br />

for /t/ and that they<br />

all have a<br />

significant random<br />

aspiration phase<br />

following the burst.<br />

Speech <strong>Waveforms</strong> (11)<br />

31<br />

8. Here we<br />

compare the<br />

affricates of<br />

English. The burst<br />

and aspiration of<br />

/tS/ is similar to<br />

the voiceless<br />

stops. The /dZ/<br />

aspiration is much<br />

longer than for the<br />

voiced stops and<br />

the VOT is<br />

negative.<br />

Speech <strong>Waveforms</strong> (13)<br />

33<br />

10. For these two<br />

voiced fricatives<br />

we can see that<br />

the regular voiced<br />

component is<br />

strong and the<br />

irregular fricative<br />

component is<br />

relatively weak<br />

only becoming<br />

fairly strong near<br />

the end of the /z/<br />

35<br />

Speech <strong>Waveforms</strong> (10)<br />

7. Here we<br />

compare the<br />

three English<br />

voiced stops.<br />

The are all prevoiced<br />

(negative<br />

VOT) and the<br />

bursts can be<br />

seen just before<br />

the vowel.<br />

Speech <strong>Waveforms</strong> (12)<br />

9. /f/ is a weak<br />

fricative and /s/<br />

is a strong<br />

fricative. Both<br />

are random. We<br />

can clearly<br />

differentiate the<br />

strengths of<br />

these two types<br />

of stops here.<br />

Speech <strong>Waveforms</strong> (14)<br />

32<br />

34<br />

11. The /m/ and /n/<br />

waveforms are<br />

very similar (ignore<br />

the creak). Their<br />

waveforms can’t<br />

be discriminated<br />

but they can be<br />

discriminated from<br />

the more complex<br />

and intense pattern<br />

of the following<br />

vowel.<br />

36<br />

6


Speech <strong>Waveforms</strong> (15)<br />

12. /l/ and /r/<br />

waveforms are<br />

also very similar<br />

to each other<br />

and to /m/ and<br />

/n/ but, again,<br />

are simpler in<br />

pattern than the<br />

following vowel.<br />

Speech <strong>Waveforms</strong> (17)<br />

Waveform Visual Classes<br />

• For the purposes of this course we will define a number<br />

of "waveform visual classes". These classes are used<br />

purely for the practical purpose of distinguishing between<br />

pairs of waveforms.<br />

• These waveform visual classes are not true phonetic<br />

classes, but they are sets consisting of one or more<br />

complete phonetic class.<br />

• For the present, we will only consider consonant<br />

phonemes in simple CV syllables. In VC and VCV<br />

contexts the visual characteristics of some of the<br />

consonants can be quite different from their visual<br />

characteristics in CV context (this is particularly true for<br />

the oral stops).<br />

Speech <strong>Waveforms</strong> (19)<br />

b) Ambiguous, difficult to distinguish consonant visual classes:<br />

• WVC#5: Affricates: Appearance intermediate between the<br />

stops and fricatives /tS dZ/. /tS/ often looks like an intense<br />

stop, but sometimes its burst is not clear and so it will look<br />

more like a short voiceless fricative. /dZ/ often looks quite<br />

similar to a voiced strong fricative /z Z/ but usually the<br />

aspiration phase is much stronger.<br />

• WVC#6: Voiced fricatives (strong and weak) /v D z Z/: Can<br />

look a lot like WVC#4, but the mixture of periodicity and<br />

aperiodicity is sometimes clearly seen just before the vowel,<br />

especially for the strong fricatives /z Z/.<br />

• WVC#7: Voiced stops /b d g/: When the burst is clear<br />

(especially for /d/) the identification of this class is possible<br />

but often the burst is unclear and these sounds look very<br />

much like voiced strong fricatives.<br />

37<br />

39<br />

41<br />

Speech <strong>Waveforms</strong> (16)<br />

13. The<br />

semivowels /w/<br />

and /j/ (“y”) are<br />

also similar to<br />

each other and<br />

the preceding 4<br />

nasals and<br />

approximants but,<br />

again, are simpler<br />

in pattern than the<br />

following vowel.<br />

Speech <strong>Waveforms</strong> (18)<br />

Clearly separable consonant waveform visual classes:-<br />

• WVC#1: Strong Voiceless fricatives /s S/: Gradually<br />

increasing aperiodic or random pattern, no bursts,<br />

relatively intense frication.<br />

• WVC#2: Weak Voiceless fricatives /f T h/: Gradually<br />

increasing aperiodic or random pattern, no bursts,<br />

relatively weak frication.<br />

• WVC#3: Voiceless oral stops /p t k/: Aperiodic sound<br />

commencing abruptly with a burst.<br />

• WVC#4: Approximants and nasal consonants /l r w j m n<br />

N/: Relatively simple periodic (voiced) pattern and weaker<br />

in intensity than the vowel but rising smoothly to the vowel<br />

intensity over several glottal cycles during the consonantvowel<br />

transition.<br />

Speech <strong>Waveforms</strong> (20)<br />

• You are expected to be able to<br />

discriminate vowel waveforms from<br />

consonant waveforms (at least in simple<br />

CV contexts) and to reasonably reliably<br />

discriminate pairs of tokens belonging to<br />

the clearly separable CV consonant<br />

waveform visual classes (WVC#1 to 4).<br />

38<br />

40<br />

42<br />

7


Readings<br />

• The main readings for this topic are the<br />

topic web pages reached via:-<br />

http://www.ling.mq.edu.au/speech/acoustics/waveforms/index.html<br />

43<br />

8

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!