LR Rabiner and RW Schafer, June 3

More documents

Recommendations

Info

DRAFT: L. R. Rabiner and R. W. Schafer, June 3, 2009 446CHAPTER 8. THE CEPSTRUM AND HOMOMORPHIC SPEECH PROCESSING Imaginary Part Imaginary Part 1 0.5 0 −0.5 −1 (a) Glottal Pulse G(z) 33 −2 −1 0 Real Part 1 1 0.5 0 −0.5 −1 (c) Radiation Load R(z) −1 0 Real Part 1 Imaginary Part Imaginary Part 1 0.5 0 −0.5 −1 1 0.5 0 −0.5 −1 (b) Vocal Tract Impulse Response V(z) 10 −1 0 Real Part 1 (d) Voiced Excitation P(z) 80 −1 0 Real Part 1 Figure 8.14: Pole-zero plots for speech model: (a) Glottal pulse G(z). (b) Vocal tract system function, V (z). (c) Radiation load system function R(z). (d) z-transform of periodic excitation P (z). The final component of the model is the periodic excitation p[n]. To allow a simple analysis with z-transforms, we define p[n] as the one-sided quasi-periodic impulse train ∞ p[n] = β k δ[n − kNp], (8.42) which has z-transform P (z) = ∞ k=0 k=0 β k z −kNp = 1 . (8.43) −Np 1 − βz Note that P (z) is a rational function of z −Np due to the even spacing of the assumed excitation sequence. The denominator in Eq. (8.43) has Np roots at locations zk = β 1/Np e j2πk/Np , k = 0, 1, . . . , Np − 1. Figure 8.13(d) shows the first few impulse samples of p[n] with spacing Np = 80 samples and β = 0.999, corresponding to a fundamental frequency of 10000/80 = 125 Hz. Figure 8.14(d) shows the Np poles on a circle of radius β 1/Np for the case β = 0.999. The angular spacing between the poles is 2π/Np radians, corresponding to analog frequency 10000/Np = 125 Hz for an assumed sampling rate of Fs = 10000 Hz. This spacing is, of course, equal to the fundamental frequency. The log magnitudes of the discrete-time Fourier transforms corresponding to the sequences in Figure 8.13 and pole-zero plots in Figure 8.14 are shown
DRAFT: L. R. Rabiner and R. W. Schafer, June 3, 2009 8.3. HOMOMORPHIC ANALYSIS OF THE SPEECH MODEL 447 log e | G(e j2π FT ) | log e | R(e j2π FT ) | 3 2 1 0 −1 −2 −3 (a) Glottal Pulse Spectrum −4 0 1000 2000 3000 4000 5000 frequency in Hz 1 0 −1 −2 −3 (c) Radiation Load Frequency Response −4 0 1000 2000 3000 4000 5000 frequency in Hz log e | V(e j2π FT ) | | P(e j2π FT ) | 2 1.5 1 0.5 0 −0.5 −1 (b) Vocal Tract Frequency Response −1.5 0 1000 2000 3000 4000 5000 frequency in Hz 100 80 60 40 20 (d) Voiced Excitation Spectrum 0 0 1000 2000 3000 4000 5000 frequency in Hz Figure 8.15: Log magnitude (base e of DTFTs: (a) Glottal pulse DTFT log |G(e jω )|. (b) Vocal tract frequency response, log |V (e jω )|. (c) Radiation load frequency response log |R(e jω )|. (d) Magnitude of DTFT of periodic excitation |P (e jω )|. in Figure 8.15 in corresponding locations. Note that the discrete-time Fourier transforms are plotted as log e | · | rather than in dB (i.e., 20 log 10 | · |) as is common elsewhere throughout this text. To convert the plots in Figure 8.15(a), (b) and (c) to dB, simply multiply by 20 log 10 e = 8.6859. We see that the spectral contribution due to the glottal pulse is a lowpass component that has a dynamic range of about 6 between F = 0 and F = 5000 Hz. This is equivalent to about 50 dB spectral falloff. Figure 8.15(b) shows the spectral contribution of the vocal tract system. The peaks of the spectrum are approximately at the locations given in Table 8.2 with bandwidths that increase with increasing frequency. As depicted in Figure 8.15(c), the effect of radiation is to give a high frequency boost that partially compensates for the falloff due to the glottal pulse. Finally, Figure 8.15(d) shows |P (e j2πF T )| (not the log) as a function of F . Note the periodic structure due to the periodicity of p[n]. The fundamental frequency for Np = 80 is F0 = 10000/80 = 125 Hz. 6 Now if the components of the speech model are combined by convolution, as defined in the upper branch of Figure 8.12, the result is the synthetic speech signal s[n] which is plotted in Figure 8.16(a). The frequency-domain repre- 6 In order to be able to make the plot in Figure 8.15(d) in Matlab it was necessary to use β = 0.999; i.e., the excitation was not perfectly periodic.
Page 1 and 2: DRAFT: L. R. Rabiner and R. W. Scha
Page 23: DRAFT: L. R. Rabiner and R. W. Scha
Page 75 and 76:
DRAFT: L. R. Rabiner and R. W. Scha
Page 77:
DRAFT: L. R. Rabiner and R. W. Scha
show all

LR Rabiner and RW Schafer, June 3

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?