18.07.2013 Views

LR Rabiner and RW Schafer, June 3

LR Rabiner and RW Schafer, June 3

LR Rabiner and RW Schafer, June 3

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

DRAFT: L. R. <strong>Rabiner</strong> <strong>and</strong> R. W. <strong>Schafer</strong>, <strong>June</strong> 3, 2009<br />

8.5. HOMOMORPHIC FILTERING OF NATURAL SPEECH 469<br />

log magnitude<br />

Phase (Radians)<br />

Phase (Radians)<br />

2<br />

0<br />

−2<br />

(a) Log Magnitude<br />

−4 4096−point DFT of Windowed Signal<br />

−6<br />

Estimate of Vocal Tract Spectrum<br />

0 500 1000 1500 2000 2500 3000 3500 4000<br />

5<br />

0<br />

−5<br />

−10<br />

(b) Unwrapped Phase<br />

4096−point DFT of Windowed Signal<br />

Estimate of Vocal Tract Spectrum<br />

−15<br />

0 500 1000 1500 2000 2500 3000 3500 4000<br />

2<br />

0<br />

−2<br />

(c) Principal Value Phase<br />

0 500 1000 1500 2000 2500 3000 3500 4000<br />

Frequency (Hz)<br />

Figure 8.29: Homomorphic analysis of voiced speech; (a) Log magnitude<br />

of short-time Fourier transform, log |X(e jω )|. (Heavy line is estimate<br />

of log |HV (e jω )| obtained by lowpass liftering.) (b) Unwrapped phase<br />

arg{X(e jω )}. (Heavy line is estimate of arg{HV (e jω )} obtained by lowpass<br />

liftering.) (c) Principal value of phase of short-time Fourier transform<br />

ARG{X(e jω )}. (All functions in this figure are plotted as a function analog<br />

frequency F ; i.e., ω = 2πF T .)<br />

DFT frequency. Note that the unwrapped phase curve also displays periodic<br />

“ripples” with period equal to the fundamental frequency. Figures. 8.29a <strong>and</strong><br />

8.29b together comprise the complex logarithm of the short-time Fourier transform;<br />

i.e, they are the real <strong>and</strong> imaginary parts of the Fourier transform of<br />

the complex cepstrum, which is shown in Figure 8.30a. Figure 8.30 is virtually<br />

identical to Figure 8.27 because the DFT length (N = 4096) was large enough<br />

to make the time aliasing effects negligible <strong>and</strong> ensure accurate phase unwrapping.<br />

Notice again the peaks at both positive <strong>and</strong> negative times equal to the<br />

pitch period, <strong>and</strong> notice the rapidly decaying low-time components representing<br />

the combined effects of the vocal tract, glottal pulse <strong>and</strong> radiation. The cepstrum,<br />

which is simply the inverse transform of only the log magnitude with zero<br />

imaginary part, is shown if Figure 8.30b. Note that the cepstrum also displays<br />

the same general properties as the complex cepstrum, as it should, since the<br />

cepstrum is the even part of the complex cepstrum.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!