18.07.2013 Views

LR Rabiner and RW Schafer, June 3

LR Rabiner and RW Schafer, June 3

LR Rabiner and RW Schafer, June 3

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

DRAFT: L. R. <strong>Rabiner</strong> <strong>and</strong> R. W. <strong>Schafer</strong>, <strong>June</strong> 3, 2009<br />

8.4. COMPUTING THE SHORT-TIME CEPSTRUM AND COMPLEX CEPSTRUM OF SPEECH459<br />

Therefore, in this example, only the two circled samples at n = Np = 75 <strong>and</strong><br />

n = N − Np = 256 − 75 = 181 are in what could be considered their “correct”<br />

locations (assuming samples at 128 < n < 256 to be “negative quefrency” samples).<br />

Again, by increasing N, we can mitigate the aliasing effects since the<br />

cepstrum approaches zero for large n.<br />

8.4.2 Computation Based on the z-Transform<br />

An approach to computing the complex cepstrum of finite-length sequences that<br />

does not require phase unwrapping is suggested by the example of Section 8.3.<br />

In that example, the z-transforms of all the convolved components of the model<br />

had closed-form expressions as rational functions. By factoring the numerator<br />

<strong>and</strong> denominator polynomials, it was possible to compute the complex cepstrum<br />

exactly. Short-time analysis of natural speech signals is based upon finite-length<br />

(windowed) segments of the speech waveform, <strong>and</strong> if a sequence x[n] has finite<br />

length, then its z-transform is a polynomial in z −1 of the form<br />

X(z) =<br />

M<br />

x[n]z −n . (8.66a)<br />

n=0<br />

Such an M th -order polynomial in z −1 can be represented in terms of its roots<br />

as<br />

Mi <br />

X(z) = x[0] (1 − amz −1 Mo <br />

) (1 − b −1<br />

m=1<br />

m=1<br />

m z −1 ), (8.66b)<br />

where the quantities am are the (complex) zeros that lie inside the unit circle<br />

(minimum-phase part) <strong>and</strong> the quantities b −1<br />

m are the zeros that are outside the<br />

unit circle (maximum-phase part); i.e., |am| < 1 <strong>and</strong> |bm| < 1. We assume that<br />

no zeros lie precisely on the unit circle. 11 If we factor a term −b −1<br />

m z −1 out of<br />

each factor of the product at far right in Eq. (8.66b), then Eq. (8.66b) can be<br />

expressed as<br />

where<br />

X(z) = Az −Mo<br />

Mi <br />

(1 − amz −1 Mo <br />

) (1 − bmz). (8.66c)<br />

m=1<br />

A = x[0](−1) Mo<br />

Mo <br />

m=1<br />

m=1<br />

b −1<br />

m . (8.66d)<br />

This representation of a windowed frame of speech can be obtained by using a<br />

polynomial rooting algorithm to find the zeros am <strong>and</strong> b −1<br />

m that lie inside <strong>and</strong><br />

outside the unit circle, respectively for the polynomial whose coefficients are the<br />

sequence x[n].<br />

11 Perhaps not surprisingly, it is rare that a computed root of a polynomial is precisely on<br />

the unit circle; however, as previously mentioned, most of the zeros lie close to the unit circle<br />

for high-order polynomials.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!