LR Rabiner and RW Schafer, June 3
LR Rabiner and RW Schafer, June 3
LR Rabiner and RW Schafer, June 3
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
DRAFT: L. R. <strong>Rabiner</strong> <strong>and</strong> R. W. <strong>Schafer</strong>, <strong>June</strong> 3, 2009<br />
8.4. COMPUTING THE SHORT-TIME CEPSTRUM AND COMPLEX CEPSTRUM OF SPEECH459<br />
Therefore, in this example, only the two circled samples at n = Np = 75 <strong>and</strong><br />
n = N − Np = 256 − 75 = 181 are in what could be considered their “correct”<br />
locations (assuming samples at 128 < n < 256 to be “negative quefrency” samples).<br />
Again, by increasing N, we can mitigate the aliasing effects since the<br />
cepstrum approaches zero for large n.<br />
8.4.2 Computation Based on the z-Transform<br />
An approach to computing the complex cepstrum of finite-length sequences that<br />
does not require phase unwrapping is suggested by the example of Section 8.3.<br />
In that example, the z-transforms of all the convolved components of the model<br />
had closed-form expressions as rational functions. By factoring the numerator<br />
<strong>and</strong> denominator polynomials, it was possible to compute the complex cepstrum<br />
exactly. Short-time analysis of natural speech signals is based upon finite-length<br />
(windowed) segments of the speech waveform, <strong>and</strong> if a sequence x[n] has finite<br />
length, then its z-transform is a polynomial in z −1 of the form<br />
X(z) =<br />
M<br />
x[n]z −n . (8.66a)<br />
n=0<br />
Such an M th -order polynomial in z −1 can be represented in terms of its roots<br />
as<br />
Mi <br />
X(z) = x[0] (1 − amz −1 Mo <br />
) (1 − b −1<br />
m=1<br />
m=1<br />
m z −1 ), (8.66b)<br />
where the quantities am are the (complex) zeros that lie inside the unit circle<br />
(minimum-phase part) <strong>and</strong> the quantities b −1<br />
m are the zeros that are outside the<br />
unit circle (maximum-phase part); i.e., |am| < 1 <strong>and</strong> |bm| < 1. We assume that<br />
no zeros lie precisely on the unit circle. 11 If we factor a term −b −1<br />
m z −1 out of<br />
each factor of the product at far right in Eq. (8.66b), then Eq. (8.66b) can be<br />
expressed as<br />
where<br />
X(z) = Az −Mo<br />
Mi <br />
(1 − amz −1 Mo <br />
) (1 − bmz). (8.66c)<br />
m=1<br />
A = x[0](−1) Mo<br />
Mo <br />
m=1<br />
m=1<br />
b −1<br />
m . (8.66d)<br />
This representation of a windowed frame of speech can be obtained by using a<br />
polynomial rooting algorithm to find the zeros am <strong>and</strong> b −1<br />
m that lie inside <strong>and</strong><br />
outside the unit circle, respectively for the polynomial whose coefficients are the<br />
sequence x[n].<br />
11 Perhaps not surprisingly, it is rare that a computed root of a polynomial is precisely on<br />
the unit circle; however, as previously mentioned, most of the zeros lie close to the unit circle<br />
for high-order polynomials.