05.06.2013 Views

A Study of the ITU-T G.729 Speech Coding Algorithm ...

A Study of the ITU-T G.729 Speech Coding Algorithm ...

A Study of the ITU-T G.729 Speech Coding Algorithm ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Open<br />

MASTER THESIS<br />

Datum - Date Rev Dokumentnr - Document no.<br />

04-09-28 PA1<br />

In <strong>the</strong> spectral envelope for voiced speech, <strong>the</strong> formants can be identified as peaks.<br />

The phoneme /ih/ has formants at approximately 300, 2300 and 3000 Hz. The formant’s<br />

bandwidth describes <strong>the</strong> width <strong>of</strong> <strong>the</strong> formants peaks. Formants and <strong>the</strong>ir bandwidths<br />

are <strong>the</strong> most important spectral features. They characterize <strong>the</strong> sounds <strong>of</strong> a speaker. In<br />

Figure 6: Waveforms and spectrums, for <strong>the</strong> voiced sound /ih/ and <strong>the</strong> unvoiced /s/.<br />

The dashed line in <strong>the</strong> spectrum shows <strong>the</strong> LPC envelope.<br />

statistical terms, a speech signal may be classified as a non-stationary random process.<br />

It is commonly assumed that a speech signal is wide sense stationary and ergodic in<br />

autocorrelation for short-time periods from 10 to 30 ms.<br />

The autocorrelation for a discrete-time deterministic signal is given by<br />

Rxx =<br />

∞<br />

m=−∞<br />

x[m]x[m + k] (1)<br />

The Discrete Fourier Transform (DFT) <strong>of</strong> <strong>the</strong> autocorrelation function yields <strong>the</strong> power spectrum<br />

<strong>of</strong> <strong>the</strong> signal, according to <strong>the</strong> equation<br />

Sxx ∆ =<br />

∞<br />

n=−∞<br />

Rxx(n)e −jωn<br />

This shows that <strong>the</strong> short-term autocorrelation for a frame is a function <strong>of</strong> <strong>the</strong> power spectral<br />

envelope, which in turn is directly related to <strong>the</strong> shape <strong>of</strong> <strong>the</strong> vocal tract. Thus, <strong>the</strong><br />

short-term autocorrelation is a function <strong>of</strong> <strong>the</strong> shape <strong>of</strong> <strong>the</strong> vocal tract.<br />

(2)<br />

20 (78)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!