22.09.2015 Views

of Microprocessors

Musical-Applications-of-Microprocessors-2ed-Chamberlin-H-1987

Musical-Applications-of-Microprocessors-2ed-Chamberlin-H-1987

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

SOURCE-SIGNAL ANALYSIS 587<br />

Quite obviously, eithet method involves a lot <strong>of</strong> calculation, approximately<br />

M X N operations, where M is the number <strong>of</strong> samples in the<br />

waveform section being analyzed and N is the number <strong>of</strong> lags tried in the<br />

peak/dip search. It can be reduced substantially by only evaluating lags that<br />

are close to the last measured pitch period.<br />

In theory, true autocorrelation is guaranteed to produce maximumheight<br />

peaks only at multiples <strong>of</strong> the true period. Thus, any perfectly<br />

periodic waveform will be correctly analyzed by the autocorrelation method.<br />

A changing waveform, however, is a different story. When the waveform<br />

changes, the peak corresponding to the pitch period is smaller than it would<br />

otherwise be because <strong>of</strong> the inexact repetition. There ate also additional peaks<br />

that may be due to strong harmonics, formants, etc. If the pitch peak<br />

attenuation due to waveform change is great enough, these secondary peaks<br />

will cause an error.<br />

Frequency-Domain Methods<br />

Pitch detection in the frequency domain is fairly simple to understand<br />

but does imply a lot <strong>of</strong> computation to obtain the spectrum. However, if the<br />

spectrum is used for formant analysis, then the additional processing necessary<br />

for pitch detection is relatively minor.<br />

The most straightforward frequency-domain pitch-detection scheme is<br />

an extension <strong>of</strong> frequency analysis mentioned earliet. The idea is to take the<br />

measured frequency <strong>of</strong> each significant component found and determine the<br />

greatest common divisor. For example, if components were found at 500 Hz,<br />

700 Hz, 1,100 Hz, and 1,500 Hz, the fundamental would be 100 Hz<br />

because it is the highest possible frequency for which all <strong>of</strong> the measured<br />

frequencies are harmonics. In real life, though, the frequency measurements<br />

will not be exact because <strong>of</strong> noise, a changing spectrum, etc. Thus, classic<br />

greatest-common-divisor algorithms will have to be extensively modified to<br />

allow some slop. Also, confining attention to the strongest half-dozen or<br />

fewer components will lessen the likelihood <strong>of</strong> confusion. If the least common<br />

multiple turns out to be a ridiculous number such as 20 Hz (any value<br />

less than the spectrum analysis bandwidth is suspect), then an unpitched<br />

sound should be assumed.<br />

The primary difficulty with the frequency-analysis method is the restriction<br />

on harmonic spacing so that accutate analysis is assured. When low<br />

fundamental frequencies are to be analyzed, this leads to very long analysis<br />

records and the possibility <strong>of</strong> significant frequency content changes over the<br />

duration <strong>of</strong> the record, which in turn can lead to errors.<br />

Homomorphic spectral analysis leads to a very good pitch detector, in<br />

fact one <strong>of</strong> the best available for speech sounds. In a cepstral plot, the<br />

low-quefrency values correspond to the spectrum shape, while highquefrency<br />

values correspond to the excitation function. For harmonic-rich<br />

tones, there will be a single sharp peak in the upper part <strong>of</strong> the cepstrum that

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!