13.12.2012 Aufrufe

DAGA 2010 - Deutsche Gesellschaft für Akustik eV

DAGA 2010 - Deutsche Gesellschaft für Akustik eV

DAGA 2010 - Deutsche Gesellschaft für Akustik eV

MEHR ANZEIGEN
WENIGER ANZEIGEN

Sie wollen auch ein ePaper? Erhöhen Sie die Reichweite Ihrer Titel.

YUMPU macht aus Druck-PDFs automatisch weboptimierte ePaper, die Google liebt.

154 <strong>DAGA</strong> <strong>2010</strong> Programm<br />

Mi. 14:50 Grashof C 20 Signalverarbeitung<br />

Cepstral Modulation Features for Classifying Audio Data<br />

A. Nagathil und R. Martin<br />

Ruhr-Universität Bochum, Inst. f. Kommunikationsakustik<br />

A variety of static short-time features derived from different signal domains<br />

such as the time, frequency and cepstral domain has been proposed<br />

for the task of general audio and music classification within the<br />

last decade. Further, the importance of modeling the temporal evolution<br />

of audio signals for improving the classification performance was<br />

demonstrated in the literature. These methods, however, are often based<br />

on the static features which only give a rough description of the<br />

underlying signal. Therefore, we propose an alternative way of feature<br />

extraction in which we first represent the signal dynamics by means of a<br />

highly-resolved cepstral modulation spectrum based on which dynamic<br />

features are extracted for general audio and music classification. For<br />

discriminating speech, music and noise we obtain an average detection<br />

rate of 96%. Musical genres are classified correctly with an accuracy of<br />

81%.<br />

Mi. 15:15 Grashof C 20 Signalverarbeitung<br />

Direction-of-arrival estimation using auditory models employing<br />

pitch analysis<br />

M. Klein-Hennig, M. Dietz, S. Ewert und V. Hohmann<br />

Medizinische Physik, Carl-von-Ossietzky Universität Oldenburg<br />

For the estimation of the direction-of-arrival (DOA) of sounds from binaural<br />

signals, a computational algorithm based on a binaural auditory<br />

model was presented by Dietz et al. [NAG/<strong>DAGA</strong>, 1299-1300 (2009)].<br />

In the present study, an extension of this DOA estimator with an algorithm<br />

for multiple source tracking based on particle filters [e.g., Särkkä<br />

et al., Information Fusion 8(1), 2-15 (2007)] is suggested and evaluated.<br />

The extended DOA estimator was able to successfully track independent<br />

movements of three concurrent speakers in the frontal, horizontal halfplane<br />

of a virtual acoustic space. To further improve the accuracy and<br />

robustness of speech source tracking and to simulate grouping mechanisms<br />

as employed by the human auditory system, a second extension<br />

of the model by an auditory-model-based pitch (fundamental frequency)<br />

estimator was investigated. The pitch estimator combines spectral and<br />

temporal information of the signal extracted from the excitation pattern<br />

and extracted by strobed temporal averaging, respectively.

Hurra! Ihre Datei wurde hochgeladen und ist bereit für die Veröffentlichung.

Erfolgreich gespeichert!

Leider ist etwas schief gelaufen!