03.05.2014 Views

Computational Models of Music Similarity and their ... - OFAI

Computational Models of Music Similarity and their ... - OFAI

Computational Models of Music Similarity and their ... - OFAI

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

26 2 Audio-based <strong>Similarity</strong> Measures<br />

a classifier <strong>and</strong> is the main weak-point. To be able to describe a broad<br />

range <strong>of</strong> music a huge amount <strong>of</strong> clusters is necessary. Furthermore, there<br />

is no guarantee that music not used to train the clusters can be described<br />

meaningfully.<br />

The first localized approach was presented by Logan <strong>and</strong> Salomon [LS01].<br />

For each piece an individual set <strong>of</strong> clusters is used. The distances between<br />

these are computed using the Kullback-Leibler Divergence combined with<br />

the Earth Movers Distance [RTG00].<br />

Aucouturier <strong>and</strong> Pachet suggested using the computationally more expensive<br />

Monte Carlo sampling instead [AP02a; AP04a]. A simplified approach<br />

approach using a fast approximation <strong>of</strong> the Monte Carlo sampling<br />

was presented in [Pam05]. M<strong>and</strong>el <strong>and</strong> Ellis [ME05] propose an even simpler<br />

approach using only one cluster per piece <strong>and</strong> comparing them using<br />

the Kullback-Leibler Divergence. All three are described in detail in this<br />

subsection.<br />

Alternative techniques to compute spectral similarity include, for example,<br />

the anchor space similarity [BEL03], the spectrum histograms [PDW03a],<br />

or simply using the mean <strong>and</strong> st<strong>and</strong>ard deviations <strong>of</strong> the MFCCs (e.g. [TC02]).<br />

2.2.3.2 Thirty Gaussians <strong>and</strong> Monte Carlo Sampling (G30)<br />

This approach was originally presented in [AP02a]. Extensive evaluation<br />

results were reported in [AP04a]. A Matlab implementation based on these<br />

won the ISMIR 2004 genre classification contest [Pam04]. 12<br />

The approach consists <strong>of</strong> two steps. These are clustering the frames <strong>and</strong><br />

computing the cluster model similarity. First, the various spectra (frames<br />

represented by MFCCs) which occur in the piece are summarized (i.e., the<br />

distribution is modeled) by using a clustering algorithm to find typical spectra<br />

(cluster centers) <strong>and</strong> describing how typical they are (prior probabilities),<br />

<strong>and</strong> how the other spectra vary with respect to these few typical spectra<br />

(variances).<br />

Second, to compute the distance between two pieces, the distribution <strong>of</strong><br />

<strong>their</strong> spectra are compared. If two pieces have similar distributions (i.e.,<br />

if <strong>their</strong> spectra can be described using similar typical spectra, with similar<br />

variances, <strong>and</strong> priors) they are assumed to be similar.<br />

Frame Clustering<br />

The frames are clustered using a Gaussian Mixture Model (GMM) <strong>and</strong> Expectation<br />

Maximization (see e.g. [Bis95]). GMMs are a st<strong>and</strong>ard technique<br />

12 http://ismir2004.ismir.net/genre contest/index.htm

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!