21.01.2014 Views

improving music mood classification using lyrics, audio and social tags

improving music mood classification using lyrics, audio and social tags

improving music mood classification using lyrics, audio and social tags

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Proof: Let p <strong>and</strong><br />

<strong>lyrics</strong><br />

p<strong>audio</strong><br />

denote the posterior probabilities of a data sample being estimated<br />

as positive by the lyric-based <strong>and</strong> <strong>audio</strong>-based classifiers, <strong>and</strong><br />

p <strong>lyrics</strong><br />

<strong>and</strong> p<br />

<strong>audio</strong><br />

denote the<br />

probabilities of the sample being estimated as negative by the two classifiers respectively.<br />

The multiplying rule says:<br />

if<br />

p × p > p × p<br />

<strong>lyrics</strong><br />

<strong>audio</strong><br />

<strong>lyrics</strong><br />

<strong>audio</strong><br />

(2)<br />

then the hybrid classifier would predict positive, otherwise, predict negative. In the case of<br />

binary <strong>classification</strong>, (2) can be rewritten as:<br />

p<br />

<strong>lyrics</strong><br />

⇒ p<br />

× p<br />

<strong>lyrics</strong><br />

<strong>audio</strong><br />

+ p<br />

> p<br />

<strong>audio</strong><br />

<strong>lyrics</strong><br />

× p<br />

<strong>audio</strong><br />

> 1 ⇒<br />

= ( 1−<br />

p<br />

p<br />

<strong>lyrics</strong><br />

+ p<br />

2<br />

<strong>lyrics</strong><br />

<strong>audio</strong><br />

)( 1−<br />

p<br />

> 0.5<br />

<strong>audio</strong><br />

)<br />

=<br />

1−<br />

p<br />

<strong>lyrics</strong><br />

− p<br />

<strong>audio</strong><br />

+ p<br />

<strong>lyrics</strong><br />

× p<br />

<strong>audio</strong><br />

This is, in fact, the rule of averaging. <br />

Therefore, this research uses the weighted averaging as the rule of late fusion. For each<br />

testing instance, the final estimation probability is calculated as:<br />

p = α p + ( 1−<br />

α)<br />

p<br />

(3)<br />

hybrid<br />

<strong>lyrics</strong><br />

<strong>audio</strong><br />

where α is the weight given to the posterior probability estimated by the lyric-based classifier. A<br />

song is classified as positive when the hybrid posterior probability is larger or equal than 0.5. In<br />

this experiment, the value of α was changed from 0.1 to 0.9 with an increment step of 0.1. The α<br />

value resulting in the best performing system was used to build the late fusion system, which was<br />

91

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!