29.04.2013 Views

TESI DOCTORAL - La Salle


E.1. CAL500 data set

[Figure E.1 (image not reproduced): four boxplot panels, (a) Modality 1 (audio), (b) Modality 2 (text), (c) Multimodal (audio+text) and (d) Intermodal. Vertical axis: φ (NMI), from 0 to 1. Horizontal axis: E, CSPA, EAC, HGPA, MCLA, ALSAD, KMSAD, SLSAD. Panel title: λc, agglo-cos-upgma.]

Figure E.1: φ (NMI) boxplots of the unimodal, multimodal and intermodal consensus clustering solutions using the agglo-cos-upgma algorithm on the CAL500 data set.

E.1.1 Consensus clustering per modality and across modalities

First, the quality of the consensus clustering solutions obtained i) on the two original modalities, ii) on the fused audio+text multimodal modality, and iii) across the previous three modalities is evaluated. Figure E.1 presents the results obtained by applying the proposed multimodal consensus architecture to the cluster ensemble compiled from the partitions output by the agglo-cos-upgma clustering algorithm. The quality of the clusterings obtained on the audio modality is notably better than that of the clusterings obtained on the text modality (except when the EAC consensus function is employed). The early fusion of the audio and text features brings no benefit; if anything, it has the opposite effect. The quality of the intermodal consensus clusterings λc, which combine the three modalities, is approximately a trade-off among them.
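For reference, the φ (NMI) score plotted in these boxplots can be illustrated with a minimal sketch. It assumes that φ (NMI) denotes the normalized mutual information between a clustering and the ground-truth labeling, normalized by the geometric mean of the two entropies; this normalization is a common choice in the consensus clustering literature and is an assumption here, not a definition taken from this section. The function names and the toy label vectors are hypothetical.

```python
from collections import Counter
from math import log, sqrt


def entropy(labels):
    """Shannon entropy (in nats) of a labeling."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def mutual_information(a, b):
    """Mutual information (in nats) between two labelings of the same objects."""
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    joint = Counter(zip(a, b))
    return sum(
        (c / n) * log(c * n / (count_a[x] * count_b[y]))
        for (x, y), c in joint.items()
    )


def nmi(a, b):
    """NMI with geometric-mean normalization (assumed): I(a;b) / sqrt(H(a) * H(b))."""
    if len(a) != len(b):
        raise ValueError("both labelings must cover the same objects")
    h_a, h_b = entropy(a), entropy(b)
    if h_a == 0.0 or h_b == 0.0:
        # Degenerate case: at least one labeling puts everything in one cluster.
        # Convention chosen for this sketch: 1.0 if both are trivial, else 0.0.
        return 1.0 if h_a == h_b else 0.0
    return mutual_information(a, b) / sqrt(h_a * h_b)


# Hypothetical toy data: six objects, a reference labeling and two consensus labelings.
reference = [0, 0, 0, 1, 1, 1]
consensus_a = [1, 1, 1, 0, 0, 0]  # same partition as the reference, labels permuted
consensus_b = [0, 1, 0, 1, 0, 1]  # partition that cuts across the reference classes

print(nmi(reference, consensus_a))
print(nmi(reference, consensus_b))
```

Because the score depends only on how objects are grouped, and not on the cluster identifiers, a consensus clustering that reproduces the reference partition under permuted labels still attains the maximum score.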

Figures E.2, E.3 and E.4 depict, respectively, the results obtained on the cluster ensembles created with the direct-cos-i2, graph-cos-i2 and rb-cos-i2 CLUTO clustering algorithms. In all cases, the results closely resemble those just reported: the consensus clusterings based on the audio modality attain higher quality than those based on the remaining modalities, while the multimodal and intermodal consensus clustering solutions represent a trade-off between modalities.
