29.04.2013 Views

TESI DOCTORAL - La Salle


Appendix E

Experiments on multimodal consensus clustering

This appendix presents several experiments on the multimodal self-refining consensus architectures described in chapter 5, applied to the CAL500, InternetAds and Corel data collections. Due to space limitations, the experiments described correspond to the application of the proposed methodology to cluster ensembles produced by four of the twenty-eight clustering algorithms employed in this thesis, namely agglo-cos-upgma, direct-cos-i2, graph-cos-i2 and rb-cos-i2.

For each data set, two facets of the experiments are presented separately. Firstly, the consensus clusterings obtained on each modality and across modalities are qualitatively evaluated. To do so, a set of boxplot charts is presented, displaying the $\phi^{(\mathrm{NMI})}$ values of the components of the corresponding cluster ensemble E and of the unimodal, multimodal and intermodal consensus clusterings obtained by the seven consensus functions employed in this work across 10 independent runs.
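To make concrete what each boxplot summarizes, the following minimal sketch computes $\phi^{(\mathrm{NMI})}$ between a clustering and a reference labeling. It assumes the geometric-mean normalization, that is, mutual information divided by the square root of the product of the two entropies; the thesis may normalize differently. The labelings `truth`, `ensemble` and `consensus` are hypothetical toy data, not results from the experiments.

```python
from collections import Counter
from math import log, sqrt


def entropy(labels):
    """Shannon entropy (in nats) of a labeling."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def mutual_information(a, b):
    """Mutual information (in nats) between two labelings of the same objects."""
    n = len(a)
    count_a, count_b, joint = Counter(a), Counter(b), Counter(zip(a, b))
    return sum(
        (n_ij / n) * log(n * n_ij / (count_a[i] * count_b[j]))
        for (i, j), n_ij in joint.items()
    )


def nmi(a, b):
    """NMI with geometric-mean normalization (an assumption of this sketch)."""
    h_a, h_b = entropy(a), entropy(b)
    if h_a == 0 or h_b == 0:
        return 0.0  # a single-cluster labeling carries no information
    return mutual_information(a, b) / sqrt(h_a * h_b)


# Hypothetical toy data: a reference labeling, a four-component ensemble E,
# and one consensus clustering.
truth = [0, 0, 0, 1, 1, 1]
ensemble = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],  # same partition as truth, labels swapped
    [0, 0, 1, 1, 1, 1],
    [0, 1, 0, 1, 0, 1],
]
consensus = [0, 0, 0, 1, 1, 1]

# One value per ensemble component: the sample a boxplot for E would summarize.
ensemble_nmi = [nmi(component, truth) for component in ensemble]
# One value per consensus clustering; repeated runs would each add one value.
consensus_nmi = nmi(consensus, truth)
```

Because NMI is invariant to label permutations, the second component scores the same as the first. In an experiment like the one described, the values collected per consensus function over the independent runs would form one box in the chart.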

Secondly, the quality of the self-refined consensus clustering solutions output by the proposed consensus self-refining procedure is evaluated with the help of boxplot diagrams displaying the $\phi^{(\mathrm{NMI})}$ values of the corresponding cluster ensembles, of the non-refined consensus clustering $\lambda_c$ and of the self-refined consensus clustering solutions $\lambda_c^{p_i}$. As regards the latter, a set of refined clusterings is obtained using a range of percentages $p_i \in \{2, 5, 10, 15, 20, 30, 40, 50, 60, 75\}$ of the whole ensemble E. The performance of the $\phi^{(\mathrm{ANMI})}$-based supraconsensus function for selecting one of the $\lambda_c^{p_i}$ is also qualitatively evaluated.
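The selection step can be sketched as follows, assuming that $\phi^{(\mathrm{ANMI})}$ is the average NMI between a candidate clustering and the components of the ensemble E, and that the supraconsensus function returns the candidate that maximizes it. The function names, the candidate keys and the toy labelings are hypothetical, and the NMI normalization (geometric mean of the entropies) is likewise an assumption. Note that no ground truth is needed at this step.

```python
from collections import Counter
from math import log, sqrt


def nmi(a, b):
    """NMI with geometric-mean normalization (an assumption of this sketch)."""
    n = len(a)
    count_a, count_b, joint = Counter(a), Counter(b), Counter(zip(a, b))
    h_a = -sum((c / n) * log(c / n) for c in count_a.values())
    h_b = -sum((c / n) * log(c / n) for c in count_b.values())
    if h_a == 0 or h_b == 0:
        return 0.0  # a single-cluster labeling carries no information
    mi = sum(
        (n_ij / n) * log(n * n_ij / (count_a[i] * count_b[j]))
        for (i, j), n_ij in joint.items()
    )
    return mi / sqrt(h_a * h_b)


def anmi(candidate, ensemble):
    """Average NMI between one candidate clustering and every ensemble component."""
    return sum(nmi(candidate, component) for component in ensemble) / len(ensemble)


def supraconsensus(candidates, ensemble):
    """Return the key of the candidate with the highest ANMI against the ensemble."""
    return max(candidates, key=lambda name: anmi(candidates[name], ensemble))


# Hypothetical toy ensemble E and two candidate consensus clusterings:
# the non-refined solution and one self-refined solution.
ensemble = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 1, 1, 1, 1],
]
candidates = {
    "non_refined": [0, 1, 0, 1, 0, 1],
    "refined_p20": [0, 0, 0, 1, 1, 1],
}

selected = supraconsensus(candidates, ensemble)
```

In this toy case the refined candidate agrees exactly with two of the three components, so it attains the higher average and is selected. Evaluating the supraconsensus function then amounts to checking whether the selected candidate is also the one with the highest $\phi^{(\mathrm{NMI})}$ with respect to the reference labeling.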

E.1 CAL500 data set<br />

In this section, the results of the multimodal consensus clustering experiments conducted on the CAL500 data collection are described. The modalities contained in this data set are audio and text (see appendix A.2.2 for a description).
