29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

1.4. Clustering indeterminacies<br />

Data set Best results Mode #1 Mode #2 Multimodal<br />

CAL500<br />

Corel<br />

InternetAds<br />

IsoLetters<br />

φ (NMI) 0.411 (P100) 0.249 (P74) 0.310 (P88)<br />

algorithm rbr-cos-i2 graph-cos-i2 graph-jacc-i2<br />

representation RP, r=120 PCA, r=40 ICA, r=100<br />

φ (NMI) 0.669 (P99) 0.270 (P40) 0.675 (P100)<br />

algorithm rbr-corr-i2 rb-cos-e1 rbr-corr-i2<br />

representation RP, r=300 NMF, r=400 NMF, r=550<br />

φ (NMI) 0.430 (P100) 0.258 (P98) 0.319 (P99)<br />

algorithm bagglo-cos-slink agglo-corr-clink graph-jacc-i2<br />

representation RP, r=70 Baseline NMF, r=150<br />

φ (NMI) 0.754 (P93) 0.537 (P67) 0.897 (P100)<br />

algorithm rbr-corr-i2 graph-jacc-i2 rbr-corr-i2<br />

representation Baseline ICA, r=12 PCA, r=100<br />

Table 1.2: Illustration of the clustering indeterminacies on the CAL500, Corel, InternetAds<br />

and IsoLetters multimoda data sets. Each column presents the top-performing clustering<br />

configuration for each separate modality and for the multimodal data representation.<br />

The twenty-eight clustering algorithms employed in this work (see table A.1 for a quick<br />

reference) are run on i) each modality of the data set, and ii) on the multimodal representation<br />

obtained by early feature fusion, as described in appendix A.3.2. In both cases,<br />

clustering is run on the baseline and feature extraction based data representations (see<br />

section A.3.1 of appendix A).<br />

As a summary of the obtained results and an illustration of the clustering indeterminacies,<br />

table 1.2 presents the highest quality clustering results obtained in each case (i.e.<br />

when clustering is conducted on either mode –mode #1 and mode #2 columns– and on<br />

the multimodal representation), indicating the corresponding value of φ (NMI) , its percentile<br />

in the global φ (NMI) distribution obtained, and the top-performing clustering configuration<br />

(i.e. algorithm plus data representation plus dimensionality of the reduced feature space r<br />

when needed).<br />

As expected, there is no predominant modality across all the data sets. For the CAL500<br />

collection, the clustering results obtained on mode #1 (text) are clearly superior to the rest.<br />

A similar behaviour is observed in the InternetAds data set, where it is also mode #1 (images<br />

size and aspect ratio, plus caption and alternate text in this case) the one that yields the<br />

best clustering results, although its predominance is not as clear as in CAL500. In contrast,<br />

the highest quality clustering solutions are obtained from multimodal representations in the<br />

Corel and IsoLetters data sets.<br />

<strong>La</strong>st but not least, notice the effect of the data representation and clustering algorithm<br />

indeterminacies on all the data sets. Again, there is no universally superior clustering<br />

algorithm nor data representation that guarantees the best clustering results. For a more<br />

detailed description of the experimental results regarding the clustering indeterminacies in<br />

multimodal data collections, see appendix B.2.<br />

24

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!