29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

5.4. Discussion<br />

Data set<br />

Relative φ (NMI) difference<br />

with respect to<br />

BEC MEC<br />

IsoLetters -17.5 53.2<br />

CAL500 -34.7 13.2<br />

InternetAds -65.2 138.7<br />

Corel -28.9 169.1<br />

Table 5.15: Relative φ (NMI) percentage differences between the best and median components<br />

of the cluster ensemble and the consensus clustering λ final<br />

c selected by supraconsensus,<br />

for the four multimedia data collections, averaged across the seven consensus functions<br />

employed.<br />

In average, λ final<br />

c<br />

is, in relative percentage terms, a 36.6% worse than the BEC and a<br />

93.5% better than the MEC. As expected, the price to pay for the lack of precision of the<br />

supraconsensus function is a reduction of the quality of the final clustering solution.<br />

5.4 Discussion<br />

In this chapter, we have proposed and experimentally explored the use of consensus clustering<br />

strategies for partitioning multimedia data collections robustly. From our viewpoint,<br />

this application constitutes a natural extension of the computationally efficient consensus<br />

architectures presented in chapter 3 and the self-refining procedures proposed in chapter<br />

4. As mentioned earlier, the growing ubiquity of multimedia data makes the proposals put<br />

forward in the present chapter even more appealing.<br />

Across the experiments presented in this chapter and in appendix B.2, we have observed<br />

that partitioning multimodal data sets in a robust manner is more tricky than doing so in a<br />

unimodal scenario, as the existence of multiple modalities in the data increases the already<br />

numerous indeterminacies inherent to the clustering problem.<br />

As a means for fighting against this fact, modality fusion has become a recurrent issue in<br />

the multimedia data analysis literature. Indeed, assuming that combining the distinct data<br />

modalities can be of interest is pretty logical, as it is an obvious way to take advantage of<br />

the expectably existing constructive dependences between them. In this sense, and focusing<br />

on the clustering problem, there exist two main approaches to modality fusion: early (aka<br />

feature level) fusion and late (classification decision) fusion.<br />

However, our experiments have revealed that none of these fusion strategies is, by itself,<br />

capable of ensuring robust clustering results: in some cases, feature level fusion gives rise to<br />

the best clustering results, whereas, in other cases, it simply constitutes a trade-off between<br />

modalities. For this reason, our multimodal self-refining consensus clustering architectures<br />

constitute a generic approach for partitioning multimedia data collections with a reasonable<br />

degree of robustness, with the advantage of encompassing, simultaneously, early and late<br />

fusion techniques.<br />

To our knowledge, most works dealing with multimodal clustering focus on feature level<br />

fusion, deriving novel early fusion approaches for combining modalities (see section 1.4<br />

for a brief review). However, they often disregard the fact that, in some data sets, early<br />

160

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!