29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 5<br />

Multimedia clustering based on<br />

cluster ensembles<br />

As already outlined in section 1.3, multimodality is an increasingly noticeable trend as<br />

far as the nature of data is concerned. Given the growing ubiquity of multimedia data,<br />

it seems logical to consider that the derivation of robust clustering strategies as a means<br />

for organizing the increasingly larger multimodal repositories available is already a field of<br />

interest in itself.<br />

However, it is important to take into account that the direct application of classic<br />

clustering algorithms for partitioning multimedia data collections may turn out to be suboptimal.<br />

The reason for this is twofold: firstly, the usual indeterminacies that condition<br />

the performance of clustering algorithms are multiplied due to the existence of several data<br />

modalities. This means that the user must not only make a decision regarding which is<br />

the object representation or the clustering algorithm that are supposed to yield the best<br />

partition of the data —i.e. the ones best describing the natural group structure of the<br />

data. Furthermore, it is also necessary to make a decision regarding on which of the m<br />

data modalities clustering is to be conducted, as classic clustering algorithms are designed<br />

to operate on unimodal data.<br />

And secondly, notice that clustering multimedia data using a single modality entails<br />

ignoring the presumable positive synergies that may exist between the different modalities,<br />

which could be of interest for deriving a better partition of the data. The only way a<br />

classic clustering algorithm can take advantage of the possible benefits of multimodality<br />

consists in creating multimodal representations of the objects, conducting an early fusion<br />

of the features corresponding to distinct modalities prior to clustering. That is, clustering<br />

is conducted on a single, artificially generated multimodal representation of the objects<br />

created by the combination of the m original modalities. However, feature fusion may be<br />

benefitial or not as regards the quality of the clustering results, as reported in appendix<br />

B.2, which turns the application of this strategy into a further indeterminacy to deal with.<br />

For these reasons, in this chapter we propose applying consensus clustering as a means<br />

for clustering multimedia data robustly, as it provides a natural way for combining i) the<br />

results of clustering processes run on each one of the m distinct modalities —thus conducting<br />

a late fusion of modalities, and ii) the partitions obtained upon the multimodal data<br />

representation derived by the early fusion of the features of the m modalities. By doing so,<br />

133

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!