29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 5. Multimedia clustering based on cluster ensembles<br />

As far as this latter issue is concerned, it is important to notice that, if the cluster<br />

ensemble generation process proposed in section 5.1 is followed, multimodality induces an<br />

important increase in the cluster ensemble size. Indeed, if a set of f mutually crossed<br />

diversity factors of cardinalities |df1|, |df2|, ..., |dff | are applied on a particular unimodal<br />

data collection, we obtain a cluster ensemble of size:<br />

lunimodal = |df1||df2| ...|dff| (5.4)<br />

However, if the same data collection was multimodal and contained m modalities, the<br />

application of the previously presented multimedia cluster ensemble creation procedure –<br />

using exactly the same diversity factors applied on the unimodal version of the data set–<br />

would yield an ensemble of size:<br />

lmultimodal =(m +1)|df1||df2| ...|dff| (5.5)<br />

That is, multimodality increases the size of cluster ensembles by a factor of (m +1)<br />

—i.e. in the minimally multimodal case, m = 2, the cluster ensembles obtained are three<br />

times larger than those that would be created in a unimodal scenario. For this reason, and<br />

allowing for the conclusions of chapter 3, it seems that hierarchical consensus architectures<br />

are likely to be the most computationally efficient implementation alternative for deriving<br />

consensus clustering solutions upon multimodal cluster ensembles —however, the running<br />

time estimation process proposed in chapter 3 constitutes a valid tool for selecting apriori,<br />

and with a high degree of precision, which is the computationally optimal consensus<br />

architecture for solving a specific consensus clustering problem.<br />

Regardless of the consensus architecture employed, it will output a consensus clustering<br />

solution λc. The next and final step consists in applying the consensus self-refining procedure<br />

proposed in section 4.1, so as to obtain a presumably higher quality refined consensus<br />

clustering solution λ final<br />

c . Quite obviously, in a multimedia clustering scenario, the selfrefining<br />

process will be based on selecting a subset of the components of the multimodal<br />

cluster ensemble E for creating a select cluster ensemble. Otherwise, it follows exactly the<br />

steps presented in table 4.1.<br />

In this work, a three-stage deterministic hierarchical consensus architecture (or DHCA<br />

for short) has been applied for deriving the consensus clustering solution λc upon our<br />

multimodal cluster ensembles. This is due to the fact that we have deemed multimodality<br />

as a diversity factor (denoted as dfM) in itself. Moreover, in contrast to the procedure<br />

followed in the previous chapters, the multimodal cluster ensemble E considered in each<br />

individual experiment conducted in this chapter only contains clusterings created by a<br />

single clustering algorithm, despite, as mentioned earlier, |dfA| = 28 of them have been<br />

employed. In other words, for each data collection, experiments on |dfA| = 28 different<br />

cluster ensembles have been conducted. The components of each one of these ensembles<br />

only differ in the representational (dfR), dimensional (dfD) and multimodal (dfM) diversity<br />

factors, while having been created by means of a single clustering algorithm.<br />

According to the conclusions drawn in section 3.3, the DHCA variant that minimizes the<br />

number of executed consensus processes and the running time of its serial implementation<br />

is the one in which consensus processes are sequentially run on the distinct diversity factors<br />

that make up the cluster ensemble E arranged in decreasing cardinality order.<br />

137

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!