29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

5.3. Multimodal consensus clustering results<br />

The second analysis consists of comparing the unimodal and multimodal consensus<br />

clusterings with the cluster ensemble components of maximum and median φ (NMI) (which<br />

we call best and median ensemble components, or BEC and MEC for short). Taking these<br />

two cluster ensemble components as a reference, we have computed i) the percentage of<br />

experiments in which the evaluated consensus clustering attains a higher φ (NMI) ,andii) the<br />

relative percentage φ (NMI) differences between them and the evaluated consensus clustering.<br />

The results of this analysis are presented, per data collection and consensus function, in<br />

tables 5.3 and 5.4, where evaluation is referred to BEC and the MEC, respectively.<br />

It can be observed that, as already noticed in previous experiments, the EAC and<br />

HGPA consensus functions perform notably worse than the remaining ones. In average,<br />

unimodal consensus clusterings are better than their corresponding BEC in a 6.5% of the<br />

experiments, whereas the multimodal consensus clustering solutions attain a higher φ (NMI)<br />

than the BEC in a 8.7% of the occasions (see table 5.3). If this comparison is made in<br />

terms of the relative percentage φ (NMI) differences, we see that, in average, the unimodal<br />

consensus clusterings are a 33.5% worse than the BEC, while this percentage reduces to<br />

28.2% when the multimodal consensus clusterings are considered.<br />

If the median ensemble component is taken as a reference –see table 5.4–, we observe<br />

that unimodal consensus clusterings are better than the MEC in a 54% of the experiments<br />

conducted. In contrast, when the multimodal consensus clustering solution is considered,<br />

superiority with respect to the MEC is obtained in a 62.3% of the cases. If the MEC and<br />

the consensus clusterings are compared in terms of relative percentage φ (NMI) differences,<br />

we see that the unimodal consensus yields clusterings which are a 21.1% better than the<br />

MEC, while this percentage rises to 39.6% in the case of multimodal consensus.<br />

Thus, a conclusion we can draw at this point is that, in view of the results just reported,<br />

the execution of consensus processes on multimodal cluster ensembles yields better quality<br />

consensus clusterings than those obtained upon cluster ensembles based on a single modality,<br />

which somehow constitutes a claim in favour of early fusion techniques. However, this<br />

statement must be made with caution, as it is not supported by evidence in all the data<br />

collections (for instance, the CAL500 collection constitutes an exception to this rule).<br />

Multimodal vs unimodal consensus clustering<br />

Secondly, we have compared the quality of the multimodal consensus clusterings (that is,<br />

mod 1+mod 2<br />

mod 1 mod 2<br />

λ ) with the quality of their unimodal counterparts (λ and λ ). Again,<br />

c<br />

c<br />

c<br />

this comparison has been made in terms of the percentage of experiments in which the<br />

former attains a higher φ (NMI) than the latter, and the relative percentage φ (NMI) differences<br />

between them, taking the unimodal consensus clusterings as a reference. The results are<br />

presented in table 5.5.<br />

mod 1+mod 2<br />

In average terms, the multimodal consensus clustering λc is better than its<br />

two unimodal counterparts in a 53.3% of the experiments conducted, and worse than both<br />

of them in only a 13% of the occasions. If the φ (NMI) differences between these consensus<br />

clusterings are measured, we observe that multimodal consensus yields partitions that, in<br />

average relative percentage φ (NMI) terms, are a 4802.4% better. Although this large figure is<br />

mainly caused by two outliers (on the InternetAds data set using the ALSAD and KMSAD<br />

consensus functions), the results presented in table 5.5 show an overwhelming majority of<br />

positive Δφ (NMI) values, which reinforces the notion that multimodal consensus processes,<br />

146

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!