29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.4. Flat vs. hierarchical consensus<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

CSPA<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

EAC<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

HGPA<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

MCLA<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

ALSAD<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

KMSAD<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

SLSAD<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

Figure 3.22: φ (NMI) of the consensus solutions yielded by the computationally optimal<br />

RHCA, DHCA and flat consensus architectures on the Zoo data collection for the diversity<br />

scenario corresponding to a cluster ensemble of size l = 570.<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

CSPA<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

EAC<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

HGPA<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

MCLA<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

ALSAD<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

KMSAD<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

SLSAD<br />

E<br />

RHCA<br />

DHCA<br />

flat<br />

Figure 3.23: φ (NMI) of the consensus solutions yielded by the computationally optimal<br />

RHCA, DHCA and flat consensus architectures on the Zoo data collection for the diversity<br />

scenario corresponding to a cluster ensemble of size l = 1083.<br />

observed when consensus are built using the HGPA consensus function. In the remaining<br />

cases, much smaller deviations are found (maybe with the exception of ALSAD, but the<br />

observed dispersions are smaller than those of HGPA) —in fact, statistically non significant<br />

differences between the three consensus architectures are observed in the EAC, MCLA,<br />

KMSAD and SLSAD based architectures.<br />

Diversity scenario |df A| =28<br />

And last, the φ (NMI) values of the consensus clustering solutions yielded by the hierarchical<br />

and flat consensus architectures corresponding to the experiments conducted on the<br />

highest diversity scenario (i.e. cluster ensembles of size l = 1596) are presented in figure<br />

3.24. This scenario is ideal for analyzing the variability of the quality of the consensus<br />

clustering solutions output by the distinct consensus functions, as exactly the same cluster<br />

ensemble has been employed in the ten experiments analyzed —in contrast, in the previous<br />

diversity scenarios, the cluster ensemble employed in each one of the ten experiments was<br />

created by compiling the clustering components generated by |dfA| = {1, 10, 19} randomly<br />

picked clustering algorithms (that is, two superimposed randomness factors underlie the<br />

boxplots presented in figures 3.21 to 3.23). In this sense, it can be observed that, for any<br />

100

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!