29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Appendix C. Experiments on hierarchical consensus architectures<br />

tation of hierarchical architectures are presented in figure C.30. It can be observed that<br />

flat consensus gradually moves from being the optimal consensus architecture in the lowest<br />

diversity scenario to being the slowest in the highest diversity one. Compared to the serial<br />

implementation, the running time differences between RHCA and DHCA are less significant<br />

in this case —except when the ALSAD consensus function is the base of the consensus<br />

architecture.<br />

Consensus quality comparison<br />

As regards the quality of the consensus clustering solutions yielded by each consensus architecture<br />

as a function of the consensus function employed and the diversity scenario, the<br />

results obtained are presented in figure C.31. A few observations can be made: firstly, if<br />

the results obtained by the seven consensus functions are compared, it is to notice that<br />

fairly different performances are obtained: for instance, HGPA gives rise to pretty poorer<br />

quality consensus than the remaining consensus functions, as none of the boxes exceeds the<br />

φ (NMI) =0.6 level. Moreover, these relative performances are maintained across the different<br />

diversity scenarios. Secondly, the variability of the quality of the consensus clustering<br />

solutions can be evaluated by observing figure C.31(d), as the depicted boxplots corresponds<br />

to ten independent runs of the consensus clustering processes on the same cluster ensemble.<br />

Notice that the major differences are observed in the HGPA, MCLA and KMSAD consensus<br />

functions, which, as aforementioned, contain some random parameters that makes their<br />

performance vary (largely, as in HGPA or slightly, as in MCLA) from run to run. Thirdly,<br />

the relative comparison of the quality of the consensus solutions yielded by the the two HCA<br />

and flat consensus is local to the consensus function employed. Whereas DHCA seems to<br />

give rise to better consensus clustering solutions when the CSPA, ALSAD and KMSAD<br />

consensus functions are employed, it tends to be outperformed by RHCA flat consensus<br />

when clusterings are combined by EAC or HGPA. <strong>La</strong>st, notice that the highest level of<br />

similarity between the top-quality cluster ensemble components and consensus clustering<br />

solutions correspond to DHCA based on the CSPA consensus function.<br />

C.4.2 Wine data set<br />

This section presents the comparison between flat consensus and the computationally optimal<br />

consensus architectures in terms of CPU execution time and normalized mutual information<br />

between the ground truth and the consensus clustering solution yielded by each<br />

one of them. On this data collection, the cluster ensemble sizes corresponding to the four<br />

diversity scenarios are l =45, 450, 855 and 1260.<br />

Running time comparison<br />

As regards the execution times of the fully serial implementations of the estimated optimal<br />

RHCA and DHCA variants and flat consensus, a double evolutive behaviour can be<br />

observed —see figure C.32. Firstly, those consensus architectures using the CSPA, HGPA,<br />

MCLA, ALSAD, KMSAD and SLSAD consensus functions follow the same evolution pattern<br />

observed, for instance, in the Iris data collection (i.e. the larger the cluster ensemble,<br />

the more efficient hierarhical architectures become compared to flat consensus). In contrast,<br />

the consensus architectures based on the EAC consensus function present a fairly different<br />

291

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!