29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Consensus quality comparison<br />

Appendix C. Experiments on hierarchical consensus architectures<br />

As regards the quality of the consensus clustering process, figure C.37 presents the boxplots<br />

depicting the φ (NMI) values of the components of the cluster ensemble E and of the consensus<br />

clustering solutions output by the RHCA, DHCA and flat consensus architectures. On<br />

this data collection, the φ (NMI) differences between the clustering solutions output by the<br />

three distinct consensus architectures are, in general terms, small —except when the EAC<br />

consensus function is employed, as flat consensus is clearly superior in this case. Moreover,<br />

we can observe that ALSAD and SLSAD outstand among the remaining consenus functions<br />

as the top performers.<br />

C.4.4 Ionosphere data set<br />

This section presents the execution times of the computationally optimal RHCA, DHCA<br />

and flat consensus architecture and the φ (NMI) values of the consensus clustering solutions<br />

yielded by them on the Ionosphere data collection. The presented results consider the<br />

experiments conducted across four diversity scenarios, and the cluster ensemble sizes corresponding<br />

to them are l =97, 970, 1843 and 2716, respectively.<br />

Running time comparison<br />

The execution times of flat consensus and the serially implemented RHCA and DHCA are<br />

depicted in the boxplots charts presented in figure C.38. The relative behaviour of the<br />

three consensus architectures is pretty similar to the one observed on the previous data<br />

collections. That is, i) flat consensus becomes slower than its hierarchical counterparts as<br />

the size of the cluster ensemble becomes larger, except when the EAC consensus function is<br />

employed, and ii) RHCA tends to be faster than DHCA when the EAC, ALSAD, SLSAD,<br />

whereas the opposite behaviour is observed in the hypergraph based consensus functions.<br />

The running times obtained in the case the hierarchical consensus architectures are<br />

implemented in an entirely parallel manner are presented in figure C.39. As expected,<br />

RHCA and DHCA become sensibly more efficient than flat consensus. Notice, however,<br />

that notable differences between the running times of the two hierarchical architectures can<br />

be found under certain consensus functions, such as EAC, ALSAD, or SLSAD.<br />

Consensus quality comparison<br />

As far as the quality of the consensus clustering process is concerned, the φ (NMI) boxplots<br />

corresponding to the consensus clustering solutions obtained by the RHCA, DHCA and flat<br />

consensus architectures across the four diversity scenarios on the Ionosphere data collection<br />

are presented in figure C.40. A notably high variability as regards the optimality of the<br />

different consensus architectures is found depending on the consensus function employed.<br />

For instance, DHCA tends to yield the highest quality results when consensus is conducted<br />

by means of SLSAD, flat consensus gives the best consensus clustering solutions derived by<br />

HGPA, and when MCLA is chosen as the clustering combiner, RHCA attains the higher<br />

φ (NMI) values than the remaining consensus architectures. In contrast, marginal quality<br />

differences are observed between the qualities of the consensus clustering solutions derived<br />

by the three consensus architectures when the remaining consensus functions are employed.<br />

301

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!