29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.4. Flat vs. hierarchical consensus<br />

of objects in the collection, as they employ object co-association matrices as a basic<br />

element of their consensus processes.<br />

– regarding which type of hierarchical consensus architecture (RHCA or DHCA) is<br />

more computationally efficient, little can be said apriori, as it depends on the specific<br />

configurations of the hierarhical architectures, i.e. their number of stages and the<br />

sizes of the mini-ensembles.<br />

– using the MCLA consensus function penalizes those architectures with large miniensembles,<br />

as its time complexity depends quadratically on this factor.<br />

Comparison across diversity scenarios and data collections<br />

Aiming to reveal the existence of any global computational superiority pattern between<br />

the two hierarchical consensus architecture variants, besides confirming the hypotheses put<br />

forward earlier, we have compiled the real running times of the assumedly computationally<br />

optimal RHCA and DHCA variants and of flat consensus in each diversity scenario using<br />

each consensus function, across the twelve data collections employed in these experiments.<br />

This process has been replicated for both the fully serial and parallel implementations<br />

of hierarchical consensus architectures, and as a result, the running time boxplot charts<br />

presented in figures 3.19 and 3.20 have been obtained.<br />

For starters, figure 3.19 presents the running times corresponding to the entirely serial<br />

implementation of the computationally optimal RHCA and DHCA variants and flat consensus.<br />

Notice the notable height of the boxes in the boxplots, caused by the fact that they<br />

represent running times of the consensus architectures across data collections with fairly<br />

distinct characteristics (i.e. number of objects and clusters). Nevertheless, the focus of this<br />

analysis should be placed on detecting relative differences between the boxes corresponding<br />

to the three consensus architectures. In general terms, it can be observed how flat consensus<br />

becomes gradually slower than hierarchical consensus as the size of the cluster ensembles<br />

grows (i.e. as the cardinality of the algorithmic diversity factor |dfA| is increased). As reported<br />

earlier, consensus architectures based on the EAC consensus function constitute the<br />

only exception to this rule. As regards the comparison between the running times of RHCA<br />

and DHCA, the most significant differences are observed when the ALSAD and SLSAD<br />

consensus functions are employed —in these cases, the random HCA variants are faster<br />

than their deterministic counterparts, a trend that becomes more apparent as the cluster<br />

ensembles size grows. Finally, notice that, in absolute terms, consensus architectures based<br />

on the HGPA, MCLA and CSPA consensus functions are faster than those employing the<br />

EAC, ALSAD, KMSAD and SLSAD clustering combiners.<br />

And secondly, the execution time boxplots corresponding to flat consensus and the fully<br />

parallel implementation of consensus architectures are depicted in figure 3.20. As in the serial<br />

case, the use of the HGPA, MCLA and CSPA consensus functions gives rise, in general,<br />

to faster consensus architectures than when consensus clustering solutions are generated by<br />

means of the EAC, ALSAD, KMSAD and SLSAD clustering combiners. Moreover, the superiority<br />

of hierarchical architectures in front of flat consensus becomes manifest in diversity<br />

scenarios with |dfA| ≥10, depending on the consensus function employed. As regards the<br />

comparison between RHCA and DHCA, a wide spectrum of behaviours is observed. When<br />

consensus is built upon the CSPA, EAC, HGPA and KMSAD consensus functions, little significant<br />

differences between both hierarchical architectures are detected. Meanwhile, RHCA<br />

94

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!