29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Appendix C. Experiments on hierarchical consensus architectures<br />

DHCA variant implemented, at least, no significant differences are observed among them –<br />

as opposed to the serial implementation case–, while hierarchical consensus is more efficient<br />

than flat consensus as soon as the cluster ensemble size grows.<br />

C.3.6 Balance data set<br />

This section presents the results of estimating the execution times of the fully serial and<br />

parallel implementations of DHCA in the four diversity scenarios for the Balance data set,<br />

which give rise to cluster ensembles of sizes l =7, 70, 133 and 196 each.<br />

Firstly, figure C.25 presents the estimated and real execution times corresponding to<br />

the serial implementation context. It is quite apparent that SERTDHCA provides the user<br />

with a good estimation of the real running time of consensus architectures (SRTDHCA) and,<br />

as such, it allows determining which is the computationally optimal consensus architecture<br />

with a high degree of accuracy. In this case, given the small size of the cluster ensemble<br />

in either of the four diversity scenarios, flat consensus is faster than most serial DHCA<br />

variants.<br />

And secondly, the results corresponding to the parallel implementation of DHCA are<br />

depicted in figure C.26. Once more, all DHCA variants have very similar running times.<br />

As in the serial case, flat consensus is faster than any of its hierarhical counterparts, except<br />

when the HGPA and MCLA consensus functions are employed as clustering combiners.<br />

C.3.7 MFeat data set<br />

This section describes the results of the minimum complexity DHCA variant selection based<br />

on running time estimation. In the case of the MFeat data collection, cluster ensembles of<br />

sizes l =6, 60, 114 and 168 correspond to the four diversity scenarios where this experiment<br />

is conducted.<br />

Figure C.27 depicts the estimated and real execution times of the fully serial implementation<br />

of DHCA. In this case, SERTDHCA is a pretty accurate estimator of SRTDHCA, and,<br />

as such, it is a good predictor of the most computationally efficient consensus architecture.<br />

In most cases, however, due to the relatively small sizes of the cluster ensembles in this data<br />

set, flat consensus is faster than any of the DHCA variants —except when the HGPA and<br />

MCLA consensus functions are employed in high diversity scenarios.<br />

<strong>La</strong>st, figure C.28 presents the results corrresponding to the parallel DHCA implementation.<br />

Again, the time complexities of DHCA variants reach pretty similar values. However,<br />

notice that DHCA variants are slower than flat consensus in most of the diversity scenarios<br />

—except when the HGPA and MCLA consensus functions are used for combining the<br />

clusterings.<br />

283

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!