29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

SERT RHCA (sec.)<br />

PERT RHCA (sec.)<br />

10 1<br />

10 0<br />

10 1<br />

10 0<br />

10 −1<br />

s : number of stages<br />

9 6 5 4 4 3 3 2 2 1<br />

2 3 4 5 7 8 19 20 285 570<br />

b : mini−ensemble size<br />

(a) Serial estimated running time<br />

s : number of stages<br />

9 6 5 4 4 3 3 2 2 1<br />

2 3 4 5 7 8 19 20 285 570<br />

b : mini−ensemble size<br />

(c) Parallel estimated running time<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

Chapter 3. Hierarchical consensus architectures<br />

SRT RHCA (sec.)<br />

PRT RHCA (sec.)<br />

10 1<br />

10 0<br />

10 1<br />

10 0<br />

10 −1<br />

s : number of stages<br />

9 6 5 4 4 3 3 2 2 1<br />

2 3 4 5 7 8 19 20 285 570<br />

b : mini−ensemble size<br />

(b) Serial real running time<br />

s : number of stages<br />

9 6 5 4 4 3 3 2 2 1<br />

2 3 4 5 7 8 19 20 285 570<br />

b : mini−ensemble size<br />

(d) Parallel real running time<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

Figure 3.5: Estimated and real running times of the serial RHCA on the Zoo data collection<br />

in the diversity scenario corresponding to a cluster ensemble of size l = 570.<br />

of PRTRHCA —in fact, the only prediction error occurs in the case the SLSAD consensus<br />

function is employed. In this case, according to PERTRHCA (figure 3.4(c)), the most efficient<br />

consensus architecture is the RHCA variant with s = 2 stages using mini-ensembles of<br />

size b = 28. However, the real execution times (figure 3.4(d)) reveal that flat consensus is<br />

the fastest option when this consensus function is employed for combining the clusterings.<br />

Nevertheless, we would like to highlight that the cost (measured in terms of running<br />

time) of selecting this computationally suboptimal RHCA variant based on the PERTRHCA<br />

prediction is almost negligible in absolute terms, as the difference between the running<br />

times of the truly and allegedly optimal parallel RHCA variants is smaller than a tenth of<br />

a second in this case.<br />

Diversity scenario |df A| =10<br />

Figure 3.5 presents the estimated and real execution times of several architectural variants of<br />

both the fully serial and parallel RHCA implementations in the second diversity scenario,<br />

the one resulting from employing |dfA| = 10 randomly chosen clustering algorithms for<br />

generating a cluster ensemble of size l = 570. In this case, the sweep of values of the<br />

mini-ensembles size is b = {2, 3, 4, 5, 7, 8, 19, 20, 285, 570}.<br />

If the estimated and real execution times of the serial implementation of the RHCA are<br />

59

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!