29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

SERT DHCA (sec.)<br />

PERT DHCA (sec.)<br />

10 1<br />

10 0<br />

|df A | = 19 , |df D | = 14 , |df R | = 5<br />

ADR ARD DAR DRA RAD RDA flat<br />

DHCA variant<br />

10 0<br />

(a) Serial estimated running time<br />

|df A | = 19 , |df D | = 14 , |df R | = 5<br />

ADR ARD DAR DRA RAD RDA flat<br />

DHCA variant<br />

(c) Parallel estimated running time<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

Chapter 3. Hierarchical consensus architectures<br />

SRT DHCA (sec.)<br />

PRT DHCA (sec.)<br />

10 1<br />

10 0<br />

|df A | = 19 , |df D | = 14 , |df R | = 5<br />

ADR ARD DAR DRA RAD RDA flat<br />

DHCA variant<br />

10 0<br />

(b) Serial real running time<br />

|df A | = 19 , |df D | = 14 , |df R | = 5<br />

ADR ARD DAR DRA RAD RDA flat<br />

DHCA variant<br />

(d) Parallel real running time<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

ALSAD<br />

KMSAD<br />

SLSAD<br />

Figure 3.12: Estimated and real running times of the serial and parallel DHCA on the Zoo<br />

data collection in the diversity scenario corresponding to a cluster ensemble of size l = 1083.<br />

DHCA variant (ADR in this case). This fact somehow reinforces the idea that, when<br />

compared to the typically linear or quadratic time complexity of consensus functions, the<br />

multiplicative growth rate of the total number of consensus imposes a stronger constraint<br />

as far as the running time of the DHCA is concerned. As observed in the previous diversity<br />

scenarios, the selection of a particular DHCA variant is a less critical matter when the fully<br />

parallel implementation of DHCA is considered, as all six variants yield pretty similar real<br />

execution times. In the serial case, in contrast, the accuracy of the running time estimation<br />

methodology is much more crucial, although SERTDHCA performs as a reasonably successful<br />

predictor.<br />

As mentioned earlier, these same experiments have been run on the Iris, Wine, Glass,<br />

Ionosphere, WDBC, Balance and MFeat unimodal data collections, and the corresponding<br />

results are presented in appendix C.3. Most of the conclusions drawn regarding the computational<br />

efficiency of hierarchical and flat consensus architectures in the analysis of random<br />

hierarchical consensus architectures are also applicable in the context of DHCA, such as<br />

the high computational efficiency of i) parallel DHCA even in low diversity scenarios, and<br />

ii) serial DHCA in medium and high diversity scenarios, or the dependence between the<br />

characteristics of the consensus function employed for conducting clustering combination<br />

and the execution time of consensus architectures.<br />

Moreover, two extra conclusions regarding the selection of the computationally optimal<br />

DHCA variant must be discussed. Firstly, defining the DHCA architecture by means of an<br />

ordered list of diversity factors arranged in decreasing cardinality order (i.e. associating the<br />

79

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!