29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.3.4 Experiments<br />

Chapter 3. Hierarchical consensus architectures<br />

This section presents the results of multiple experiments oriented to illustrate the computational<br />

efficiency of DHCA, as well as to evaluate the predictive power of the running time<br />

estimation methodology of table 3.6. Their design follows the scheme presented next.<br />

Experimental design<br />

– What do we want to measure?<br />

i) The time complexity of deterministic hierarchical consensus architectures.<br />

ii) The ability of the proposed methodology for predicting the computationally optimal<br />

DHCA variant, in both the fully serial and parallel implementations.<br />

iii) The predictive power of the proposed methodology based on running time estimation<br />

vs the computational optimality criterion based on designing the DHCA<br />

according to a decreasing diversity factor cardinality order, in both the fully<br />

serial and parallel implementations.<br />

– How do we measure it?<br />

i) The time complexity of the implemented serial and parallel DHCA variants is<br />

measured in terms of the CPU time required for their execution —serial running<br />

time (SRTDHCA) and parallel running time (PRTDHCA).<br />

ii) The estimated running times of the same DHCA variants –serial estimated running<br />

time (SERTDHCA) and parallel estimated running time (PERTDHCA)– are<br />

computed by means of the proposed running time estimation methodology, which<br />

is based on the measured running time of c = 1 consensus clustering process. Predictions<br />

regarding the computationally optimal DHCA variant will be successful<br />

in case that both the real and estimated running times are minimized by the<br />

same DHCA variant, and the percentage of experiments in which prediction is<br />

successful is given as a measure of its performance. In order to measure the<br />

impact of incorrect predictions, we also measure the execution time differences<br />

(in both absolute and relative terms) between the truly and the allegedly fastest<br />

DHCA variants in the case prediction fails. This evaluation process is replicated<br />

for a range of values of c ∈ [1, 20], so as to measure the influence of this factor<br />

on the prediction accuracy of the proposed methodology.<br />

iii) Both computationally optimal DHCA variants prediction approaches are compared<br />

in terms of the percentage of experiments in which prediction is successful,<br />

and in terms of the execution time overheads (in both absolute and relative terms)<br />

between the truly and the allegedly fastest DHCA variants in the case prediction<br />

fails.<br />

– How are the experiments designed? The f! DHCA variants corresponding to<br />

all the possible permutations of the f diversity factors employed in the generation<br />

of the cluster ensemble have been implemented (see table 3.6). As described in appendix<br />

A.4, cluster ensembles have been created by the mutual crossing of f =3<br />

diversity factors: clustering algorithms (dfA), object representations (dfR) and data<br />

dimensionalities (dfD). Thus, in all our experiments, the number of DHCA variants is<br />

75

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!