29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.2. Random hierarchical consensus architectures<br />

1. Given the cluster ensemble size l, create a set of mini-ensembles sizes b with values<br />

sweeping from 2 to ⌊ l<br />

2 ⌋.<br />

2. For each b value –which corresponds to a RHCA variant– compute the number of<br />

stages of the RHCA s according to equation (3.1). With these results in hand, limit<br />

the sweep of values of b according to two criteria:<br />

i) as there exist multiple values of b that yield RHCA variants with the same number<br />

of stages, consider only the largest and smallest values of b that yield the same<br />

number of RHCA stages s.<br />

ii) keep those values of b that uniquely give rise to RHCA variants with a specific<br />

number of stages.<br />

3. For the reduced set of b values, compute the total number of consensus processes<br />

s<br />

Ki, and the real mini-ensembles sizes bij of the corresponding RHCA variants,<br />

i=1<br />

according to equations (3.2) and (3.3), respectively.<br />

4. Measure the time required for executing the consensus function F on c randomly<br />

picked mini-cluster ensembles of the sizes bij corresponding to each value of b.<br />

5. Employ the computed parameters of each RHCA variant (i.e. number of stages s,<br />

s<br />

total number of consensus processes Ki and the running times of the consensus<br />

i=1<br />

function F) to estimate the running times of the whole hierarchical architecture,<br />

using equations (3.4) or (3.9) depending on whether its fully serial or parallel version<br />

is to be implemented in practice.<br />

Table 3.2: Methodology for estimating the running time of multiple RHCA variants.<br />

Experimental design<br />

– What do we want to measure?<br />

i) The time complexity of random hierarchical consensus architectures.<br />

ii) The ability of the proposed methodology for predicting the computationally optimal<br />

RHCA variant, in both the fully serial and parallel implementations.<br />

– How do we measure it?<br />

i) The time complexity of the implemented serial and parallel RHCA variants is<br />

measured in terms of the CPU time required for their execution —serial running<br />

time (SRTRHCA) and parallel running time (PRTRHCA).<br />

ii) The estimated running times of the same RHCA variants –serial estimated running<br />

time (SERTRHCA) and parallel estimated running time (PERTRHCA)– are<br />

computed by means of the proposed running time estimation methodology, which<br />

is based on the measured running time of c = 1 consensus clustering process. Predictions<br />

regarding the computationally optimal RHCA variant will be successful<br />

56

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!