29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 3. Hierarchical consensus architectures<br />

In this chapter, we propose two strategies for constructing hierarchical consensus architectures,<br />

which differ in i) the way mini-ensembles are created, and ii) which HCA<br />

parameters are tuned by the user. As a result, two HCA implementation alternatives are<br />

put forward:<br />

– random hierarchical consensus architectures, whose tunable parameter is the size of<br />

the mini-ensembles –the components of which are selected at random–, which eventually<br />

determines the HCA topology (i.e. its number of stages).<br />

– deterministic hierarchical consensus architectures, where the construction of the miniensembles<br />

is driven by the cluster ensemble creation process —in particular, the diversity<br />

factors used in creating the ensemble determine the number of HCA stages<br />

and the mini-ensembles components.<br />

The following sections are devoted to describing the rationale and implementation details<br />

of both HCA variants, specifying how the number of stages, the number of consensus<br />

processes per stage and the size of the mini-ensembles are determined in each case. This<br />

description is completed by an analysis of their computational complexity.<br />

3.2 Random hierarchical consensus architectures<br />

In this section, we introduce random hierarchical consensus architectures (RHCA for short),<br />

we define their topology from a generic perspective, making a brief description of their<br />

foundations, followed by an analysis of their computational complexity.<br />

3.2.1 Rationale and definition<br />

The idea behind random hierarchical consensus architectures is to construct a regular pyramidal<br />

structure of intermediate consensus processes that delivers, at its top, the final consensus<br />

clustering solution λc. The term random refers not to the consensus architecture<br />

itself, but to the way mini-ensembles are created. In particular, the randomness of RHCA<br />

lies in the fact that the clusterings input to each stage of the hierarchical architecture are<br />

shuffled randomly.<br />

Besides this fact, the most distinctive feature of RHCA is that the user determines<br />

the size of the mini-ensembles, setting it to b, keeping it constant across the stages of<br />

the consensus architecture. Therefore, given a cluster ensemble containing l component<br />

clusterings and a mini-ensemble size set equal to b by the user, the number of stages s of<br />

the resulting RHCA is computed by equation (3.1).<br />

⎧<br />

⎪⎩<br />

⌊log b (l)⌉ if<br />

⎪⎨<br />

<br />

s = ⌊logb (l)⌉−1 if<br />

⌊log b (l)⌉ +1 if<br />

<br />

<br />

l<br />

b ⌊log b (l)⌉<br />

l<br />

b ⌊log b (l)⌉<br />

l<br />

b ⌊log b (l)⌉<br />

49<br />

<br />

≤ 1and<br />

<br />

≤ 1and<br />

<br />

> 1<br />

l<br />

b ⌊log b (l)⌉−1<br />

l<br />

b ⌊log b (l)⌉−1<br />

<br />

> 1<br />

<br />

=1<br />

(3.1)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!