29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

6.4. Experiments<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

VMA<br />

φ (NMI) BC CC PC SC<br />

better than ... 27.3% 27.3% 9.1% 9.1%<br />

equivalent to ... 9.1% 9.1% 45.4% 45.4%<br />

worse than ... 63.6% 63.6% 45.4% 45.4%<br />

better than ... 0% 0% 0% 0%<br />

equivalent to ... 0% 0% 0% 0%<br />

worse than ... 100% 100% 100% 100%<br />

better than ... 0% 0% 0% 0%<br />

equivalent to ... 0% 0% 0% 0%<br />

worse than ... 100% 100% 100% 100%<br />

better than ... 0% 0% 0% 0%<br />

equivalent to ... 16.7% 16.7% 16.7% 16.7%<br />

worse than ... 83.3% 83.3% 83.3% 83.3%<br />

better than ... 25% 25% 0% 0%<br />

equivalent to ... 33.3% 33.3% 100% 91.7%<br />

worse than ... 41.7% 41.7% 0% 8.3%<br />

Table 6.3: Percentage of experiments in which the state-of-the-art consensus functions<br />

(CSPA, EAC, HGPA, MCLA and VMA) yield (statistically significant) better/equivalent/worse<br />

consensus clustering solutions than the four proposed consensus functions<br />

(BC, CC, PC and SC).<br />

differences between the distinct voting strategies is somehow lost.<br />

In either case, the φ (NMI) scores obtained by the four proposed voting based consensus<br />

functions is statistically significantly lower than that of CSPA and VMA in only a 15.3% of<br />

the experiments conducted, which clearly indicates that, from a consensus quality perspective,<br />

our proposals constitute an attractive alternative for conducting consensus clustering<br />

on soft cluster ensembles.<br />

And secondly, the results of the previously described comparison, but referred to execution<br />

CPU time, are presented in table 6.4. In general terms, the state-of-the-art consensus<br />

functions (except for EAC) are faster than the proposed consensus functions based on positional<br />

voting methods (BC and CC). This is due to the candidates ranking step that<br />

precedes the voting process itself (see algorithms 6.3 and 6.4). Moreover, the execution of<br />

CC takes longer than BC, due to the exhaustive pairwise candidate comparison involved in<br />

the Condorcet voting method. In contrast, the confidence voting based consensus functions<br />

(PC and SC) are more computationally efficient, being as fast or faster than CSPA, EAC,<br />

HGPA and MCLA in an average 80.7% of the experiments. However, they are unable to<br />

match the computational efficiency of VMA, which, as mentioned earlier, is caused by its<br />

simultaneous and iterative cluster disambiguation and voting procedure.<br />

As a conclusion, it can be stated that the four voting based consensus functions proposed<br />

in this chapter are indeed worthy of being considered as an alternative when it comes to<br />

creating consensus clustering solutions on soft cluster ensembles, as they are capable of<br />

delivering high quality consensus clustering solutions at an acceptable computational cost<br />

—this is specially true for those consensus functions based on confidence voting methods<br />

(i.e. PC and SC). The higher computational complexity of positional voting based consensus<br />

functions (BC and CC) suggests limiting their application to those cases in which the<br />

188

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!