29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

F.1. Iris data set<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

IRIS<br />

0<br />

0 0.1 0.2 0.3 0.4<br />

CPU time (sec.)<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

VMA<br />

BC<br />

CC<br />

PC<br />

SC<br />

Figure F.1: φ (NMI) vs CPU time mean ± 2-standard deviation regions of the soft consensus<br />

functions on the Iris data collection.<br />

F.1 Iris data set<br />

This section described the results of the soft consensus clustering experiments run on the<br />

Iris data set. Figure F.1 presents the φ (NMI) vs CPU time mean ± 2-standard deviation<br />

regions of the nine consensus functions compared. Quite obviously, the closer the scatterplot<br />

of a consensus function was to the top left corner of the diagram, the better its performance<br />

would be (i.e. it would yield high quality consensus clustering solutions with low time<br />

complexity).<br />

In this case, the proposed SC and PC consensus functions match the performance of the<br />

VMA, both in terms of time complexity and consensus quality. The performance of the other<br />

two consensus functions proposed (BC and CC) is pretty comparable as far as the quality<br />

of the consensus clustering solutions is concerned, but their computational complexity is<br />

higher. As regards the state-of-the-art consensus functions, CSPA seems to yield slightly<br />

better quality results, although its CPU time more than doubles our proposals, being the<br />

most costly. On its part, MCLA seems to be competitive from a computational viewpoint,<br />

but it yields lower quality consensus clusterings. <strong>La</strong>st, EAC and HGPA are the worst<br />

performing consensus functions.<br />

If the statistical significance of the results is evaluated –see table F.1–, it can be obvserved<br />

that the φ (NMI) superiority of CSPA is only apparent, as the differences with respect<br />

to BC and CC are not statistically significant, and the quality of the consensus clusterings<br />

output by SC and PC are significantly better than those of CSPA. Moreover, SC and PC<br />

are statistically equivalent to VMA both in terms of quality and execution time.<br />

F.2 Wine data set<br />

The soft consensus clustering results obtained on the Wine data collection are presented<br />

next. For starters, figure F.2 displays the φ (NMI) vs CPU time mean ± 2-standard deviation<br />

regions corresponding to the nine consensus functions compared. In general terms, it can<br />

be observed that VMA is the fastest alternative, while the best quality consensus clustering<br />

374

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!