29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

F.3. Glass data set<br />

CSPA EAC HGPA MCLA VMA BC CC PC SC<br />

CSPA ——— 0.001 0.0001 × 0.0001 0.0249 0.0005 0.0001 0.0001<br />

EAC 0.0001 ——— 0.0001 × 0.0001 × 0.0105 0.0001 0.0001<br />

HGPA 0.0001 0.0001 ——— 0.0004 0.0001 0.0001 0.0001 × ×<br />

MCLA × 0.0001 0.0001 ——— 0.0001 × 0.0199 0.0013 0.0014<br />

VMA 0.0001 0.0001 0.0001 0.0001 ——— 0.0001 0.0001 0.0002 0.0002<br />

BC 0.0001 0.0001 0.0001 0.0001 0.0006 ——— × 0.0001 0.0001<br />

CC 0.0001 0.0001 0.0001 0.0001 0.0006 × ——— 0.0001 0.0001<br />

PC 0.0001 0.0001 0.0001 0.0001 × 0.001 0.001 ——— ×<br />

SC 0.0001 0.0001 0.0001 0.0001 × 0.0129 0.0129 × ———<br />

Table F.2: Significance levels p corresponding to the pairwise comparison of soft consensus<br />

functions using a t-paired test on the Wine data set. The upper and lower triangular sections<br />

of the table correspond to the comparison in terms of CPU time and φ (NMI) , respectively.<br />

Statistically non-significant differences (p >0.05) are denoted by the symbol ×.<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

GLASS<br />

0<br />

0 0.5 1 1.5 2 2.5<br />

CPU time (sec.)<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

VMA<br />

BC<br />

CC<br />

PC<br />

SC<br />

Figure F.3: φ (NMI) vs CPU time mean ± 2-standard deviation regions of the soft consensus<br />

functions on the Glass data collection.<br />

As figure F.3 suggests, VMA is again the least time consuming consensus function. As<br />

mentioned earlier, this is due to the simultaneity of the cluster disambiguation and voting<br />

processes in this consensus function. In contast, the proposed CC consensus function is by<br />

far the slowest, probably due to the exhaustive pairwise cluster confrontation implicit in<br />

the Condorcet voting method.<br />

In terms of quality, there is an apparent equality between the VMA, PC and SC consensus<br />

functions, attaining the highest φ (NMI) scores. The CSPA, BC, CC and MCLA<br />

consensus functions apparently yield lower quality consensus clustering solutions.<br />

When the statistical significance of these results is analyzed –see table F.3–, we see that<br />

the apparent time complexity superiority of VMA is statistically significant. As regards the<br />

quality of the consensus clustering solutions, it can be observed that the performances of<br />

VMA, SC and PC are statistically equivalent, whereas the differences between these and<br />

BC and CC are indeed significant.<br />

376

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!