29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

6.4. Experiments<br />

Data set Soft cluster ensemble size l<br />

Zoo 285<br />

Iris 45<br />

Wine 225<br />

Glass 145<br />

Ionosphere 485<br />

WDBC 565<br />

Balance 35<br />

Mfeat 30<br />

miniNG 365<br />

Segmentation 260<br />

BBC 285<br />

PenDigits 285<br />

Table 6.1: Soft cluster ensemble sizes l corresponding to the unimodal data sets.<br />

φ (NMI)<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

ZOO<br />

0<br />

0 1 2 3 4<br />

CPU time (sec.)<br />

CSPA<br />

EAC<br />

HGPA<br />

MCLA<br />

VMA<br />

BC<br />

CC<br />

PC<br />

SC<br />

Figure 6.2: φ (NMI) vs CPU time mean ± 2-standard deviation regions of the soft consensus<br />

functions on the Zoo data collection.<br />

efficiency of VMA is quite expectable, due to the fact that it simultaneously solves the<br />

cluster correspondence problem and voting following an iterative procedure (Dimitriadou,<br />

Weingessel, and Hornik, 2002), whereas in SC, PC, BC and CC, these two processes are<br />

sequentially conducted.<br />

As regards the quality of the consensus clustering solutions, notice that the four consensus<br />

functions proposed achieve almost identical φ (NMI) scores than the best performing<br />

state-of-the-art alternative, VMA.<br />

Table 6.2 presents the significance level values obtained from all the t-paired tests conducted<br />

on the Zoo data set. The upper and lower triangular sections of the table correspond<br />

to the comparison between consensus functions in terms of CPU time and φ (NMI) , respectively.<br />

When pairwise comparisons between the ith and the jth consensus functions result<br />

in statistically significant differences, the corresponding significance level value p is presented<br />

in the (i,j)th entry of the table (or in the (j,i)th entry, depending on whether it is a<br />

comparison in terms of CPU time or φ (NMI) ). Otherwise, the lack of statistically significant<br />

186

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!