29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

clustering count<br />

clustering count<br />

10<br />

5<br />

Mfeat FAC<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

10<br />

5<br />

(a) FAC<br />

Mfeat MOR<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(d) MOR<br />

Appendix B. Experiments on clustering indeterminacies<br />

clustering count<br />

clustering count<br />

10<br />

5<br />

Mfeat FOU<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

10<br />

5<br />

(b) FOU<br />

Mfeat PIX<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(e) PIX<br />

clustering count<br />

clustering count<br />

10<br />

5<br />

Mfeat KAR<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

10<br />

5<br />

(c) KAR<br />

Mfeat ZER<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(f) ZER<br />

Figure B.8: Histograms of the φ (NMI) values obtained on each data representation in the<br />

MFeat data set.<br />

clustering count<br />

60<br />

40<br />

20<br />

miniNG Baseline<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(a) Baseline<br />

clustering count<br />

60<br />

40<br />

20<br />

miniNG PCA<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(b) PCA<br />

clustering count<br />

60<br />

40<br />

20<br />

miniNG ICA<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(c) ICA<br />

clustering count<br />

60<br />

40<br />

20<br />

miniNG NMF<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(d) NMF<br />

clustering count<br />

60<br />

40<br />

20<br />

miniNG RP<br />

0<br />

0 0.5 1<br />

φ (NMI)<br />

(e) RP<br />

Figure B.9: Histograms of the φ (NMI) values obtained on each data representation in the<br />

miniNG data set.<br />

Furthermore, if figures B.10(b) and B.10(c) are compared to figure B.10(d), it is easy to<br />

see that whereas the two former present a wide and sharp peak centered around φ (NMI) =0.7<br />

(thus indicating that clustering solutions this good are likely to be obtained using the PCA<br />

and ICA representations of the objects), the latter has its acme around φ (NMI) =0.35—i.e.<br />

the quality of the RP based clustering solutions tends to be lower in this data set.<br />

B.1.11 BBC data set<br />

The BBC data collection constitutes another example where very diverse clustering solutions<br />

–with qualities ranging from φ (NMI) =0.01 to φ (NMI) =0.81– are obtained when clustering<br />

is conducted on the original representation of the objects (see figure B.11(a)).<br />

As far as the remaining data representations are concerned, the best results seem to<br />

be obtained using the NMF feature extraction technique, as its corresponding histogram is<br />

more scarcely and densely populated at the low and high ranges of φ (NMI) , respectively.<br />

239

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!