29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

D.2. Experiments on selection-based self-refining<br />

φ (NMI)<br />

φ (NMI)<br />

1<br />

0.5<br />

0<br />

1<br />

0.5<br />

0<br />

E λc<br />

λ c 2<br />

λ c 5<br />

λ c 10<br />

HGPA<br />

E λc<br />

λ c 2<br />

λ c 5<br />

λ c 10<br />

λ c 15<br />

λ c 20<br />

λ c 30<br />

MCLA<br />

(a) RHCA<br />

λ c 40<br />

λ c 50<br />

λ c 60<br />

λ c 75<br />

λ c 90<br />

λ c 15<br />

λ c 20<br />

λ c 30<br />

λ c 40<br />

λ c 50<br />

λ c 60<br />

λ c 75<br />

λ c 90<br />

φ (NMI)<br />

φ (NMI)<br />

1<br />

0.5<br />

0<br />

1<br />

0.5<br />

0<br />

E λc<br />

λ c 2<br />

λ c 5<br />

λ c 10<br />

HGPA<br />

E λc<br />

λ c 2<br />

λ c 5<br />

λ c 10<br />

λ c 15<br />

λ c 20<br />

λ c 30<br />

MCLA<br />

(b) DHCA<br />

λ c 40<br />

λ c 50<br />

λ c 60<br />

λ c 75<br />

λ c 90<br />

λ c 15<br />

λ c 20<br />

λ c 30<br />

λ c 40<br />

λ c 50<br />

λ c 60<br />

λ c 75<br />

λ c 90<br />

Figure D.11: φ (NMI) boxplots of the cluster ensemble E, the original consensus clustering<br />

λc and the self-refined consensus clustering solutions λc p i output by the flat, RHCA, and<br />

DHCA consensus architectures on the PenDigits data collection across all the consensus<br />

functions employed. The green dashed vertical line identifies the clustering solution selected<br />

by the supraconsensus function in each experiment.<br />

D.2 Experiments on selection-based self-refining<br />

This section presents the results of the clustering self-refining procedure based on the selection<br />

of a cluster ensemble component λref by means of an average normalized mutual<br />

information (φ (ANMI) ) criterion —see section 4.3.<br />

The results are presented in a very similar fashion to that of the previous section,<br />

that is, by means of boxplot charts displaying the φ (NMI) of the cluster ensemble E, the<br />

selected cluster ensemble component λref and the self-refined consensus solutions λcpi ,<br />

with pi = {2, 5, 10, 15, 20, 30, 40, 50, 60, 75, 90}. Moreover, the clustering solution designated<br />

to be the optimal according to the supraconsensus function is highlighted by a vertical<br />

green dashed line, which provides a simple and fast means for evaluating its performance<br />

qualitatively.<br />

D.2.1 Iris data set<br />

Figure D.12 presents the results of the selection-based self-refining procedure applied on<br />

the Iris data set. It can be observed that the selected cluster ensemble component λref is<br />

of notable quality —i.e. well above the median partition in the cluster ensemble E. Notice<br />

how the self-refining process brings about relevant φ (NMI) gains depending on the consensus<br />

function employed. This is the case of the CSPA, MCLA, ALSAD, KMSAD and SLSAD<br />

consensus functions. However, the supraconsensus function selects λref as the optimal<br />

partition, thus ignoring the improvements introduced by the self-refining process on the<br />

aforementioned cases. This again highlights the need for good performing supraconsensus<br />

functions.<br />

348

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!