29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Appendix D<br />

Experiments on self-refining<br />

consensus architectures<br />

This appendix presents several experiments regarding self-refining flat and hierarchical consensus<br />

architectures described in chapter 4. In appendix D.1, the proposal for automatically<br />

refining a previously derived consensus clustering solution –what is called consensus based<br />

self-refining– is experimentally evaluated, whereas appendix D.2 presents the experiments<br />

regarding the creation of a refined consensus clustering solution upon the selection of a high<br />

quality cluster ensemble component, i.e. selection-based self-refining.<br />

In both cases, the experiments are conducted on eleven unimodal data collections,<br />

namely: Iris, Wine, Glass, Ionosphere, WDBC, Balance, MFeat, miniNG, Segmentation,<br />

BBC and PenDigits. The results of the self-refining experiments are displayed by means<br />

of boxplot charts showing the normalized mutual information (φ (NMI) ) with respect to the<br />

ground truth of each data set compiled across 100 independent experiment runs of i) the<br />

cluster ensemble E each experiment is conducted upon, ii) the clustering solution employed<br />

as the reference of the self-refining procedure, and iii) the self-refined consensus cluster-<br />

ing solutions λc p i obtained upon select cluster ensembles Epi<br />

created by the selection of<br />

a percentage pi = {2, 5, 10, 15, 20, 30, 40, 50, 60, 75, 90} of the whole ensemble E. Asinall<br />

the experimental sections of this thesis, consensus processes have been replicated using the<br />

set of seven consensus functions described in appendix A.5, namely: CSPA, EAC, HGPA,<br />

MCLA, ALSAD, KMSAD and SLSAD.<br />

D.1 Experiments on consensus-based self-refining<br />

In this section, the results of applying the consensus-based self-refining procedure described<br />

in section 4.1 on the aforementioned eleven data sets are presented. The self-refining process<br />

is intended to better the quality of a consensus clustering solution λc output by a flat, a<br />

random (RHCA) and a deterministic hierarchical consensus architecture (DHCA).<br />

For this reason, the results are displayed as a matrix of boxplot charts with three columns<br />

(the leftmost one for flat consensus, the central one presenting the results of RHCA, and<br />

DHCA on the right), and as many rows as consensus functions are employed on each<br />

particular data collection (seven in all cases except for the PenDigits data set, where only<br />

two consensus functions are applicable due to memory limitations given our computational<br />

333

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!