29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 6. Voting based consensus functions for soft cluster ensembles<br />

in the ensemble) will depend on the voting procedure applied, which, at the same time, is<br />

conditioned by the way the voters’ preferences are expressed.<br />

Given that in fuzzy cluster ensembles voters express their preference for each and every<br />

one of the k candidates, the soft consensus functions proposed in this chapter make use of<br />

confidence and positional voting methods (van Erp, Vuurpijl, and Schomaker, 2002), which<br />

are applicable in voting scenarios in which voters grade candidates according to their degree<br />

of confidence. The former makes direct use of the specific values of the preference scores<br />

the voters emit –thus, they are sensitive to their scaling–, whereas the latter are based on<br />

ranking the candidates according to the degree of confidence expressed by the voters.<br />

As mentioned earlier, the way fuzzy clusterers express their preference for the clusters<br />

can be either directly or inversely proportional to the strength of association between objects<br />

and clusters (e.g. membership probabilities or distances to centroids, respectively). In fact,<br />

it is possible that both types of clusterings are intermingled in E, and, for this reason, the<br />

voting method must somehow be informed of this fact, so that appropriate scale or ranking<br />

transformations are applied —depending on whether a confidence or a positional voting<br />

strategy is employed.<br />

In the following sections, we present four consensus functions for soft cluster ensembles,<br />

each of which is based on a specific voting mechanism. Besides their generic description,<br />

we illustrate them by means of a toy example using the soft cluster ensemble E containing<br />

the l = 2 cluster aligned fuzzy clustering solutions presented in equation (6.28).<br />

E =<br />

Λ1<br />

Λ2<br />

⎛<br />

0.921 0.932 0.905 0.025 0.019 0.030 0.014 0.055<br />

⎞<br />

0.017<br />

⎜<br />

⎜<br />

0.025<br />

⎜<br />

= ⎜0.054<br />

⎜<br />

⎜0.932<br />

⎝0.042<br />

0.042<br />

0.026<br />

0.921<br />

0.025<br />

0.038<br />

0.057<br />

0.019<br />

0.005<br />

0.006<br />

0.969<br />

0.030<br />

0.011<br />

0.005<br />

0.976<br />

0.014<br />

0.009<br />

0.011<br />

0.959<br />

0.025<br />

0.006<br />

0.976<br />

0.009<br />

0.057<br />

0.038<br />

0.929<br />

0.016<br />

0.017<br />

0.972<br />

0.972 ⎟<br />

0.010 ⎟<br />

0.055 ⎟<br />

0.929⎠<br />

0.026 0.054 0.976 0.959 0.976 0.969 0.905 0.010 0.016<br />

(6.29)<br />

Notice that, in this toy example, the l = 2 clustering solutions (voters) compiled in<br />

the ensemble (Λ1 and Λ2) express object to cluster associations (their preferences for candidates)<br />

by means of membership probabilities, which makes the scalar elements of both<br />

clusterings directly comparable, thus avoiding the need for applying any scale transformations.<br />

However, this need not be the general case, and we will address how to deal with<br />

cluster ensembles containing unequal clusterings as the proposed consensus functions are<br />

presented throughout the following paragraphs.<br />

Confidence voting<br />

Consensus functions based on confidence voting methods derive the consolidated clustering<br />

solution upon the values of the confidence scores each clusterer assigns to each cluster. For<br />

this reason, a prerequisite for using these voting methods is that these confidence values are<br />

comparable in magnitude (van Erp, Vuurpijl, and Schomaker, 2002). Assuming this is true,<br />

we propose the application of the sum and product confidence voting rules, which gives rise<br />

to the following two consensus functions:<br />

– SumConsensus (SC): this consensus function is based on the application of the<br />

confidence voting sum rule, which simply consists of adding the confidence values<br />

177

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!