29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 6. Voting based consensus functions for soft cluster ensembles<br />

⎛<br />

1.853 1.853 0.924 0.055 0.033 0.055 0.071 0.072<br />

⎞<br />

0.072<br />

ΣE = ⎝0.067<br />

0.067 0.043 0.017 0.014 0.017 1.014 1.901 1.901⎠<br />

(6.31)<br />

0.08 0.08 1.033 1.928 1.952 1.928 0.914 0.026 0.026<br />

Notice that the higher the value of the (i,j)th entry of ΣE, the more likely is that the<br />

jth object belongs to the ith cluster. Of course, this is due to the fact that the l =2<br />

fuzzy clusterings contained in the soft cluster ensemble of our toy example express<br />

object to cluster associations by means of membership probabilities, which are directly<br />

proportional to the strength of association between objects and clusters.<br />

Moreover, notice that if each column of ΣE is divided by its L1-norm (i.e. the sum<br />

of its elements), each column entries become cluster membership probabilities, and,<br />

therefore, a classic fuzzy consensus clustering solution Λc can be obtained (see equation<br />

(6.32)).<br />

⎛<br />

0.926 0.926 0.462 0.027 0.016 0.028 0.035 0.036<br />

⎞<br />

0.036<br />

Λc = ⎝0.033<br />

0.033 0.021 0.008 0.007 0.008 0.507 0.951 0.951⎠<br />

(6.32)<br />

0.041 0.041 0.517 0.965 0.977 0.964 0.458 0.013 0.013<br />

Furthermore, notice that Λc can be transformed into a crisp consensus clustering<br />

λc by simply assigning each object to the cluster it is most strongly associated to,<br />

breaking hypothetical ties at random. Referring once more to our toy example, the<br />

crisp consensus clustering obtained by hardening Λc is the one presented in equation<br />

(6.33).<br />

λc = 1 1 3 3 3 3 2 2 2 <br />

(6.33)<br />

– ProductConsensus (PC): the only difference between this consensus function and<br />

SumConsensus is that the preference values per candidate are multiplied instead of<br />

added. Quite obviously, the product rule is highly sensitive to low values, which could<br />

ruin the chances of a candidate on winning the election, no matter what its other confidence<br />

values are (van Erp, Vuurpijl, and Schomaker, 2002). Equation (6.34) presents<br />

the voting process that constitutes the core of the ProductConsensus consensus function.<br />

It is important to notice that Λi correspond to the cluster ensemble components<br />

once cluster alignment has been conducted, and matrix products are computed entrywise.<br />

As a result, the k × n product matrix ΠE is obtained.<br />

ΠE =<br />

l<br />

i=1<br />

Λi<br />

(6.34)<br />

Algorithm 6.2 presents the schematic description of the ProductConsensus consensus<br />

function. As in the previous consensus function, we propose converting the fuzzy<br />

clusterings Λi into membership probability matrices, which allows to apply the product<br />

voting rule on them once the cluster correspondence problem has been solved by<br />

means of the Hungarian algorithm. It can be observed that ProductConsensus yields<br />

179

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!