29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

6.3. Voting based consensus functions<br />

Input: Soft cluster ensemble E containing l fuzzy clusterings Λi (∀i =1...l)<br />

Output: Product voting matrix ΠE<br />

Data: k clusters, n objects<br />

Hungarian (E)<br />

ΠE = 1k×n for i =1...l do<br />

if Λi not membership probabilities then<br />

Probabilize (Λi)<br />

end<br />

ΠE = ΠE ◦ Λi<br />

end<br />

Algorithm 6.2: Symbolic description of the soft consensus function ProductConsensus.<br />

Probabilize and Hungarian are symbolic representations of the conversion of fuzzy clusterings<br />

to membership probability matrices and the cluster disambiguation procedures,<br />

respectively, while 1 k×n represents a k × n unit matrix, and ◦ represents the Hadamard<br />

(or entrywise) matrix product.<br />

the product voting matrix ΠE as its output. However, as in the case of SumConsensus,<br />

it can be transformed into a fuzzy or a crisp consensus clustering, a process that<br />

can be included as the final step of ProductConsensus.<br />

The results of applying the product rule on the toy cluster ensemble of equation (6.29)<br />

yields the product matrix ΠE presented in equation (6.35).<br />

⎛<br />

ΠE = ⎝<br />

0.858 0.858 0.017 7.5·10 −4 2.7·10 −4 7.5·10 −4 7.9·10 −4 9.3·10 −4 9.3·10 −4<br />

0.001 0.001 1.9·10−4 6.6·10−5 4.5·10−5 6.6·10−5 0.037 0.903 0.903<br />

0.001 0.001 0.056 0.929 0.953 0.929 0.008 1.6·10−4 1.6·10−4 ⎞<br />

⎠<br />

(6.35)<br />

Dividing each column of ΠE by its L1-norm gives rise to the fuzzy consensus clustering<br />

solution Λc based on membership probabilities of equation (6.36), and assigning each<br />

object to the cluster it is most strongly associated to (breaking ties randomly) yields<br />

the crisp consensus clustering λc of equation (6.37).<br />

⎛<br />

Λc = ⎝<br />

0.997 0.997 0.235 8.1·10−4 2.8·10−4 8.1·10−4 0.017 0.001 0.001<br />

0.001 0.001 0.003 7.1·10−5 4.7·10−5 7.1·10−5 0.806 0.998 0.998<br />

0.002 0.002 0.762 0.999 0.999 0.999 0.177 1.8·10−4 1.8·10−4 λc = 1 1 3 3 3 3 2 2 2 <br />

⎞<br />

⎠ (6.36)<br />

(6.37)<br />

Notice that the differences between the fuzzy consensus clusterings Λc obtained by<br />

SumConsensus and ProductConsensus (equations (6.32) and (6.36) ) –due to the<br />

different way voters’ preferences are combined– are lost when they are transformed<br />

into crisp consensus clusterings.<br />

180

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!