29.04.2013 Views

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

TESI DOCTORAL - La Salle

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 6. Voting based consensus functions for soft cluster ensembles<br />

clustering process, the interpretation of the scalar values λij contained in Λ differs (i.e. if λij<br />

represent membership probabilities, the larger their value the stronger the object-cluster<br />

association, whereas the opposite interpretation should be made in case they represent<br />

object to centroid distances).<br />

Either way, fuzzy clustering can be regarded as a version of hard clustering with relaxed<br />

object membership restrictions. Such relaxation is particularly useful when the clusters are<br />

not well separated, or when their boundaries are ambiguous. Moreover, object to cluster<br />

association information may be of help in discovering more complex relations between a<br />

given object and the clusters (Xu and Wunsch II, 2005). Furthermore, notice that soft<br />

clustering can also be regarded as a generalization of its hard counterpart, as a crisp partition<br />

can always be derived from a fuzzy one, whereas the opposite cannot be held.<br />

However, these apparent strengths of soft clustering algorithms have barely been reflected<br />

in the development of consensus functions especially devised for combining the outcomes<br />

of multiple fuzzy clustering processes, as most proposals in this area are oriented<br />

towards their application on hard clustering scenarios. Nevertheless, as described in section<br />

2.2, there exist a few works in the consensus clustering literature devoted to the derivation<br />

of consensus functions for soft cluster ensembles, such as the VMA consensus function of<br />

(Dimitriadou, Weingessel, and Hornik, 2002) or the ITK consensus function of (Punera and<br />

Ghosh, 2007). Moreover, several other consensus functions can be indistinctly applied on<br />

both hard and soft cluster ensembles, such as PLA (<strong>La</strong>nge and Buhmann, 2005) or HGBF<br />

(Fern and Brodley, 2004), while others, originally devised for hard cluster ensembles, can be<br />

adapted for their use as soft partition combiners with relative ease (e.g. (Strehl and Ghosh,<br />

2002)).<br />

In this chapter, we make several proposals regarding the application of consensus processes<br />

on soft cluster ensembles. For starters, the notion of soft cluster ensembles is reviewed<br />

in section 6.1. Next, in section 6.2, we describe a procedure for adapting the hard consensus<br />

functions employed in this work to soft cluster ensembles. In section 6.3, a family of<br />

novel soft consensus functions based on the application of cluster disambiguation and voting<br />

strategies is proposed. Finally, the results of several experiments regarding the performance<br />

of the proposed soft consensus functions are presented in section 6.4, and the discussion of<br />

section 6.5 puts an end to this chapter.<br />

6.1 Soft cluster ensembles<br />

As described in chapter 2, cluster ensembles are nothing but the compilation of the outputs<br />

of multiple (namely l) clustering processes. Focusing on a fuzzy clustering scenario, and<br />

making the simplyfing assumption that the l clustering processes partition the data into k<br />

clusters, a soft cluster ensemble E is represented by means of a kl × n real-valued matrix:<br />

⎛ ⎞<br />

Λ1<br />

⎜<br />

⎜Λ2<br />

⎟<br />

E = ⎜ ⎟<br />

⎝ . ⎠<br />

Λl<br />

(6.6)<br />

where Λi is the k × n real-valued clustering matrix resulting from the ith soft clustering<br />

process (∀i ∈ [1,l]).<br />

165

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!