24.04.2015 Views

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

640 K.-L. Wu et al. / Pattern Recogniti<strong>on</strong> Letters 26 (2005) 639–652<br />

The <str<strong>on</strong>g>clustering</str<strong>on</strong>g> applicati<strong>on</strong>s in various areas have<br />

been well documented (Duda and Hart, 1973; Jain<br />

and Dubes, 1988; Kaufman and Rousseeuw,<br />

1990). In these <str<strong>on</strong>g>clustering</str<strong>on</strong>g> methods, the hard<br />

c-means (or k-means) and <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> c-means (FCM)<br />

<str<strong>on</strong>g>clustering</str<strong>on</strong>g> <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>s are the most well-known<br />

methods (Bezdek, 1981; Jain and Dubes, 1988;<br />

Yang, 1993). Most of these methods are <str<strong>on</strong>g>based</str<strong>on</strong>g><br />

<strong>on</strong> minimizing the within-cluster <strong>scatter</strong> <strong>matrix</strong><br />

trace. The within-cluster <strong>scatter</strong> <strong>matrix</strong> trace can<br />

be interpreted as a compactness measure with a<br />

within-cluster variati<strong>on</strong>. Because the <str<strong>on</strong>g>clustering</str<strong>on</strong>g> results<br />

obtained using k-means and FCMare<br />

roughly spherical with similar volumes, many <str<strong>on</strong>g>clustering</str<strong>on</strong>g><br />

<str<strong>on</strong>g>algorithm</str<strong>on</strong>g>s such as the Gustafs<strong>on</strong>–Kessel<br />

(G–K) <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> (Gustafs<strong>on</strong> and Kessel, 1979),<br />

the sum of all normalized determinants (SAND)<br />

<str<strong>on</strong>g>algorithm</str<strong>on</strong>g> (Rouseeuw et al., 1996), the minimum<br />

<strong>scatter</strong> volume (MSV) and minimum cluster volume<br />

(MCV) <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>s (Krishnapuram and Kim,<br />

2000), the unsupervised <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> partiti<strong>on</strong>-optimal<br />

number of classes (UFP-ONC) <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> (Gath<br />

and Geva, 1989), etc. were proposed to accommodate<br />

elliptical clusters with different volumes.<br />

These <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>s are all <str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong> a within-cluster<br />

<strong>scatter</strong> <strong>matrix</strong> with a compactness measure.<br />

The c<strong>on</strong>cept of adopting a separati<strong>on</strong> measure<br />

in <str<strong>on</strong>g>clustering</str<strong>on</strong>g> is used widely in solving cluster validity<br />

problems such as the separati<strong>on</strong> coefficient proposed<br />

by Gunders<strong>on</strong> (1978), the XB index<br />

proposed by Xie and Beni (1991), the FS index proposed<br />

by Fukuyama and Sugeno (1989), the SC<br />

index proposed by Zahid et al. (1999), F HV (<str<strong>on</strong>g>fuzzy</str<strong>on</strong>g><br />

hyper-volume) and P D (partiti<strong>on</strong> density) indexes<br />

proposed by Gath and Geva (1989), etc. Özdemir<br />

and Akarun (2001) proposed an inter-cluster separati<strong>on</strong><br />

(ICS) <str<strong>on</strong>g>clustering</str<strong>on</strong>g> <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> that involves a<br />

separati<strong>on</strong> measure in the ICS objective functi<strong>on</strong>.<br />

Because the between-cluster <strong>scatter</strong> <strong>matrix</strong> trace<br />

can be interpreted as a separati<strong>on</strong> measure with a<br />

between-cluster variati<strong>on</strong>, maximizati<strong>on</strong> of the between-cluster<br />

<strong>scatter</strong> <strong>matrix</strong> trace will induce a result<br />

with well-separated clusters (Yang et al., 2003).<br />

In this paper, we propose a <str<strong>on</strong>g>novel</str<strong>on</strong>g> <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> <str<strong>on</strong>g>clustering</str<strong>on</strong>g><br />

<str<strong>on</strong>g>algorithm</str<strong>on</strong>g>, called the <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> compactness and<br />

separati<strong>on</strong> (FCS) <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>. The FCS objective<br />

functi<strong>on</strong> is <str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong> a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> <strong>scatter</strong> <strong>matrix</strong>. The<br />

FCS <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> is derived by minimizing the compactness<br />

measure and simultaneously maximizing<br />

the separati<strong>on</strong> measure (Yang et al., 2003). The<br />

compactness is measured using a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> within-cluster<br />

<strong>scatter</strong> <strong>matrix</strong>. The separati<strong>on</strong> is measured<br />

using a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> between-cluster <strong>scatter</strong> <strong>matrix</strong> trace.<br />

In k-means, data points always have crisp membership<br />

values of zero or <strong>on</strong>e. Although, FCMallows<br />

the data points to have <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> membership values<br />

between zero and <strong>on</strong>e, it does not exactly produce<br />

a zero or <strong>on</strong>e for the membership values. In the<br />

proposed FCS <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>, crisp and <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> membership<br />

values could co-exist. These FCS properties<br />

will be discussed. We will also show that, when the<br />

weighting exp<strong>on</strong>ent m is large, the FCS <str<strong>on</strong>g>algorithm</str<strong>on</strong>g><br />

is more robust to noise and outliers than FCM.<br />

The theoretical analysis <strong>on</strong> FCS will be investigated.<br />

Yu et al. (2004) gave a theoretical upper<br />

bound for the weighting exp<strong>on</strong>ent m in FCMin<br />

which the grand sample mean x is a unique optimizer<br />

of the FCMobjective functi<strong>on</strong>. In this paper,<br />

we will show that FCS with the different cluster<br />

kernel characteristic can avoid the situati<strong>on</strong> in<br />

which x is a unique optimizer of the FCS objective<br />

functi<strong>on</strong>. We also studied the optimality tests.<br />

These results will be used as the parameter selecti<strong>on</strong><br />

for FCS. The paper is organized as follows.<br />

In Secti<strong>on</strong> 2, the (crisp) <strong>scatter</strong> <strong>matrix</strong> is extended<br />

to the <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> <strong>scatter</strong> <strong>matrix</strong>. Some <str<strong>on</strong>g>clustering</str<strong>on</strong>g> <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>s<br />

<str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong> the within-<strong>scatter</strong> <strong>matrix</strong> are then<br />

reviewed. In Secti<strong>on</strong> 3, we propose the <str<strong>on</strong>g>novel</str<strong>on</strong>g> <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g><br />

<str<strong>on</strong>g>clustering</str<strong>on</strong>g> <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> <str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong> the <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> withinand<br />

between-<strong>scatter</strong> <strong>matrix</strong>. Secti<strong>on</strong> 4 gives our<br />

theoretical analysis <strong>on</strong> the optimality tests and<br />

the FCS parameter selecti<strong>on</strong>. Secti<strong>on</strong> 5 gives the<br />

robust properties of FCS <str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong> the gross error<br />

sensitivity and influence functi<strong>on</strong>. Some numerical<br />

examples are presented in Secti<strong>on</strong> 6. C<strong>on</strong>clusi<strong>on</strong>s<br />

are made in Secti<strong>on</strong> 7.<br />

2. Clustering <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>s <str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong> a within-cluster<br />

<strong>scatter</strong> <strong>matrix</strong><br />

Let X ={x 1 ,...,x n } be a data set in an s-dimensi<strong>on</strong>al<br />

Euclidean space R s and let c be a positive<br />

integer larger than <strong>on</strong>e. A partiti<strong>on</strong> of X into c<br />

clusters can be presented using mutually disjoint<br />

sets X 1 ,...,X c such that X 1 [[X c = X or

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!