24.04.2015 Views

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

642 K.-L. Wu et al. / Pattern Recogniti<strong>on</strong> Letters 26 (2005) 639–652<br />

According to Eq. (5), we have P n<br />

j¼1 lm ij a i ¼<br />

P n<br />

j¼1 lm ij x j. Thus, we can create the property that<br />

S FT = S FW + S FB where a i is defined by Eq. (5).<br />

This property is exactly the same as S T = S W + S B<br />

for the (crisp) <strong>scatter</strong> <strong>matrix</strong>.<br />

Fuzzy <str<strong>on</strong>g>clustering</str<strong>on</strong>g>s including FCM(Bezdek,<br />

1981), alternative FCM(Wu and Yang, 2002;<br />

Yang et al., 2002), G–K (Gustafs<strong>on</strong> and Kessel,<br />

1979), SAND (Rouseeuw et al., 1996), MCV<br />

(Krishnapuram and Kim, 2000), and UFP-ONC<br />

(Gath and Geva, 1989), etc., are all <str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong><br />

the <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> within-cluster <strong>scatter</strong> <strong>matrix</strong> S FW .<br />

It is known that the FCM<str<strong>on</strong>g>clustering</str<strong>on</strong>g> <str<strong>on</strong>g>algorithm</str<strong>on</strong>g><br />

is created by minimizing the objective<br />

functi<strong>on</strong><br />

J FCM ¼ trðS FW Þ¼ Xc<br />

i¼1<br />

X n<br />

j¼1<br />

with the membership update equati<strong>on</strong><br />

l ij ¼ ðkx j a i k 2 Þ 1<br />

m 1<br />

P c<br />

k¼1 ðkx j a k k 2 Þ 1<br />

m 1<br />

l m ij kx j a i k 2 ; ð9Þ<br />

ð10Þ<br />

and the cluster center update Eq. (5). Although<br />

tr(S FT ) = tr(S FW ) + tr(S FB ) for a given data set X,<br />

tr(S FT ) is not a fixed c<strong>on</strong>stant but depends <strong>on</strong> l ij .<br />

Thus, to minimize tr(S FW ), it is not necessary to<br />

maximize tr(S FB ). The trace tr(S FB ) of a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> set<br />

between the cluster <strong>scatter</strong> <strong>matrix</strong> can be interpreted<br />

as a separati<strong>on</strong> with a cluster variati<strong>on</strong> in<br />

between. A maximum value of tr(S FB ) will induce<br />

a <str<strong>on</strong>g>clustering</str<strong>on</strong>g> result with separated (distinguishable)<br />

clusters. In the next secti<strong>on</strong>, an <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> that c<strong>on</strong>siders<br />

tr(S FW ) and tr(S FB ) simultaneously is<br />

introduced.<br />

3. A proposed <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> <str<strong>on</strong>g>clustering</str<strong>on</strong>g> <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> <str<strong>on</strong>g>based</str<strong>on</strong>g> <strong>on</strong><br />

tr(S FW ) and tr(S FB )<br />

Fukuyama and Sugeno (1989), Sugeno et al.<br />

(1993) had used tr(S FW ) and tr(S FB ) to create an<br />

index FS(c) in the cluster validity problem where<br />

Pal and Bezdek (1995) gave more discussi<strong>on</strong>s in<br />

cluster validity for FCM. The validity index<br />

FS(c) was formed with<br />

FSðcÞ ¼trðS FW Þ<br />

¼ Xc<br />

i¼1<br />

X c<br />

i¼1<br />

X n<br />

j¼1<br />

X n<br />

trðS FB Þ<br />

l m ij kx j a i k 2<br />

j¼1<br />

l m ij ka i xk 2 : ð11Þ<br />

A small FS(c) value will induce a good <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g><br />

<str<strong>on</strong>g>clustering</str<strong>on</strong>g> result with a small <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> within-cluster<br />

variati<strong>on</strong> tr(S FW ) and a large <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> between-cluster<br />

variati<strong>on</strong> tr(S FB ). This will help us find a good<br />

cluster number estimate. It is reas<strong>on</strong>able to have<br />

a <str<strong>on</strong>g>clustering</str<strong>on</strong>g> objective functi<strong>on</strong> c<strong>on</strong>taining a measure<br />

of within- and between-cluster variati<strong>on</strong>s such<br />

as the FS(c) index. However, there does not exist<br />

an update equati<strong>on</strong> for a i by differentiating FS(c)<br />

with respect to a i . Thus, FS(c) cannot be a <str<strong>on</strong>g>clustering</str<strong>on</strong>g><br />

objective functi<strong>on</strong>.<br />

In this secti<strong>on</strong>, we propose a <str<strong>on</strong>g>novel</str<strong>on</strong>g> <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> <str<strong>on</strong>g>clustering</str<strong>on</strong>g><br />

objective functi<strong>on</strong> which is a modificati<strong>on</strong><br />

of the FS(c) index. This could be also a generalizati<strong>on</strong><br />

of the FCMobjective functi<strong>on</strong> by combining<br />

<str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> within- and between-cluster variati<strong>on</strong>s. Our<br />

goal is to minimize a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> within-cluster variati<strong>on</strong><br />

tr(S FW ) and also simultaneously maximize a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g><br />

between-cluster variati<strong>on</strong> tr(S FB ). We call this a<br />

<str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> compactness and separati<strong>on</strong> (FCS) <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>,<br />

because the compactness is measured using<br />

a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> within variati<strong>on</strong> and the separati<strong>on</strong> is<br />

measured using a <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> between variati<strong>on</strong>. Thus,<br />

the FCS objective functi<strong>on</strong> J FCS is defined as<br />

J FCS ¼ Xc X n<br />

l m ij kx j a i k 2<br />

i¼1 j¼1<br />

X c X n<br />

i¼1<br />

j¼1<br />

g i l m ij ka i xk 2 ; ð12Þ<br />

where g i P 0. Note that, J FCS = J FCM when g i =0<br />

and J FCS = FS(c) when g i = 1. By minimizing J FCS<br />

we have the following update equati<strong>on</strong>s:<br />

ðkx j a i k 2 g<br />

l ij ¼<br />

i ka i xk 2 m<br />

Þ 1<br />

1<br />

P c<br />

k¼1 ðkx ð13Þ<br />

j a k k 2 g k ka k xk 2 m<br />

Þ 1<br />

1<br />

and<br />

P n<br />

j¼1<br />

a i ¼<br />

lm ij x j<br />

P n<br />

j¼1 lm ij<br />

P n<br />

g i j¼1 lm ij<br />

P x<br />

g n ; ð14Þ<br />

i j¼1 lm ij

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!