24.04.2015 Views

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

A novel fuzzy clustering algorithm based on a fuzzy scatter matrix ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

K.-L. Wu et al. / Pattern Recogniti<strong>on</strong> Letters 26 (2005) 639–652 651<br />

0.5<br />

m=1.5<br />

0.12<br />

m=2<br />

NFI<br />

0.4<br />

PIM<br />

NFI<br />

0.10<br />

0.08<br />

0.06<br />

FCS<br />

0.04<br />

0.3<br />

FCS<br />

0.02<br />

PIM<br />

0.00<br />

0 0.05 0.1 0.15 0.2 0.5 0.99<br />

beta, delta<br />

(a)<br />

0 0.05 0.1 0.15 0.2 0.5 0.99<br />

beta, delta<br />

(b)<br />

Fig. 8. NFI(11) values for the normalized Vowel data set in which both PIMand FCS <str<strong>on</strong>g>algorithm</str<strong>on</strong>g>s are processed with the same<br />

parameter values.<br />

nels can avoid the situati<strong>on</strong> in which the sample<br />

mean is a unique optimizer of the FCS objective<br />

functi<strong>on</strong>. Fig. 7 presents the NFI (2) values of<br />

the data set shown in Fig. 5. The NFI values of<br />

FCS with cluster kernels (b > 0) are always larger<br />

than the NFI values of the FCM(b = 0) which is<br />

the case of the sample mean x being the unique<br />

optimizer with NFI = 0 when m = 10 and 20. This<br />

shows that the FCS <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> can avoid the case<br />

of NFI = 0 and is robust to the noise and outliers<br />

than FCMwhen m is larger. Because the sample<br />

mean x of the data set shown in Fig. 6 will not<br />

be the unique optimizer of FCMand FCS when<br />

m is larger, we do not show their NFI values.<br />

Note that some properties of FCS discussed<br />

above can also be achieved by the partiti<strong>on</strong> index<br />

maximizati<strong>on</strong> (PIM) <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> (Özdemir and<br />

Akarun, 2002) which used a fixed volume for all<br />

cluster kernels. The radius of each cluster volume<br />

in PIMis defined by<br />

a ¼ d minfmin ka i a 0<br />

i6¼i 0 ik=2g; 0 6 d 6 1: ð34Þ<br />

The NFI values of the normalized Vowel data set<br />

in the UCI Machine Learning Repository (Blake<br />

and Merz, 1998) of PIMand FCS are shown in<br />

Fig. 8. Yu et al. (2004) showed that when<br />

m > 1.7787, the sample mean x will be the unique<br />

optimizer of FCMfor the normalized Vowel data<br />

set in Blake and Merz (1998). InFig. 8(a), when<br />

m = 1.5, both PIMand FCS with different d and<br />

b values have the NFI index values larger than<br />

0.3. However, when m = 2 as shown in Fig. 8(b),<br />

the PIMgive the same NFI values as FCM<br />

(d =0 or b = 0). The use of the same volumes of<br />

the cluster kernels do not help PIMto have a larger<br />

NFI values than FCM. The same situati<strong>on</strong><br />

when m = 2 in FCS as shown in Fig. 8(b), the<br />

NFI values of FCS are always larger than FCM<br />

and PIM. Using the different cluster kernel volumes<br />

in FCS produces these good merits.<br />

7. C<strong>on</strong>clusi<strong>on</strong>s<br />

We proposed a <str<strong>on</strong>g>novel</str<strong>on</strong>g> <str<strong>on</strong>g>clustering</str<strong>on</strong>g> <str<strong>on</strong>g>algorithm</str<strong>on</strong>g><br />

called the FCS <str<strong>on</strong>g>algorithm</str<strong>on</strong>g> which attempts to minimize<br />

the <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> within-cluster <strong>scatter</strong> <strong>matrix</strong> trace<br />

and simultaneously maximize the <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> betweencluster<br />

<strong>scatter</strong> <strong>matrix</strong> trace. Each cluster obtained<br />

by the FCS will have a cluster kernel. Data points<br />

that fall inside any <strong>on</strong>e of the c cluster kernels will<br />

have crisp memberships and be outside all of the<br />

cluster kernels that have <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> memberships.<br />

The volume of each cluster kernel is decided by<br />

the parameter g i which is a functi<strong>on</strong> of b. The crisp<br />

and <str<strong>on</strong>g>fuzzy</str<strong>on</strong>g> memberships co-exist in the FCS. The<br />

cluster center update equati<strong>on</strong>s in the FCS can<br />

be interpreted as a weighted mean of the FCM<br />

cluster centers and the grand mean x. Numerical<br />

examples show that the FCS can have more accurate<br />

results in the parameter estimati<strong>on</strong> than the<br />

FCM. It also shows that FCS can help avoid the<br />

situati<strong>on</strong> where the sample mean x is a unique<br />

optimizer of FCMand is more robust to noise

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!