Fig. 7 This figure shows 18 face prototypes, from different clusters, found after the application of our method. Each prototype corresponds to a GMM vector

Fig. 8 This figure shows five clusters from the Olivetti database found after the application of our algorithm
and 15, see Fig. 6, top) is close to the actual numbers, resp. 20 and 15. Furthermore, the probability of error (17) is close to its minimum for both Olivetti and Columbia (see Fig. 6, bottom).

5.2 Precision

A clustering method can be objectively evaluated when the ground truth is available; otherwise, the meaning of clustering can differ from one intelligent observer to another. Validity criteria are introduced in order to measure the quality of a clustering algorithm, i.e., its capacity to assign data to their actual classes. For a survey on these methods, see for example (Halkidi et al. 2002).

In the presence of a ground truth, we consider in our work a simple validity criterion based on the probability of misclassification. The latter occurs either when two examples belonging to two different classes are assigned to the same cluster, or when two elements belonging to the same class are assigned to different clusters. We denote by X and Y two random variables standing respectively for the training examples and their different possible classes {y_1, ..., y_C}, and by X', Y' two random variables distributed respectively as X, Y. We denote by f(X) the index of the cluster of X. Formally, we define the misclassification error as:

P\left( \mathbf{1}_{\{ f(X) = f(X') \}} \neq \mathbf{1}_{\{ Y = Y' \}} \right) = (*),  (16)

where:

(*) = P\left( f(X) \neq f(X') \mid Y = Y' \right) P(Y = Y') + P\left( f(X) = f(X') \mid Y \neq Y' \right) P(Y \neq Y')  (17)

Again, Fig. 6 (bottom) shows the misclassification error (17) with respect to the scale parameter σ. Tables 3 and 4 are the confusion matrices which show the distribution of 20 and 15 categories from, respectively, the Olivetti and the Columbia sets through the clusters after the application of our clustering method. We can see that this distribution is concentrated on the diagonal, and this clearly shows that most of the training data are assigned to their actual categories (see Figs. 7, 8).

5.3 Comparison

Figure 9 shows a comparison of our kernel clustering method with respect to existing state-of-the-art approaches, including fuzzy clustering (Sahbi and Boujemaa 2005), K-means (MacQueen 1965) and hierarchical clustering (Posse 2001). These results are shown for the Olivetti database. The error (17) is measured with respect to the number of clusters. Notice that the latter is fixed for hierarchical clustering and K-means, while it corresponds to the resulting number of clusters after setting σ (see Sect. 4.2) for fuzzy clustering and our method. We clearly see that the generalization error takes its smallest value when the number of clusters is close to the actual one (i.e., 20).

Fig. 9 This figure shows a comparison of the generalization error on the Olivetti set of our method and other existing state-of-the-art methods, including fuzzy, K-means and hierarchical clustering (probability of error plotted against the number of clusters)
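For completeness, the left-hand side of (16), which (17) decomposes by conditioning on class equality, can be estimated empirically by comparing the two indicator functions over all distinct pairs of training examples. The following is a minimal sketch of such an estimator; it is illustrative code and not part of the original experiments, and the function name misclassification_error and the toy arrays are our own:

import numpy as np

def misclassification_error(labels, clusters):
    # Fraction of distinct pairs (X, X') for which the indicator
    # 1{f(X) = f(X')} disagrees with the indicator 1{Y = Y'},
    # cf. Eqs. (16)-(17).
    labels = np.asarray(labels)
    clusters = np.asarray(clusters)
    same_class = labels[:, None] == labels[None, :]        # 1{Y = Y'}
    same_cluster = clusters[:, None] == clusters[None, :]  # 1{f(X) = f(X')}
    iu = np.triu_indices(len(labels), k=1)                 # distinct unordered pairs
    return float((same_class[iu] != same_cluster[iu]).mean())

# Toy usage (hypothetical data): two classes of three points each,
# with one point assigned to the wrong cluster.
y = [0, 0, 0, 1, 1, 1]   # ground-truth classes Y
f = [0, 0, 1, 1, 1, 1]   # cluster indices f(X)
print(misclassification_error(y, f))  # 5 disagreeing pairs out of 15, i.e. 0.333...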
6 Conclusion and future work

We introduced in this work an original approach for clustering based on a particular Gaussian mixture model (GMM). The method considers an objective function which acts as a regularizer and minimizes the overlap between the cluster GMMs. The GMM parameters are found by solving a quadratic programming problem using a new decomposition algorithm which considers trivial linear programming subproblems. The actual number of clusters is found by controlling the scale parameter of these GMMs; in practice, it turns out that predicting this parameter is easier than predicting the actual number of clusters, mainly for large databases living in high-dimensional spaces.

The concept presented in this paper is different from kernel regression, which might be assimilated to density estimation, while our approach performs this estimation for each cluster. Hence, the proposed approach performs clustering and density estimation at the same time.

The validity of the method is demonstrated on toy data as well as on database categorization problems. As future work, we will investigate the application of the method to handle noisy data.

References

Bach FR, Jordan MI (2003) Learning spectral clustering. In: Neural Information Processing Systems
Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2000) Support vector clustering. In: Neural Information Processing Systems, pp 367–373
Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum Press, New York