11.07.2015 Views

A Tutorial on Support Vector Machines for Pattern Recognition

A Tutorial on Support Vector Machines for Pattern Recognition

A Tutorial on Support Vector Machines for Pattern Recognition

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

11which i 6= 0 and computing b (note that it is numerically safer to take the mean value ofb resulting from all such equati<strong>on</strong>s).Notice that all we've d<strong>on</strong>e so far is to cast the problem into an optimizati<strong>on</strong> problemwhere the c<strong>on</strong>straints are rather more manageable than those in Eqs. (10), (11). Findingthe soluti<strong>on</strong> <strong>for</strong> real world problems will usually require numerical methods. We willhavemore to say <strong>on</strong>this later. However, let's rst work out a rare case where the problem isn<strong>on</strong>trivial (the number of dimensi<strong>on</strong>s is arbitrary, and the soluti<strong>on</strong> certainly not obvious),but where the soluti<strong>on</strong> can be found analytically.3.3. Optimal Hyperplanes: An ExampleWhile the main aim of this Secti<strong>on</strong> is to explore a n<strong>on</strong>-trivial pattern recogniti<strong>on</strong> problemwhere the support vector soluti<strong>on</strong> can be found analytically, theresults derived here willalso be useful in a later proof. For the problem c<strong>on</strong>sidered, every training point will turnout to be a support vector, which is <strong>on</strong>e reas<strong>on</strong> we can nd the soluti<strong>on</strong> analytically.C<strong>on</strong>sider n +1 symmetrically placed points lying <strong>on</strong> a sphere S n;1 of radius R: moreprecisely, the points <strong>for</strong>m the vertices of an n-dimensi<strong>on</strong>al symmetric simplex. It is c<strong>on</strong>venienttoembed the points in R n+1 in such away thattheyalllieinthehyperplane whichpasses through the origin and which is perpendicular to the (n+1)-vector (1 1 ::: 1) (in this<strong>for</strong>mulati<strong>on</strong>, the points lie <strong>on</strong> S n;1 , they span R n , and are embedded in R n+1 ). Explicitly,recalling that vectors themselves are labeled by Roman indices and their coordinates byGreek, the coordinates are given by:s rRRnx i = ;(1 ; i )n(n +1) + i(22)n +1where the Kr<strong>on</strong>ecker delta, i , is dened by i = 1 if = i, 0 otherwise. Thus, <strong>for</strong>example, the vectors <strong>for</strong> three equidistant points <strong>on</strong> the unit circle (see Figure 12) are:r2x 1 = (3 ;1p 6x 2 = ( ;1 p6x 3 = ( ;1 p6;1p6)r23 ;1p6);1p6r23 ) (23)One c<strong>on</strong>sequence of the symmetry is that the angle between any pair of vectors is thesame (and is equal to arccos(;1=n)):kx i k 2 = R 2 (24)x i x j = ;R 2 =n (25)or, more succinctly,x i x jR 2 = ij ; (1 ; ij ) 1 n : (26)Assigning a class label C 2 f+1 ;1g arbitrarily to each point, we wish to nd thathyperplane which separates the two classes with widest margin. Thus we must maximize

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!