08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

etween vertices in different blocks is equivalent to finding a vector x with coordinates<br />

±1 <strong>of</strong> which half <strong>of</strong> its coordinates are +1 and half <strong>of</strong> which are –1 that minimizes<br />

E cut = 1 4<br />

∑<br />

(i,j)∈E<br />

(x i − x j ) 2<br />

Let A be the adjacency matrix <strong>of</strong> G. Then<br />

x T Ax = ∑ a ij x i x j = 2 ∑ x i x j<br />

(<br />

ij<br />

edges<br />

) ( number <strong>of</strong> edges<br />

number <strong>of</strong> edges<br />

= 2 ×<br />

− 2 ×<br />

(<br />

within components<br />

) (<br />

between components<br />

)<br />

total number<br />

number <strong>of</strong> edges<br />

= 2 ×<br />

− 4 ×<br />

<strong>of</strong> edges<br />

between components<br />

)<br />

Maximizing x T Ax over all x whose coordinates are ±1 and half <strong>of</strong> whose coordinates are<br />

+1 is equivalent to minimizing the number <strong>of</strong> edges between components.<br />

Since finding such an x is computational difficult, replace the integer condition on the<br />

components <strong>of</strong> x and the condition that half <strong>of</strong> the components are positive and half <strong>of</strong> the<br />

∑<br />

components are negative with the conditions n ∑<br />

x 2 i = 1 and n x i = 0. Then finding the<br />

optimal x gives us the second eigenvalue since it is easy to see that the first eigenvector<br />

Is along 1<br />

x T Ax<br />

λ 2 = max ∑<br />

x⊥v 1 x<br />

2<br />

i<br />

∑<br />

Actually we should use<br />

n n∑<br />

x 2 i = n not x 2 i = 1. Thus nλ 2 must be greater than<br />

( ) i=1 ( i=1 )<br />

total number<br />

number <strong>of</strong> edges<br />

2 ×<br />

− 4 ×<br />

since the maximum is taken over<br />

<strong>of</strong> edges<br />

between components<br />

a larger set <strong>of</strong> x. The fact that λ 2 gives us a bound on the minimum number <strong>of</strong> cross<br />

edges is what makes it so important.<br />

12.7.8 Distance between subspaces<br />

Suppose S 1 and S 2 are two subspaces. Choose a basis <strong>of</strong> S 1 and arrange the basis<br />

vectors as the columns <strong>of</strong> a matrix X 1 ; similarly choose a basis <strong>of</strong> S 2 and arrange the<br />

basis vectors as the columns <strong>of</strong> a matrix X 2 . Note that S 1 and S 2 can have different<br />

dimensions. Define the square <strong>of</strong> the distance between two subspaces by<br />

i=1<br />

i=1<br />

dist 2 (S 1 , S 2 ) = dist 2 (X 1 , X 2 ) = ||X 1 − X 2 X T 2 X 1 || 2 F<br />

Since X 1 − X 2 X T 2 X 1 and X 2 X T 2 X 1 are orthogonal<br />

‖X 1 ‖ 2 F = ∥ ∥ X1 − X 2 X T 2 X 1<br />

∥ ∥<br />

2<br />

F + ∥ ∥ X2 X T 2 X 1<br />

∥ ∥<br />

2<br />

F<br />

417

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!