08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Exercise 8.24 The Gaussian kernel maps points to a higher dimensional space. What is<br />

this mapping?<br />

Exercise 8.25 Agglomerative clustering requires that one calculate the distances between<br />

all pairs <strong>of</strong> points. If the number <strong>of</strong> points is a million or more, then this is impractical.<br />

One might try speeding up the agglomerative clustering algorithm by maintaining a 100<br />

clusters at each unit <strong>of</strong> time. Start by randomly selecting a hundred points and place each<br />

point in a cluster by itself. Each time a pair <strong>of</strong> clusters is merged randomly select one <strong>of</strong><br />

the remaining data points and create a new cluster containing that point. Suggest some<br />

other alternatives.<br />

Exercise 8.26 Let A be the adjacency matrix <strong>of</strong> an undirected graph. Let d(S, S) = A(S,S)<br />

|S|<br />

be the density <strong>of</strong> the subgraph induced by the set <strong>of</strong> vertices S. Prove that d (S, S) is the<br />

average degree <strong>of</strong> a vertex in S. Recall that A(S, T ) = ∑<br />

a ij<br />

i∈S,j∈T<br />

Exercise 8.27 Suppose A is a matrix with non negative entries. Show that A(S, T )/(|S||T |)<br />

is maximized by the single edge with highest a ij . Recall that A(S, T ) = ∑<br />

Exercise 8.28 Suppose A is a matrix with non negative entries and<br />

a ij<br />

i∈S,j∈T<br />

σ 1 (A) = x T Ay = ∑ i,j<br />

x i a ij y j , |x| = |y| = 1.<br />

Zero out all x i less than 1/2 √ n and all y j less than 1/2 √ d. Show that the loss is no more<br />

than 1 / 4 th <strong>of</strong> σ 1 (A).<br />

Exercise 8.29 Consider other measures <strong>of</strong> density such as A(S,T ) for different values <strong>of</strong><br />

|S| ρ |T | ρ<br />

ρ. Discuss the significance <strong>of</strong> the densest subgraph according to these measures.<br />

Exercise 8.30 Let A be the adjacency matrix <strong>of</strong> an undirected graph. Let M be the<br />

matrix whose ij th element is a ij − d id j<br />

. Partition the vertices into two groups S and ¯S.<br />

2m<br />

Let s be the indicator vector for the set S and let ¯s be the indicator variable for ¯S. Then<br />

s T Ms is the number <strong>of</strong> edges in S above the expected number given the degree distribution<br />

and s T M ¯s is the number <strong>of</strong> edges from S to ¯S above the expected number given the degree<br />

distribution. Prove that if s T Ms is positive s T M ¯s must be negative.<br />

Exercise 8.31 Which <strong>of</strong> the three axioms, scale invariance, richness, and consistency<br />

are satisfied by the following clustering algorithms.<br />

1. k-means<br />

2. Spectral Clustering.<br />

Exercise 8.32 (Research Problem): What are good measures <strong>of</strong> density that are also<br />

effectively computable? Is there empirical/theoretical evidence that some are better than<br />

others?<br />

298

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!