08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Exercise 8.9 Consider clustering points in the plane according to the k-median criterion,<br />

where cluster centers are required to be data points. Enumerate all possible clustering’s<br />

and select the one with the minimum cost. The number <strong>of</strong> possible ways <strong>of</strong> labeling n<br />

points, each with a label from {1, 2, . . . , k} is k n which is prohibitive. Show that we can<br />

find the optimal clustering in time at most a constant times ( )<br />

n<br />

k + k 2 . Note that ( )<br />

n<br />

k ≤ n<br />

k<br />

which is much smaller than k n when k

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!