08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

A clustering algorithm is consistent if increasing the distance between points in different<br />

clusters and reducing the distance between points in the same cluster does not change<br />

the clusters produced by the clustering algorithm.<br />

We now show that no clustering algorithm A can satisfy all three <strong>of</strong> scale invariance,<br />

richness, and consistency. 34<br />

Theorem 8.13 No algorithm A can satisfy all three <strong>of</strong> scale invariance, richness, and<br />

consistency.<br />

Pro<strong>of</strong>: Let’s begin with the simple case <strong>of</strong> just two points, S = {a, b}. By richness there<br />

must be some distance d(a, b) such that A produces two clusters and some other distance<br />

d ′ (a, b) such that A produces one cluster. But this then violates scale-invariance.<br />

Let us now turn to the general case <strong>of</strong> n points, S = {a 1 , . . . , a n }. By richness, there<br />

must exist some distance function d such that A puts all <strong>of</strong> S into a single cluster, and<br />

some other distance function d ′ such that A puts each point <strong>of</strong> S into its own cluster.<br />

Let ɛ be the minimum distance between points in d, and let ∆ be the maximum distance<br />

between points in d ′ . Define d ′′ = αd ′ for α = ɛ/∆; i.e., uniformly shrink distances in d ′<br />

until they are all less than or equal to the minimum distance in d. By scale invariance,<br />

A(d ′′ ) = A(d ′ ), so under d ′′ , A puts each point <strong>of</strong> S into its own cluster. However, note<br />

that for each pair <strong>of</strong> points a i , a j we have d ′′ (a i , a j ) ≤ d(a i , a j ). This means we can reach<br />

d ′′ from d by just reducing distances between points in the same cluster (since all points<br />

are in the same cluster under d). So, the fact that A behaves differently on d ′′ and d<br />

violates consistency.<br />

8.12.2 Satisfying two <strong>of</strong> three<br />

There exist natural clustering algorithms satisfying any two <strong>of</strong> the three axioms. For<br />

example, different versions <strong>of</strong> the single linkage algorithm described in Section 8.7 satisfy<br />

different two <strong>of</strong> the three conditions.<br />

Theorem 8.14<br />

1. The single linkage clustering algorithm with the k-cluster stopping condition (stop<br />

when there are k clusters), satisfies scale-invariance and consistency. We do not<br />

get richness since we only get clusterings with k clusters.<br />

2. The single linkage clustering algorithm with scale α stopping condition satisfies scale<br />

invariance and richness. The scale α stopping condition is to stop when the closest<br />

pair <strong>of</strong> clusters is <strong>of</strong> distance greater than or equal to αd max where d max is the maximum<br />

pair wise distance. Here we do not get consistency. If we select one distance<br />

34 A technical point here: we do not allow d to have distance 0 between two distinct points <strong>of</strong> S. Else,<br />

a simple algorithm that satisfies all three properties is simply “place two points into the same cluster if<br />

they have distance 0, else place them into different clusters”.<br />

291

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!