31.07.2013 Views

Theory of Locality Sensitive Hashing - SNAP - Stanford University

Theory of Locality Sensitive Hashing - SNAP - Stanford University

Theory of Locality Sensitive Hashing - SNAP - Stanford University

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Pick a random vector v, which determines a<br />

hash function h v with two buckets<br />

h v(x) = +1 if v⋅x > 0; = -1 if v⋅x < 0<br />

LS-family H = set <strong>of</strong> all functions derived<br />

from any vector<br />

Claim: For points x and y,<br />

Pr[h(x) = h(y)] = 1 – d(x,y)/180<br />

1/19/2011 Jure Leskovec, <strong>Stanford</strong> C246: Mining Massive Datasets<br />

32

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!