31.07.2013 Views

Theory of Locality Sensitive Hashing - SNAP - Stanford University

Theory of Locality Sensitive Hashing - SNAP - Stanford University

Theory of Locality Sensitive Hashing - SNAP - Stanford University

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

If points are distance d < a/2, prob. they are in same<br />

bucket ≥ 1- d/a = ½<br />

If points are distance > 2a apart, then they can be in<br />

the same bucket only if d cos θ ≤ a<br />

cos θ ≤ ½<br />

60 < θ < 90<br />

i.e., at most 1/3 probability.<br />

Yields a (a/2, 2a, 1/2, 1/3)-sensitive family <strong>of</strong> hash<br />

functions for any a<br />

Amplify using AND-OR cascades<br />

1/19/2011 Jure Leskovec, <strong>Stanford</strong> C246: Mining Massive Datasets<br />

39

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!