08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

The volume <strong>of</strong> the hemisphere below the plane x 1 = √ 1<br />

d−1<br />

is a lower bound on the<br />

entire volume <strong>of</strong> the upper hemisphere and this volume is at least that <strong>of</strong> a cylinder <strong>of</strong><br />

√<br />

1<br />

height √<br />

d−1<br />

and radius 1 − 1<br />

1<br />

. The volume <strong>of</strong> the cylinder is V (d−1)(1− ) d−1<br />

2<br />

d−1 d−1<br />

Using the fact that (1 − x) a ≥ 1 − ax for a ≥ 1, the volume <strong>of</strong> the cylinder is at least<br />

V (d−1)<br />

2 √ d−1<br />

for d ≥ 3.<br />

√ 1<br />

d−1<br />

.<br />

Thus,<br />

ratio ≤<br />

V (d−1)<br />

upper bound above plane<br />

lower bound total hemisphere =<br />

c √ d−1 e− c2 2<br />

V (d−1)<br />

2 √ d−1<br />

= 2 c e− c2 2<br />

One might ask why we computed a lower bound on the total hemisphere since it is one<br />

half <strong>of</strong> the volume <strong>of</strong> the unit ball which we already know. The reason is that the volume<br />

<strong>of</strong> the upper hemisphere is 1 V (d) and we need a formula with V (d − 1) in it to cancel the<br />

2<br />

V (d − 1) in the numerator.<br />

Near orthogonality. One immediate implication <strong>of</strong> the above analysis is that if we<br />

draw two points at random from the unit ball, with high probability their vectors will be<br />

nearly orthogonal to each other. Specifically, from our previous analysis in Section 2.3,<br />

with high probability both will have length 1 − O(1/d). From our analysis above, if we<br />

define the vector in the direction <strong>of</strong> the first point as “north”, with high probability the<br />

second will have a projection <strong>of</strong> only ±O(1/ √ d) in this direction. This implies that with<br />

high probability, the angle between the two vectors will be π/2 ± O(1/ √ d). In particular,<br />

we have the theorem:<br />

Theorem 2.8 Consider drawing n points x 1 , x 2 , . . . , x n at random from the unit ball.<br />

With probability 1 − O(1/n)<br />

1. |x i | ≥ 1 − 2 ln n<br />

d<br />

for all i, and<br />

2. |x i · x j | ≤ √ 6 ln n √<br />

d−1<br />

for all i ≠ j.<br />

Pro<strong>of</strong>: For the first part, for any fixed i, by the analysis <strong>of</strong> Section 2.3 Prob ( |x i | <<br />

)<br />

1 − 2 ln n<br />

d ≤ e<br />

−( 2 ln n<br />

d )d = 1/n 2 . So, by the union bound, the probability there exists i such<br />

that |x i | < 1 − 2 ln n is at most 1/n. For the second part, there are ( n<br />

d<br />

2)<br />

pairs i and j, and<br />

for each such pair, if we define x i as “north”, the probability that the projection <strong>of</strong> x j onto<br />

the “North” direction is more than √ √6 ln n<br />

d−1<br />

(a necessary condition for the dot-product to<br />

be large) is at most O(e − 6 ln n<br />

2 ) = O(n −3 ) by Theorem 2.7. Thus, this condition is violated<br />

with probability at most O (( n<br />

2) ) n<br />

−3<br />

= O(1/n) as well.<br />

19

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!