08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

integral gives<br />

∫ ∞<br />

0<br />

e −r2 r d−1 dr =<br />

∫ ∞<br />

0<br />

e −t t d−1<br />

2<br />

(<br />

1<br />

2 t− 1 2 dt<br />

)<br />

= 1 2<br />

∫ ∞<br />

0<br />

e −t t d 2 − 1 dt = 1 2 Γ ( d<br />

2<br />

and hence, I(d) = A(d) 1 2 Γ ( d<br />

2)<br />

where the Gamma function Γ (x) is a generalization <strong>of</strong> the<br />

factorial function for noninteger values <strong>of</strong> x. Γ (x) = (x − 1) Γ (x − 1), Γ (1) = Γ (2) = 1,<br />

and Γ ( 1<br />

2)<br />

=<br />

√ π. For integer x, Γ (x) = (x − 1)!.<br />

Combining I (d) = π d 2 with I (d) = A (d) 1 2 Γ ( d<br />

2)<br />

yields<br />

)<br />

establishing the following lemma.<br />

A (d) = π d 2<br />

1<br />

Γ ( )<br />

d<br />

2 2<br />

Lemma 2.6 The surface area A(d) and the volume V (d) <strong>of</strong> a unit-radius ball in d dimensions<br />

are given by<br />

A (d) = 2π d 2<br />

Γ ( ) and V (d) = 2 d<br />

d<br />

2<br />

π d 2<br />

Γ ( d<br />

2<br />

To check the formula for the volume <strong>of</strong> a unit ball, note that V (2) = π and V (3) =<br />

2 π 3 2<br />

3 Γ( 3 2) = 4 π, which are the correct volumes for the unit balls in two and three dimensions.<br />

To check the formula for the surface area <strong>of</strong> a unit ball, note that A(2) = 2π<br />

3<br />

and<br />

A(3) = 2π 3 2<br />

1 √<br />

2 π<br />

= 4π, which are the correct surface areas for the unit ball in two and three<br />

dimensions. Note that π d 2 is an exponential in d and Γ ( )<br />

d<br />

2 2 grows as the factorial <strong>of</strong><br />

d<br />

. 2<br />

This implies that lim V (d) = 0, as claimed.<br />

d→∞<br />

).<br />

2.4.2 Most <strong>of</strong> the Volume is Near the Equator<br />

An interesting fact about the unit ball in high dimensions is that most <strong>of</strong> its volume is<br />

concentrated near its equator no matter what direction one uses to define the North Pole<br />

and hence the “equator”. Arbitrarily letting x 1 denote “north”, most <strong>of</strong> the volume <strong>of</strong> the<br />

unit ball has |x 1 | = O(1/ √ d). Using this fact, we will show that two random points in the<br />

unit ball are with high probability nearly orthogonal, and also give an alternative pro<strong>of</strong><br />

from the one in Section 2.4.1 that the volume <strong>of</strong> the unit ball goes to zero as d → ∞.<br />

Theorem 2.7 For c ≥ 1 and d ≥ 3, at least a 1 − 2 /2 fraction <strong>of</strong> the volume <strong>of</strong> the<br />

c e−c2<br />

d-dimensional unit ball has |x 1 | ≤ √ c<br />

d−1<br />

.<br />

17

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!