08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Pro<strong>of</strong>: Apply Theorem 4.1 with α = ε √ np to get that the probability that an individual<br />

vertex has degree outside the range [(1 − ε)np, (1 + ε)np] is at most 3e −ε2 np/8 . By the<br />

union bound, the probability that some vertex has degree outside this range is at most<br />

3ne −ε2 np/8 . For this to be o(1), it suffices for p to be Ω( ln n<br />

nε 2 ). Hence the Corollary.<br />

Note that the assumption p is Ω( ln n ) is necessary. If p = d/n for d a constant, for<br />

nε 2<br />

instance, then some vertices may well have degrees outside the range [(1 − ε)d, (1 + ε)d].<br />

Indeed, shortly we will see that it is highly likely that for p = 1 there is a vertex <strong>of</strong> degree<br />

n<br />

Ω(log n/ log log n).<br />

When p is a constant, the expected degree <strong>of</strong> vertices in G (n, p) increases with n. For<br />

example, in G ( n, 2) 1 , the expected degree <strong>of</strong> a vertex is n/2. In many real applications,<br />

we will be concerned with G (n, p) where p = d/n, for d a constant, i.e., graphs whose<br />

expected degree is a constant d independent <strong>of</strong> n. Holding d = np constant as n goes to<br />

infinity, the binomial distribution<br />

( n<br />

Prob (k) = p<br />

k)<br />

k (1 − p) n−k<br />

approaches the Poisson distribution<br />

Prob(k) = (np)k e −np = dk<br />

k! k! e−d .<br />

To see this, assume k = o(n) and use the approximations n − k ∼ = n, ( )<br />

n ∼= n k<br />

, and<br />

k k!<br />

(<br />

1 −<br />

d n−k<br />

∼<br />

n)<br />

= e −d to approximate the binomial distribution by<br />

( ( ) k n<br />

lim<br />

)p k (1 − p) n−k = nk d<br />

e −d = dk<br />

n→∞ k<br />

k! n k! e−d .<br />

Note that for p = d , where d is a constant independent <strong>of</strong> n, the probability <strong>of</strong> the<br />

n<br />

binomial distribution falls <strong>of</strong>f rapidly for k > d, and is essentially zero for all but some<br />

finite number <strong>of</strong> values <strong>of</strong> k. This justifies the k = o(n) assumption. Thus, the Poisson<br />

distribution is a good approximation.<br />

Example: In G(n, 1 ) many vertices are <strong>of</strong> degree one, but not all. Some are <strong>of</strong> degree<br />

n<br />

zero and some are <strong>of</strong> degree greater than one. In fact, it is highly likely that there is a<br />

vertex <strong>of</strong> degree Ω(log n/ log log n). The probability that a given vertex is <strong>of</strong> degree k is<br />

If k = log n/ log log n,<br />

( ) ( k ( n 1<br />

Prob (k) =<br />

1 −<br />

k n) 1 ) n−k<br />

≈ e−1<br />

n k! .<br />

log k k = k log k =<br />

log n (log log n − log log log n) ≤ log n<br />

log log n<br />

76

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!