08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3k/2<br />

k/2<br />

u<br />

k<br />

t<br />

Ω(k 2 ) vertices at<br />

distance k/2 from t<br />

Figure 4.19: Small worlds.<br />

distance √ n <strong>of</strong> the destination, for at least half <strong>of</strong> the starting points the path length will<br />

be at least √ n. Thus, the expected time is at least 1 2√ n and hence not in polylog time.<br />

For the general r < 2 case, we show that a local algorithm cannot find paths <strong>of</strong> length<br />

O(n (2−r)/4 ). Let δ = (2 − r)/4 and suppose the algorithm finds a path with at most n δ<br />

edges. There must be a long-distance edge on the path which terminates within distance<br />

n δ <strong>of</strong> t; otherwise, the path would end in n δ grid edges and would be too long. There are<br />

O(n 2δ ) vertices within distance n δ <strong>of</strong> t and the probability that the long distance edge from<br />

one vertex <strong>of</strong> the path ends at one <strong>of</strong> these vertices is at most n 2δ ( 1<br />

n 2−r )<br />

= n (r−2)/2 . To<br />

see this, recall that the lower bound on the normalizing constant is θ(n 2−r ) and hence an<br />

upper bound on the probability <strong>of</strong> a long distance edge hitting v is θ ( )<br />

1<br />

n independent<br />

2−r<br />

<strong>of</strong> where v is. Thus, the probability that the long distance edge from one <strong>of</strong> the n δ vertices<br />

on the path hits any one <strong>of</strong> the n 2δ vertices within distance n δ <strong>of</strong> t is n 2δ 1 = n r−2<br />

n 2−r<br />

2 .<br />

The probability that this happens for any one <strong>of</strong> the n δ vertices on the path is at most<br />

n r−2<br />

2 n δ = n r−2<br />

2 n 2−r<br />

4 = n (r−2)/4 = o(1) as claimed.<br />

Short paths exist for r < 2<br />

Finally we show for r < 2 that there are O(ln n) length paths between s and t. The<br />

pro<strong>of</strong> is similar to the pro<strong>of</strong> <strong>of</strong> Theorem 4.17 showing O(ln n) diameter for G(n, p) when<br />

p is Ω(ln n/n), so we do not give all the details here. We give the pro<strong>of</strong> only for the case<br />

when r = 0.<br />

For a particular vertex v, let S i denote the set <strong>of</strong> vertices at distance i from v. Using<br />

only local edges, if i is O( √ ln n), then |S i | is Ω(ln n). For later i, we argue a constant<br />

factor growth in the size <strong>of</strong> S i as in Theorem 4.17. As long as |S 1 |+|S 2 |+· · ·+|S i | ≤ n 2 /2,<br />

for each <strong>of</strong> the n 2 /2 or more vertices outside, the probability that the vertex is not in<br />

128

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!