08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

1<br />

Prob(x > 0)<br />

0<br />

1 1<br />

n 1+ɛ n log n<br />

1<br />

n<br />

log n<br />

n<br />

1<br />

2<br />

0.6<br />

n<br />

0.8<br />

n<br />

1<br />

n<br />

1.2<br />

n<br />

1.4<br />

n<br />

1−o(1)<br />

n<br />

1<br />

n<br />

1+o(1)<br />

n<br />

(a) (b) (c)<br />

Figure 4.5: Figure 4.5(a) shows a phase transition at p = 1 . The dotted line shows an<br />

n<br />

abrupt transition in Prob(x) from 0 to 1. For any function asymptotically less than 1 , n<br />

Prob(x)>0 is zero and for any function asymptotically greater than 1 , Prob(x)>0 is one.<br />

n<br />

Figure 4.5(b) expands the scale and shows a less abrupt change in probability unless<br />

the phase transition is sharp as illustrated by the dotted line. Figure 4.5(c) is a further<br />

expansion and the sharp transition is now more smooth.<br />

p<br />

have the property, and when lim 2 (n)<br />

= ∞, G (n, p<br />

n→∞ p(n)<br />

2 (n)) almost surely has the property,<br />

then we say that a phase transition occurs, and p (n) is the threshold. Recall that G(n, p)<br />

“almost surely does not have the property” means that the probability that it has the<br />

property goes to zero in the limit, as n goes to infinity. We shall soon see that every<br />

increasing property has a threshold. This is true not only for increasing properties <strong>of</strong><br />

G (n, p), but for increasing properties <strong>of</strong> any combinatorial structure. If for cp (n), c < 1,<br />

the graph almost surely does not have the property and for cp (n) , c > 1, the graph<br />

almost surely has the property, then p (n) is a sharp threshold. The existence <strong>of</strong> a giant<br />

component has a sharp threshold at 1/n. We will prove this later.<br />

In establishing phase transitions, we <strong>of</strong>ten use a variable x(n) to denote the number<br />

<strong>of</strong> occurrences <strong>of</strong> an item in a random graph. If the expected value <strong>of</strong> x(n) goes to zero as<br />

n goes to infinity, then a graph picked at random almost surely has no occurrence <strong>of</strong> the<br />

item. This follows from Markov’s inequality. Since x is a nonnegative random variable<br />

Prob(x ≥ a) ≤ 1 E(x), which implies that the probability <strong>of</strong> x(n) ≥ 1 is at most E(x(n)).<br />

a<br />

That is, if the expected number <strong>of</strong> occurrences <strong>of</strong> an item in a graph goes to zero, the<br />

probability that there are one or more occurrences <strong>of</strong> the item in a randomly selected<br />

graph goes to zero. This is called the first moment method.<br />

The previous section showed that the property <strong>of</strong> having a triangle has a threshold at<br />

p(n) = 1/n. If the edge probability p 1 (n) is o(1/n), then the expected number <strong>of</strong> triangles<br />

goes to zero and by the first moment method, the graph almost surely has no triangle.<br />

However, if the edge probability p 2 (n) satisfies p 2(n)<br />

→ ∞, then from (4.1), the probability<br />

1/n<br />

<strong>of</strong> having no triangle is at most 6/d 3 +o(1) = 6/(np 2 (n)) 3 +o(1), which goes to zero. This<br />

80

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!