08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

and thus k k ≤ n. Since k! ≤ k k ≤ n, the probability that a vertex has degree k =<br />

log n/ log log n is at least 1 k! e−1 ≥ 1 . If the degrees <strong>of</strong> vertices were independent random<br />

en<br />

variables, then this would be enough to argue that there would be a vertex <strong>of</strong> degree<br />

log n/ log log n with probability at least 1 − ( 1 − 1<br />

en) n<br />

= 1 − e<br />

− 1 e ∼ = 0.31. But the degrees<br />

are not quite independent since when an edge is added to the graph it affects the degree<br />

<strong>of</strong> two vertices. This is a minor technical point, which one can get around.<br />

4.1.2 Existence <strong>of</strong> Triangles in G(n, d/n)<br />

What is the expected number <strong>of</strong> triangles in G ( n, d n)<br />

, when d is a constant? As the<br />

number <strong>of</strong> vertices increases one might expect the number <strong>of</strong> triangles to increase, but this<br />

is not the case. Although the number <strong>of</strong> triples <strong>of</strong> vertices grows as n 3 , the probability<br />

<strong>of</strong> an edge between two specific vertices decreases linearly with n. Thus, the probability<br />

<strong>of</strong> all three edges between the pairs <strong>of</strong> vertices in a triple <strong>of</strong> vertices being present goes<br />

down as n −3 , exactly canceling the rate <strong>of</strong> growth <strong>of</strong> triples.<br />

A random graph with n vertices and edge probability d/n, has an expected number<br />

<strong>of</strong> triangles that is independent <strong>of</strong> n, namely d 3 /6. There are ( n<br />

3)<br />

triples <strong>of</strong> vertices.<br />

Each triple has probability ( d 3<br />

n)<br />

<strong>of</strong> being a triangle. Let ∆ijk be the indicator variable<br />

for the triangle with vertices i, j, and k being present. That is, all three edges (i, j),<br />

(j, k), and (i, k) being present. Then the number <strong>of</strong> triangles is x = ∑ ijk ∆ ijk. Even<br />

though the existence <strong>of</strong> the triangles are not statistically independent events, by linearity<br />

<strong>of</strong> expectation, which does not assume independence <strong>of</strong> the variables, the expected value<br />

<strong>of</strong> a sum <strong>of</strong> random variables is the sum <strong>of</strong> the expected values. Thus, the expected<br />

number <strong>of</strong> triangles is<br />

( ∑ )<br />

E(x) = E ∆ ijk = ∑ ( ) ( 3 n d<br />

E(∆ ijk ) =<br />

≈<br />

3 n) d3<br />

6 .<br />

ijk<br />

ijk<br />

Even though on average there are d3 triangles per graph, this does not mean that with<br />

6<br />

high probability a graph has a triangle. Maybe half <strong>of</strong> the graphs have d3 triangles and<br />

3<br />

the other half have none for an average <strong>of</strong> d3 triangles. Then, with probability 1/2, a<br />

6<br />

graph selected at random would have no triangle. If 1/n <strong>of</strong> the graphs had d3 n triangles<br />

6<br />

and the remaining graphs had no triangles, then as n goes to infinity, the probability that<br />

a graph selected at random would have a triangle would go to zero.<br />

We wish to assert that with some nonzero probability there is at least one triangle<br />

in G(n, p) when p = d . If all the triangles were on a small number <strong>of</strong> graphs, then the<br />

n<br />

number <strong>of</strong> triangles in those graphs would far exceed the expected value and hence the<br />

variance would be high. A second moment argument rules out this scenario where a small<br />

fraction <strong>of</strong> graphs have a large number <strong>of</strong> triangles and the remaining graphs have none.<br />

77

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!