08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

2. The random variable <strong>of</strong> interest, y = ∑ p<br />

I p , is the number <strong>of</strong> prime divisors <strong>of</strong> x<br />

picked at random. Show that the variance <strong>of</strong> y is O(ln ln n). For this, assume the<br />

known<br />

∑<br />

result that the number <strong>of</strong> primes p between 2 and n is O(n/ ln n) and that<br />

1<br />

= ln ln n. To bound the variance <strong>of</strong> y, think <strong>of</strong> what E(I p pI q ) is for p ≠ q, both<br />

p<br />

primes.<br />

3. Use (1) and (2) to prove that the number <strong>of</strong> prime factors is almost surely θ(ln ln n).<br />

Exercise 4.15 Suppose one hides a clique <strong>of</strong> size k in a random graph G ( n, 1 2)<br />

. I.e.,<br />

in the random graph, choose some subset S <strong>of</strong> k vertices and put in the missing edges to<br />

make S a clique. Presented with the modified graph, find S. The larger S is, the easier it<br />

should be to find. In fact, if k is more than c √ n ln n, then with high probability the clique<br />

leaves a telltale sign identifying S as the k vertices <strong>of</strong> largest degree. Prove this statement<br />

by appealing to Theorem 4.1. It remains a puzzling open problem to do this when k is<br />

smaller, say, O(n 1/3 ).<br />

Exercise 4.16 The clique problem in a graph is to find the maximal size clique. This<br />

problem is known to be NP-hard and so a polynomial time algorithm is thought unlikely.<br />

We can ask the corresponding question about random graphs. For example, in G ( )<br />

n, 1 2<br />

there almost surely is a clique <strong>of</strong> size (2 − ε) log n for any ε > 0. But it is not known how<br />

to find one in polynomial time.<br />

1. Show that in G(n, 1 2 ), there almost surely are no cliques <strong>of</strong> size ≥ 2 log 2 n.<br />

2. Use the second moment method to show that in G(n, 1 ), almost surely there are<br />

2<br />

cliques <strong>of</strong> size (2 − ε) log 2 n.<br />

3. Show that for any ε > 0, a clique <strong>of</strong> size (2 − ε) log n can be found in G ( n, 1 2)<br />

in<br />

time n O(ln n) if one exists.<br />

4. Give an O (n 2 ) algorithm that finds a clique <strong>of</strong> size Ω (log n) in G(n, 1 ) with high<br />

2<br />

probability. Hint: use a greedy algorithm. Apply your algorithm to G ( 1000, 2) 1 .<br />

What size clique do you find?<br />

5. An independent set in a graph is a set <strong>of</strong> vertices such that no two <strong>of</strong> them are<br />

connected by an edge. Give a polynomial time algorithm for finding an independent<br />

set in G ( n, 1 2)<br />

<strong>of</strong> size Ω (log n) with high probability.<br />

Exercise 4.17 Suppose H is a fixed graph on cn vertices with 1 4 c2 (log n) 2 edges. Show<br />

that if c ≥ 2, with high probability, H does not occur as a subgraph <strong>of</strong> G(n, 1/4).<br />

Exercise 4.18 Given two instances, G 1 and G 2 <strong>of</strong> G(n, 1 ), what is the largest vertexinduced<br />

subgraph common to both G 1 and G 2<br />

2<br />

?<br />

132

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!