08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

5.4.1 Using Normalized Conductance to Prove Convergence<br />

We now apply Theorem 5.5 to some examples to illustrate how the normalized conductance<br />

bounds the rate <strong>of</strong> convergence. Our first examples will be simple graphs. The<br />

graphs do not have rapid converge, but their simplicity helps illustrate how to bound the<br />

normalized conductance and hence the rate <strong>of</strong> convergence.<br />

A 1-dimensional lattice<br />

Consider a random walk on an undirected graph consisting <strong>of</strong> an n-vertex path with<br />

self-loops at the both ends. With the self loops, the stationary probability is a uniform 1 n<br />

over all vertices. The set with minimum normalized conductance is the set with probability<br />

π ≤ 1 and the maximum number <strong>of</strong> vertices with the minimum number <strong>of</strong> edges leaving<br />

2<br />

it. This set consists <strong>of</strong> the first n/2 vertices, for which total conductance <strong>of</strong> edges from S<br />

to ¯S is (with m = n/2) π m p m,m+1 = Ω( 1 ) and π(S) = 1. (π n 2 m is the stationary probability<br />

<strong>of</strong> vertex numbered m.) Thus<br />

Φ(S) = 2π m p m,m+1 = Ω(1/n).<br />

By Theorem 5.5, for ε a constant such as 1/100, after O(n 2 log n) steps, ||a t −π|| 1 ≤ 1/100.<br />

This graph does not have rapid convergence. The hitting time and the cover time are<br />

O(n 2 ). In many interesting cases, the mixing time may be much smaller than the cover<br />

time. We will see such an example later.<br />

A 2-dimensional lattice<br />

Consider the n × n lattice in the plane where from each point there is a transition to<br />

each <strong>of</strong> the coordinate neighbors with probability 1 / 4 . At the boundary there are self-loops<br />

with probability 1-(number <strong>of</strong> neighbors)/4. It is easy to see that the chain is connected.<br />

Since p ij = p ji , the function f i = 1/n 2 satisfies f i p ij = f j p ji and by Lemma 5.3 is the<br />

stationary probability. Consider any subset S consisting <strong>of</strong> at most half the states. Index<br />

states by their x and y coordinates. For at least half the states in S, either row x or<br />

column y intersects ¯S (Exercise 5.4). So at least Ω(|S|/n) points in S are adjacent to<br />

points in ¯S. Each such point contributes π i p ij = Ω(1/n 2 ) to flow(S, ¯S). So<br />

∑ ∑<br />

π i p ij ≥ c|S|/n 3 .<br />

i∈S<br />

j∈ ¯S<br />

Thus, Φ ≥ Ω(1/n). By Theorem 5.5, after O(n 2 ln n/ε 2 ) steps, |a t − π| 1 ≤ 1/100.<br />

A lattice in d-dimensions<br />

Next consider the n × n × · · · × n lattice in d-dimensions with a self-loop at each<br />

boundary point with probability 1 − (number <strong>of</strong> neighbors)/2d. The self loops make all<br />

π i equal to n −d . View the lattice as an undirected graph and consider the random walk<br />

157

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!