08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

or<br />

p[I − (1 − α)A] = α (1, 1, . . . , 1)<br />

n<br />

p = α n (1, 1, . . . , 1) [I − (1 − α) A]−1 .<br />

Thus, in principle, p can be found by computing the inverse <strong>of</strong> [I − (1 − α)A] −1 . But<br />

this is far from practical since for the whole web one would be dealing with matrices with<br />

billions <strong>of</strong> rows and columns. A more practical procedure is to run the random walk and<br />

observe using the basics <strong>of</strong> the power method in Chapter 3 that the process converges to<br />

the solution p.<br />

For the personalized page rank, instead <strong>of</strong> restarting at an arbitrary vertex, the walk<br />

restarts at a designated vertex. More generally, it may restart in some specified neighborhood.<br />

Suppose the restart selects a vertex using the probability distribution s. Then, in<br />

the above calculation replace the vector 1 (1, 1, . . . , 1) by the vector s. Again, the computation<br />

could be done by a random walk. But, we wish to do the random walk calculation<br />

n<br />

for personalized pagerank quickly since it is to be performed repeatedly. With more care<br />

this can be done, though we do not describe it here.<br />

5.9 Bibliographic Notes<br />

The material on the analogy between random walks on undirected graphs and electrical<br />

networks is from [DS84] as is the material on random walks in Euclidean space. Additional<br />

material on Markov chains can be found in [MR95b], [MU05], and [per10]. For<br />

material on Markov Chain Monte Carlo methods see [Jer98] and [Liu01].<br />

The use <strong>of</strong> normalized conductance to prove convergence <strong>of</strong> Markov Chains is by<br />

Sinclair and Jerrum, [SJ] and Alon [Alo86]. A polynomial time bounded Markov chain<br />

based method for estimating the volume <strong>of</strong> convex sets was developed by Dyer, Frieze and<br />

Kannan [DFK91].<br />

179

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!