algorithms

Recommendations

Info

There is nothing inherently wrong with these two functions. If our 250 IP addresses were uniformly drawn from among all N = 2 32 possibilities, then these functions would behave well. The problem is we have no guarantee that the distribution of IP addresses is uniform. Conversely, there is no single hash function, no matter how sophisticated, that behaves well on all sets of data. Since a hash function maps 2 32 IP addresses to just 250 names, there must be a collection of at least 2 32 /250 ≈ 2 24 ≈ 16,000,000 IP addresses that are assigned the same name (or, in hashing terminology, “collide”). If many of these show up in our customer set, we’re in trouble. Obviously, we need some kind of randomization. Here’s an idea: let us pick a hash function at random from some class of functions. We will then show that, no matter what set of 250 IP addresses we actually care about, most choices of the hash function will give very few collisions among these addresses. To this end, we need to define a class of hash functions from which we can pick at random; and this is where we turn to number theory. Let us take the number of buckets to be not 250 but n = 257—a prime number! And we consider every IP address x as a quadruple x = (x 1 , . . . , x 4 ) of integers modulo n—recall that it is in fact a quadruple of integers between 0 and 255, so there is no harm in this. We can define a function h from IP addresses to a number mod n as follows: fix any four numbers mod n = 257, say 87, 23, 125, and 4. Now map the IP address (x 1 , ..., x 4 ) to h(x 1 , ..., x 4 ) = (87x 1 + 23x 2 + 125x 3 + 4x 4 ) mod 257. Indeed, any four numbers mod n define a hash function. For any four coefficients a 1 , . . . , a 4 ∈ {0, 1, . . . , n − 1}, write a = (a 1 , a 2 , a 3 , a 4 ) and define h a to be the following hash function: h a (x 1 , . . . , x 4 ) = 4∑ a i · x i mod n. We will show that if we pick these coefficients a at random, then h a is very likely to be good in the following sense. Property Consider any pair of distinct IP addresses x = (x 1 , . . . , x 4 ) and y = (y 1 , . . . , y 4 ). If the coefficients a = (a 1 , a 2 , a 3 , a 4 ) are chosen uniformly at random from {0, 1, . . . , n − 1}, then i=1 Pr {h a (x 1 , . . . , x 4 ) = h a (y 1 , . . . , y 4 )} = 1 n . In other words, the chance that x and y collide under h a is the same as it would be if each were assigned nicknames randomly and independently. This condition guarantees that the expected lookup time for any item is small. Here’s why. If we wish to look up x in our hash table, the time required is dominated by the size of its bucket, that is, the number of items that are assigned the same name as x. But there are only 250 items in the hash table, and the probability that any one item gets the same name as x is 1/n = 1/257. Therefore the expected number of items that are assigned the same name as x by a randomly chosen hash function h a is 250/257 ≈ 1, which means the expected size of x’s bucket is less than 2. 1 1 When a hash function h a is chosen at random, let the random variable Y i (for i = 1, . . . , 250) be 1 if item i gets the same name as x and 0 otherwise. So the expected value of Y i is 1/n. Now, Y = Y 1 + Y 2 + · · · + Y 250 is the number of items which get the same name as x, and by linearity of expectation, the expected value of Y is simply the sum of the expected values of Y 1 through Y 250. It is thus 250/n = 250/257. 44
Let us now prove the preceding property. Proof. Since x = (x 1 , . . . , x 4 ) and y = (y 1 , . . . , y 4 ) are distinct, these quadruples must differ in some component; without loss of generality let us assume that x 4 ≠ y 4 . We wish to compute the probability Pr[h a (x 1 , . . . , x 4 ) = h a (y 1 , . . . , y 4 )], that is, the probability that ∑ 4 ∑ i=1 a i · x i ≡ 4 i=1 a i · y i mod n. This last equation can be rewritten as 3∑ a i · (x i − y i ) ≡ a 4 · (y 4 − x 4 ) mod n (1) i=1 Suppose that we draw a random hash function h a by picking a = (a 1 , a 2 , a 3 , a 4 ) at random. We start by drawing a 1 , a 2 , and a 3 , and then we pause and think: What is the probability that the last drawn number a 4 is such that equation (1) holds? So far the left-hand side of equation (1) evaluates to some number, call it c. And since n is prime and x 4 ≠ y 4 , (y 4 − x 4 ) has a unique inverse modulo n. Thus for equation (1) to hold, the last number a 4 must be precisely c · (y 4 − x 4 ) −1 mod n, out of its n possible values. The probability of this happening is 1/n, and the proof is complete. Let us step back and see what we just achieved. Since we have no control over the set of data items, we decided instead to select a hash function h uniformly at random from among a family H of hash functions. In our example, H = {h a : a ∈ {0, . . . , n − 1} 4 }. To draw a hash function uniformly at random from this family, we just draw four numbers a 1 , . . . , a 4 modulo n. (Incidentally, notice that the two simple hash functions we considered earlier, namely, taking the last or the first 8-bit segment, belong to this class. They are h (0,0,0,1) and h (1,0,0,0) , respectively.) And we insisted that the family have the following property: For any two distinct data items x and y, exactly |H|/n of all the hash functions in H map x and y to the same bucket, where n is the number of buckets. A family of hash functions with this property is called universal. In other words, for any two data items, the probability these items collide is 1/n if the hash function is randomly drawn from a universal family. This is also the collision probability if we map x and y to buckets uniformly at random—in some sense the gold standard of hashing. We then showed that this property implies that hash table operations have good performance in expectation. This idea, motivated as it was by the hypothetical IP address application, can of course be applied more generally. Start by choosing the table size n to be some prime number that is a little larger than the number of items expected in the table (there is usually a prime number close to any number we start with; actually, to ensure that hash table operations have good performance, it is better to have the size of the hash table be about twice as large as the number of items). Next assume that the size of the domain of all data items is N = n k , a power of n (if we need to overestimate the true number of data items, so be it). Then each data item can be considered as a k-tuple of integers modulo n, and H = {h a : a ∈ {0, . . . , n − 1} k } is a universal family of hash functions. 45
Page 1: Algorithms Copyright c○2006 S. Da
Page 4 and 5: 4 Paths in graphs 109 4.1 Distances
Page 7 and 8: List of boxes Bases and logs . . .
Page 9 and 10: Preface This book evolved over the
Page 11 and 12: Chapter 0 Prologue Look around you.
Page 13 and 14: The first question is moot here, as
Page 15 and 16: for addition, which works on one di
Page 17 and 18: 4. Likewise, any polynomial dominat
Page 19: and in general ( Fn ) = F n+1 ( ) n
Page 22 and 23: Bases and logs Naturally, there is
Page 24 and 25: Figure 1.1 Multiplication à la Fra
Page 26 and 27: Figure 1.3 Addition modulo 8. 6 0 0
Page 28 and 29: Figure 1.4 Modular exponentiation.
Page 30 and 31: Figure 1.6 A simple extension of Eu
Page 32 and 33: We say x is the multiplicative inve
Page 34 and 35: Figure 1.7 An algorithm for testing
Page 36 and 37: Figure 1.8 An algorithm for testing
Page 38 and 39: Randomized algorithms: a virtual ch
Page 40 and 41: Alice’s encryption function is th
Page 42 and 43: Figure 1.9 RSA. Bob chooses his pub
Page 46 and 47: Exercises 1.1. Show that in any bas
Page 48 and 49: the depth of the circuit or the lon
Page 50 and 51: (c) Give an example of positive int
Page 52 and 53: x = x L x R = 2 n/2 x L + x R y = y
Page 54 and 55: Figure 2.2 Divide-and-conquer integ
Page 56 and 57: Binary search The ultimate divide-a
Page 59 and 60: An n log n lower bound for sorting
Page 61 and 62: The three sublists S L , S v , and
Page 63 and 64: to be the best running time possibl
Page 66 and 67: qp t AB CD EF GH IJ KL MN OP QR Why
Page 68 and 69: The equivalence of the two polynomi
Page 70 and 71: Figure 2.6 The complex roots of uni
Page 72 and 73: A matrix reformulation To get a cle
Page 74 and 75: The orthogonality property can be s
Page 76 and 77: FFT n (input: a 0 , . . . , a n−1
Page 78 and 79: The slow spread of a fast algorithm
Page 80 and 81: (e) T (n) = 8T (n/2) + n 3 (f) T (n
Page 82 and 83: 2.21. Mean and median. One of the m
Page 84 and 85: (c) In fact, squaring matrices is n
Page 87 and 88: Chapter 3 Decompositions of graphs
Page 89 and 90: Therefore, each edge appears in exa
Page 91 and 92: Figure 3.3. 1 The previsit and post
Page 93 and 94: Figure 3.6 (a) A 12-node graph. (b)
Page 95 and 96:
Tree edges are actually part of the
Page 97 and 98:
the same thing. Since a dag is line
Page 99 and 100:
Figure 3.10 The reverse of the grap
Page 101 and 102:
Exercises 3.1. Perform a depth-firs
Page 103 and 104:
(a) Model this as a graph problem:
Page 105 and 106:
For instance, in the graph below (w
Page 107 and 108:
3.30. On page 97, we defined the bi
Page 109 and 110:
Chapter 4 Paths in graphs 4.1 Dista
Page 111 and 112:
Figure 4.4 The result of breadth-fi
Page 113 and 114:
Figure 4.6 Breaking edges into unit
Page 115 and 116:
into account. This viewpoint gives
Page 117 and 118:
§¦ ¥¤ £¢ ©¨ #"
Page 119 and 120:
Which heap is best? The running tim
Page 121 and 122:
Figure 4.11 (a) A binary heap with
Page 123 and 124:
Figure 4.13 The Bellman-Ford algori
Page 125 and 126:
Figure 4.15 A single-source shortes
Page 127 and 128:
Input: Undirected graph G = (V, E)
Page 129 and 130:
4.16. Section 4.5.2 describes a way
Page 131 and 132:
Let r ∗ be the maximum ratio achi
Page 133 and 134:
Chapter 5 Greedy algorithms A game
Page 135 and 136:
Trees A tree is an undirected graph
Page 137 and 138:
Figure 5.3 The cut property at work
Page 139 and 140:
eturn x As can be expected, makeset
Page 141 and 142:
Figure 5.7 The effect of path compr
Page 143 and 144:
A randomized algorithm for minimum
Page 145 and 146:
Figure 5.9 Top: Prim’s minimum sp
Page 147 and 148:
Figure 5.10 A prefix-free encoding.
Page 149 and 150:
149
Page 151 and 152:
5.3 Horn formulas In order to displ
Page 153 and 154:
Figure 5.11 (a) Eleven towns. (b) T
Page 155 and 156:
Exercises 5.1. Consider the followi
Page 157 and 158:
(a) Code: {0, 10, 11} (b) Code: {0,
Page 159 and 160:
Input: Undirected graph G = (V, E);
Page 161 and 162:
Chapter 6 Dynamic programming In th
Page 163 and 164:
Figure 6.2 The dag of increasing su
Page 165 and 166:
Programming? The origin of the term
Page 167 and 168:
Figure 6.4 (a) The table of subprob
Page 169 and 170:
Common subproblems Finding the righ
Page 171 and 172:
6.4 Knapsack During a robbery, a bu
Page 173 and 174:
Memoization In dynamic programming,
Page 175 and 176:
Figure 6.7 (a) ((A × B) × C) × D
Page 177 and 178:
ecause it leads right back to the O
Page 179 and 180:
d ij . We must pick the best such i
Page 181 and 182:
Figure 6.11 I(u) is the size of the
Page 183 and 184:
6.8. Given two strings x = x 1 x 2
Page 185 and 186:
Figure 6.12 Two binary search trees
Page 187 and 188:
6.26. Sequence alignment. When a ne
Page 189 and 190:
Chapter 7 Linear programming and re
Page 191 and 192:
It is a general rule of linear prog
Page 193 and 194:
the feasible region. The point of f
Page 195 and 196:
Figure 7.3 A communications network
Page 197 and 198:
Reductions Sometimes a computationa
Page 199 and 200:
Matrix-vector notation A linear fun
Page 201 and 202:
Figure 7.5 An illustration of the m
Page 203 and 204:
L e R s t e ′ We claim that size(
Page 205 and 206:
Figure 7.6 Continued Current Flow R
Page 207 and 208:
from the algorithm, which in such c
Page 209 and 210:
Figure 7.10 A generic primal LP in
Page 211 and 212:
Now suppose the two of them play th
Page 213 and 214:
In LP form, this is min w −3y 1 +
Page 215 and 216:
7.6.2 The algorithm On each iterati
Page 217 and 218:
Figure 7.13 Simplex in action. Init
Page 219 and 220:
close to one another? Unboundedness
Page 221 and 222:
Gaussian elimination Under our alge
Page 223 and 224:
output AND NOT OR AND OR NOT true f
Page 225 and 226:
Exercises 7.1. Consider the followi
Page 227 and 228:
7.12. For the linear program max x
Page 229 and 230:
• f (i) is a valid flow from s i
Page 231 and 232:
7.28. A linear program for shortest
Page 233 and 234:
Chapter 8 NP-complete problems 8.1
Page 235 and 236:
Satisfiability SATISFIABILITY, or S
Page 237 and 238:
cost as the budget. Fine, but how d
Page 239 and 240:
Figure 8.3 What is the smallest cut
Page 241 and 242:
Figure 8.4 A more elaborate matchma
Page 243 and 244:
8.2 NP-complete problems Hard probl
Page 245 and 246:
I. These two translation procedures
Page 247 and 248:
Figure 8.7 Reductions between searc
Page 249 and 250:
Figure 8.8 The graph corresponding
Page 251 and 252:
Figure 8.9 S is a vertex cover if a
Page 253 and 254:
2n − m pets will be left unmatche
Page 255 and 256:
Figure 8.10 Rudrata cycle with pair
Page 257 and 258:
est of the graph as shown in Figure
Page 259 and 260:
RUDRATA CYCLE−→TSP Given a grap
Page 261 and 262:
Now that we know CIRCUIT SAT reduce
Page 263 and 264:
Exercises 8.1. Optimization versus
Page 265 and 266:
Input: An undirected graph G = (V,
Page 267 and 268:
• (w i , w i ′ ) for all i = 1,
Page 269 and 270:
Chapter 9 Coping with NP-completene
Page 271 and 272:
anching. We can expand either of th
Page 273 and 274:
follow the same pattern. As before,
Page 275 and 276:
9.2 Approximation algorithms In an
Page 277 and 278:
2. It can be used to build a vertex
Page 279 and 280:
9.2.3 TSP The triangle inequality p
Page 281 and 282:
Let’s consider the O(nV ) algorit
Page 283 and 284:
iterations will be needed: whether
Page 285 and 286:
Figure 9.8 Local search. Figure 9.7
Page 287 and 288:
What can be done about such subopti
Page 289 and 290:
Figure 9.10 The search space for a
Page 291 and 292:
Exercises 9.1. In the backtracking
Page 293 and 294:
start with any S ⊆ V while there
Page 295 and 296:
Chapter 10 Quantum algorithms This
Page 297 and 298:
Figure 10.2 Measurement of a superp
Page 299 and 300:
Figure 10.3 A quantum algorithm tak
Page 301 and 302:
¡ £¢ ¥¤ §¦ ©¨ #" %
Page 303 and 304:
The Fourier transform of a periodic
Page 305 and 306:
Yet another basic gate, the control
Page 307 and 308:
10.6 Factoring as periodicity We ha
Page 309 and 310:
Figure 10.6 Quantum factoring. 0 QF
Page 311 and 312:
Exercises 10.1. ∣ ψ 〉 = 1 √2
Page 313 and 314:
Historical notes and further readin
Page 315 and 316:
Index O(·), 13 Ω(·), 14 Θ(·),
Page 317 and 318:
interior-point method, 217 interpol
show all

algorithms

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?