Algorithm Design
Chapter 13 Randomized <strong>Algorithm</strong>s<br />
We first show that the number of misses incurred by LRU, on any request<br />
sequence, can be bounded by roughly k times the optimum. We then use<br />
randomization to develop a variation on LRU that has an exponentially stronger<br />
bound on its performance: Its number of misses is never more than O(log k)<br />
times the optimum.<br />
<strong>Designing the Class of Marking Algorithms</strong><br />
The bounds for both LRU and its randomized variant will follow from a<br />
general template for designing online eviction policies--a class of policies<br />
called marking algorithms. They are motivated by the following intuition.<br />
To do well against the benchmark of f(σ), we need an eviction policy that<br />
is sensitive to the difference between the following two possibilities: (a) in<br />
the recent past, the request sequence has contained more than k distinct<br />
items; or (b) in the recent past, the request sequence has come exclusively<br />
from a set of at most k items. In the first case, we know that f(σ) must be<br />
increasing, since no algorithm can handle more than k distinct items without<br />
incurring a cache miss. But, in the second case, it’s possible that σ is passing<br />
through a long stretch in which an optimal algorithm need not incur any<br />
misses at all. It is here that our policy must make sure that it incurs very<br />
few misses.<br />
Guided by these considerations, we now describe the basic outline of a<br />
marking algorithm, which prefers evicting items that don’t seem to have been<br />
used in a long time. Such an algorithm operates in phases; the description of<br />
one phase is as follows.<br />
Each memory item can be either marked or unmarked<br />
At the beginning of the phase, all items are unmarked<br />
On a request to item s:<br />
Mark s<br />
If s is in the cache, then evict nothing<br />
Else s is not in the cache:<br />
If all items currently in the cache are marked then<br />
Declare the phase over<br />
Processing of s is deferred to start of next phase<br />
Else evict an unmarked item from the cache<br />
Endif<br />
Endif<br />
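The phase structure above can be sketched as runnable Python. This is an illustrative sketch, not the book's pseudocode verbatim: the function name and the `choose_victim` hook are our own, and instead of deferring the phase-ending request to the next phase we unmark everything and process it immediately, which has the same effect. The default victim rule, a uniformly random unmarked item, anticipates the randomized variant discussed later.

```python
import random

def marking_algorithm(requests, k, choose_victim=None, seed=0):
    """Simulate a generic marking algorithm with cache size k.

    Returns the number of cache misses.  choose_victim selects which
    unmarked cached item to evict on a miss; by default it picks one
    uniformly at random.  (Sketch assumption: the request that ends a
    phase is processed immediately rather than deferred.)
    """
    rng = random.Random(seed)
    if choose_victim is None:
        choose_victim = lambda unmarked: rng.choice(sorted(unmarked))
    cache, marked = set(), set()
    misses = 0
    for s in requests:
        if s not in cache:
            misses += 1
            if len(cache) >= k:
                unmarked = cache - marked
                if not unmarked:          # all cached items are marked:
                    marked.clear()        # the phase is over; start a new one
                    unmarked = set(cache)
                cache.remove(choose_victim(unmarked))
            cache.add(s)
        marked.add(s)                     # mark s on every request
    return misses
```

Plugging in a different `choose_victim` yields the different members of the class; the analysis below applies to all of them.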
Note that this describes a class of algorithms, rather than a single specific<br />
algorithm, because the key step--evict an unmarked item from the<br />
cache--does not specify which unmarked item should be selected. We will<br />
see that eviction policies with different properties and performance guarantees<br />
arise depending on how we resolve this ambiguity.<br />
We first observe that, since a phase starts with all items unmarked, and<br />
items become marked only when accessed, the unmarked items have all been<br />
accessed less recently than the marked items. This is the sense in which<br />
a marking algorithm is trying to evict items that have not been requested<br />
recently. Also, at any point in a phase, if there are any unmarked items in the<br />
cache, then the least recently used item must be unmarked. It follows that the<br />
LRU policy evicts an unmarked item whenever one is available, and so we<br />
have the following fact.<br />
(13.34) The LRU policy is a marking algorithm.<br />
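Fact (13.34) can be checked empirically. The sketch below (our own code, with a hypothetical function name) runs a standard LRU cache while maintaining the marking algorithm's phase marks alongside it, and reports whether every item LRU evicts is unmarked at the moment of eviction.

```python
from collections import OrderedDict

def lru_always_evicts_unmarked(requests, k):
    """Run LRU with cache size k while tracking marking-phase marks.

    Returns True iff every eviction removed an unmarked item, which is
    what fact (13.34) asserts must always happen.
    """
    cache = OrderedDict()   # keys ordered least- to most-recently used
    marked = set()
    for s in requests:
        if s in cache:
            cache.move_to_end(s)          # s becomes most recently used
        else:
            if len(cache) >= k:
                if all(x in marked for x in cache):
                    marked.clear()        # phase over: unmark everything
                victim, _ = cache.popitem(last=False)  # least recently used
                if victim in marked:
                    return False          # would contradict (13.34)
            cache[s] = True
        marked.add(s)
    return True
```

On any request sequence this returns True, since the least recently used item is always unmarked whenever an unmarked item exists.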
<strong>Analyzing Marking Algorithms</strong><br />
We now describe a method for analyzing marking algorithms, ending with a<br />
bound on performance that applies to all marking algorithms. After this, when<br />
we add randomization, we will need to strengthen this analysis.<br />
Consider an arbitrary marking algorithm operating on a request sequence<br />
σ. For the analysis, we picture an optimal caching algorithm operating on σ<br />
alongside this marking algorithm, incurring an overall cost of f(σ). Suppose<br />
that there are r phases in this sequence σ, as defined by the marking algorithm.<br />
To make the analysis easier to discuss, we are going to "pad" the sequence<br />
σ both at the beginning and the end with some extra requests; these will not<br />
add any extra misses to the optimal algorithm--that is, they will not cause f(σ)<br />
to increase--and so any bound we show on the performance of the marking<br />
algorithm relative to the optimum for this padded sequence will also apply to<br />
σ. Specifically, we imagine a "phase 0" that takes place before the first phase,<br />
in which all the items initially in the cache are requested once. This does not<br />
affect the cost of either the marking algorithm or the optimal algorithm. We<br />
also imagine that the final phase r ends with an epilogue in which every item<br />
currently in the cache of the optimal algorithm is requested twice in round-robin<br />
fashion. This does not increase f(σ); and by the end of the second pass<br />
through these items, the marking algorithm will contain each of them in its<br />
cache, and each will be marked.<br />
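The padding construction can be written down concretely. In this sketch the function name is ours, and the caller must supply the initial cache contents and the optimal algorithm's final cache contents as assumptions (computing the latter, e.g. by simulating an optimal eviction rule, is outside the sketch):

```python
def pad_sequence(sigma, initial_cache, opt_final_cache):
    """Pad sigma as in the analysis: a phase-0 prologue requesting each
    initially cached item once, then sigma itself, then an epilogue
    requesting each item of the optimal algorithm's final cache twice,
    in round-robin order (one full pass, then a second full pass)."""
    prologue = list(initial_cache)
    epilogue = list(opt_final_cache) + list(opt_final_cache)
    return prologue + list(sigma) + epilogue
```

By construction, neither the prologue nor the epilogue forces the optimal algorithm to miss, so bounds proved for the padded sequence carry over to σ.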
For the performance bound, we need two things: an upper bound on the<br />
number of misses incurred by the marking algorithm, and a lower bound saying<br />
that the optimum must incur at least a certain number of misses.<br />
The division of the request sequence σ into phases turns out to be the<br />
key to doing this. First of all, here is how we can picture the history of a<br />