Algorithm Design

More documents

Recommendations

Info

690 Chapter 12 Local Search 12.7 Best-Response Dynamics and Nash Equilibria Thus far we have been considering local search as a technique for solving - optimization problems with a single objective--in other words, applying local operations to a candidate solution so as to minimize its total cost. There are many settings, however, where a potentially large number of agents, each with its own goals and objectives, collectively interact so as to produce a solution oblem A solution that is produced under these circumstances often .... to some pr ¯ ¯ ....... n .... ~"~’ to null the s°luti°n reflects the "tug-of-war" that led to it, v~m each a~ L ~y-,o ~ in a direction that is favorable to it. We wil! see that these interactions can be viewed as a kind of local search procedure; analogues of local minima have a natural meaning as well, but having multiple agents and multiple objectives introduces new challenges. The field of game theory provides a natural framework in which to talk about what happens in such situations, when a collection of agents interacts strategically--in other words, with each trying to optimize an individual obiective function. To illustrate these issues, we consider a concrete appl$cation, motivated by the problem of routing in networks; along the way, we will introduce some notions that occupy central positions in the area of game theory more generally. J The Problem ’In a network like the Internet, one frequently encounters situations in which a number of nodes all want to establish a connection to a single source node s. For example, the source s may be generating some kind of data stream that all the given nodes want to receive, as in a style of one-to-manY network communication known as multicast. We will model this situation by representing the underlying network as a directed graph G = (V, E), with a cost ce >_ 0 on each edge. There is a designated source node s ~ V and ~ collection of k agents located at distinct terminal nodes tl, t2 ..... tk ~ V. For simplicity, we will not make a distinction between the agents and the nodes at which they reside; in other words, we will think of the agents as being tl, t2 ..... tk. Each agent tj wants to construct a path Pj from s to tl using as little total cost as possible. Now, if there were no interaction among the agents, this would consist of k separate shortest-path problems: Each agent tl would find an s-t~ path for which the total cost of all edges is minimized, and use this as its path P~. What makes this problem interesting is the prospect of agents being able to share the costs of edges. Suppose that after all the agents have chosen their paths, agent t] only needs to pay its "fair share" of the cost of each edge e on its path; that is, rather than paying ce for each e on P~, it pays ce divided by the number of 12.7 Best-Response Dynamics and Nash Equilibria agents whose paths contain e. In this way, there is an incentive for the agents to choose paths that overlap, since they can then benefit by splitting the costs of edges. (This sharing model is appropriate for settings in which the presence of multiple agents on an edge does not significantly degrade the quality of transmission due to congestion or increased latency. If latency effects do come into play, then there is a countervailing penalty for sharing; this too leads to interesting algorithmic questions, but we will stick to our current focus for now, in which sharing comes with benefits only.) Best-Response Dynamics and Nash Equilibria: Definitions and Examples To see how the option of sharing affects the behavior of the agents, let’s begin by considering the pair of very simple examples in Figure 12.8. In example (a), each of the two agents has two options for constructing a path: the middle route through v, and the outer route using a single edge. Suppose that each agent starts out with an initial path but is continually evaluating the current situation to decide whether it’s possible to switch to a better path. In example (a), suppose the two agents start out using their outer paths. Then tl sees no advantage in switching paths (since 4 < 5 + 1), but t2 does (since 8 > 5 + 1), and so t2 updates its path by moving to the middle. Once this happens, things have changed from the perspective of t~: There is suddenly an advantage for tl in switching as well, since it now gets to share the cost of the middle path, and hence its cost to use the middle path becomes 2.5 + 1 < 4. Thus it will switch to the middle path. Once we are in a situation where both Ca) (b) ~: agents Figure 12.8 (a) It is in the two agents’ interest to share the middle path. (b) It would be better for all the agents to share the edge on the left. But if all k agents start on the fight-hand edge, then no one of them will want to unilaterally move from right to left; in other words, the solution in which all agents share the edge on the right is a bad Nash equilibrium. 691
692 Chapter 12 Local Search sides are using the middle path, neither has an incentive to switch, and so this is a stable solution. Let’s discuss two definitions from the area of game theory that capture what’s going on in this simple example. While we wil! continue to focus on our particular mulficast routing problem, these definitions are relevant to any setting in which multiple agents, each with an individual objective, interact to produce a collective solution. As such, we will phrase the definitions in these general terms. o First of all, in the example, each agent was continually prepared to improve its solution in response to changes made by the other agent(s). We will refer to this process as best-response dynamics. In other words, we are interested in the dynamic behavior of a process in which each agent updates based on its best response to the current situation. o Second, we are particularly interested in stable solutions, where the best response of each agent is to stay put. We will refer to such a solution, from which no agent has an incentive to deviate, as a Nash equilibrium. (This is named after the mathematician John Nash, who won the Nobel Prize in economics for his pioneering work on this concept.) Hence, in example (a), the solution in which both agents use the middle path is a Nash equilibrium. Note that the Nash equilibria are precisely the solutions at which best-response dynamics terminate. The example in Figure 12.8(b) illustrates the possibility of multiple Nash equilibria. In this example, there are k agents that all reside at a common node t (that is, t 1 = t2 ..... t k = t), and there are two parallel edges from s to t with different costs. The solution in which all agents use the left-hand edge is a Nash equilibrium in which all agents pay (1 + ~)/k. The solution in which all agents use the fight-hand edge is also a Nash equilibrium, though here the agents each pay k/k = !. The fact that this latter solution is a Nash equilibrium exposes an important point about best-response dynamics. If the agents could somehow synchronously agree to move from the fight-hand edge to the left-hand one, they’d all be better off. But under best-response dynamics, each agent is only evaluating the consequences of a unilateral move by itself. In effect, an agent isn’t able to make any assumptions about future actions of other agents--in an Internet setting, it may not even know anything about these other agents or their current solutions--and so it is only willing to perform updates that lead to an immediate improvement for itself. To quantify the sense in which one of the Nash equilibria in Figure 12.8(13) is better than the other, it is useful to introduce one further deflation. We say that a solution is a social optimum if it minimizes the total cost to all agents. We can think of such a solution as the one that would be imposed by 12.7 Best-Response Dynamics and Nash Equilibria a benevolent central authority that viewed all agents as equally important and hence evaluated the quality of a solution by summing the costs they incurred. Note that in both (a) and (b), there is a social optimum that is also a Nash equilibrium, although in (b) there is also a second Nash equilibrium whose cost is much greater. The Relationship to Local Search Around here, the connections to local search start to come into focus. A set of agents following best-response dynamics are engaged in some kind of gradient descent process, exploring the "landscape" of possible solutions as they try tb minimize their individual costs. The Nash equilibria are the natural analogues of local minima in this process: solutions from which no improving move is possible. And the "local" nature of the search is clear as well, since agents are only updating their solutions when it leads to an immediate improvement. Having said all this, it’s important to think a bit further and notice the crucial ways in which this differs from standard local search. In the beginning of this chapter, it was easy to argue that the gradient descent algorithm for a combinatorial problem must terminate at a local minimum: each update decreased the cost of the solution, and since there were only finitely many possible solutions, the sequence of updates could not go on forever. In other words, the cost function itself provided the progress measure we needed to establish termination. In best-response dynamics, on the other hand, each agent has its own persona! objective function to minimize, and so it’s not clear what overall "progress" is being made when, for example, agent t~ decides to update its path from s. There’s progress for t~, of course, since its cost goes down, but this may be offset by an even larger increase in the cost to some other agent. Consider, for example, the network in Figure 12.9. If both agents start on the middle path, then tl will in fact have an incentive to move to the outer path; its cost drops from 3.5 to 3, but in the process the cost of t2 increases from 3.5 to 6. (Once this happens, t 2 will also move to its outer path, and this solution--with both nodes on the outer paths--is the unique Nash equilibrium.) There are examples, in fact, where the cost-increasing effects of bestresponse dynamics can be much worse than this. Consider the situation in Figure 12.10, where we have k agents that each have th~ option to take a common outer path of cost 1 + e (for some small number e > 0), or to take their own alternate path. The alternate path for tj has cost 1/]. Now suppose we start with a solution in which all agents are sharing the outer path. Each agent pays (1 + e)/k, and this is the solution that minimizes the total cost to all agents. But running best-response dynamics starting from this solution causes things to unwind rapidly. First tk switches to its alternate path, since 1/k < (1 + e)/k. 693 Figure 12.9 A network in which the unique Nash equilibrium differs from the social optimum.
Page 2 and 3:
Cornell University Boston San Franc
Page 4 and 5:
Contents About the Authors Preface
Page 6 and 7:
X Contents 9 10 Extending the Limit
Page 8 and 9:
xiv Preface problems out in the wor
Page 10 and 11:
Preface Chapters 2 and 3 also prese
Page 12 and 13:
xxii Preface Preface xxiii Sue Whit
Page 14 and 15:
2 Chapter 1 Introduction: Some Repr
Page 16 and 17:
6 Chapter 1 Introduction: Some Repr
Page 18 and 19:
10 Chapter 1 Introduction: Some Rep
Page 20 and 21:
14 Chapter ! Introduction: Some Rep
Page 22 and 23:
Page 24 and 25:
Page 26 and 27:
Page 28 and 29:
30 Chapter 2 Basics of Algorithm An
Page 30 and 31:
34 n= I0 n=30 n=50 = 100 = 1,000 10
Page 32 and 33:
Page 34 and 35:
Page 36 and 37:
Page 38 and 39:
Page 40 and 41:
Page 42 and 43:
Page 44 and 45:
Page 46 and 47:
Page 48 and 49:
Page 50 and 51:
74 Chapter 3 Graphs 3.1 Basic Defin
Page 52 and 53:
78 Chapter 3 Graphs Rooted trees ar
Page 54 and 55:
82 Chapter 3 Graphs Current compone
Page 56 and 57:
86 Chapter 3 Graphs Proof. Suppose
Page 58 and 59:
90 Chapter 3 Graphs Two of the simp
Page 60 and 61:
94 Chapter 3 Graphs most ~u nv = 2m
Page 62 and 63:
98 Chapter 3 Graphs It is important
Page 64 and 65:
102 Chapter 3 Graphs ordering, that
Page 66 and 67:
106 Chapter 3 Graphs We can already
Page 68 and 69:
110 Chapter 3 Graphs Exercises 111
Page 70 and 71:
Greedy A~gorithms In Wall Street, t
Page 72 and 73:
118 Chapter 4 Greedy Mgofithms o In
Page 74 and 75:
122 Chapter 4 Greedy Algorithms Our
Page 76 and 77:
126 Chapter 4 Greedy Algorithms Len
Page 78 and 79:
Before swapping: After swapping: d
Page 80 and 81:
134 Chapter 4 Greedy Algorithms wit
Page 82 and 83:
138 Chapter 4 Greedy Algorithms 4.4
Page 84 and 85:
142 Chapter 4 Greedy Algorithms 4.5
Page 86 and 87:
146 Chapter 4 Greedy Algorithms (e
Page 88 and 89:
150 Chapter 4 Greedy Algorithms her
Page 90 and 91:
154 Chapter 4 Greedy Algorithms we
Page 92 and 93:
158 Chapter 4 Greedy Algorithms spa
Page 94 and 95:
162 Chapter 4 Greedy Algorithms ~ T
Page 96 and 97:
166 Chapter 4 Greedy Algorithms g2(
Page 98 and 99:
170 Chapter 4 Greedy Algorithms Sha
Page 100 and 101:
174 Chapter 4 Greedy Algorithms ...
Page 102 and 103:
178 Chapter 4 Greedy Algorithms Fig
Page 104 and 105:
182 Chapter 4 Greedy Algorithms The
Page 106:
186 Chapter 4 Greedy Algorithms Sol
Page 109 and 110:
192 Chapter 4 Greedy Algorithms 8.
Page 111 and 112:
Page 113 and 114:
Page 115 and 116:
204 Chapter 4 Greedy Algorithms Not
Page 117 and 118:
Chapter Divide artd Cortquer Divide
Page 119 and 120:
212 Chapter 5 Divide and Conquer Th
Page 121 and 122:
216 cn time plus recursive calls Ch
Page 123 and 124:
220 Chapter 5 Divide and Conquer mo
Page 125 and 126:
224 Chapter 5 Divide and Conquer El
Page 127 and 128:
228 Chapter 5 Divide and Conquer 5.
Page 129 and 130:
232 Chapter 5 Divide and Conquer (a
Page 131 and 132:
we’ll just give a simple example
Page 133 and 134:
240 Chapter 5 Divide and Conquer po
Page 135 and 136:
244 Chapter 5 Divide and Conquer Th
Page 137 and 138:
248 Chapter 5 Divide and Conquer It
Page 139 and 140:
252 Chapter 6 Dynamic Programming b
Page 141 and 142:
256 Chapter 6 Dynamic Programming 6
Page 143 and 144:
260 Index 1 2 3 4 5 6 L~ I = 2 Chap
Page 145 and 146:
264 Chapter 6 Dynamic Programming w
Page 147 and 148:
268 Chapter 6 Dynamic Programming t
Page 149 and 150:
272 Chapter 6 Dynamic Programming s
Page 151 and 152:
Page 153 and 154:
280 Chapter 6 Dynamic Programming t
Page 155 and 156:
284 Figure 6.18 The OPT values for
Page 157 and 158:
288 Chapter 6 Dynamic Programming U
Page 159 and 160:
292 (a) Figure 6.21 (a) With negati
Page 161 and 162:
296 Chapter 6 Dynamic Programming i
Page 163 and 164:
300 Chapter 6 Dynamic Programming p
Page 165 and 166:
304 Chapter 6 Dynamic Programming e
Page 167 and 168:
308 Chapter 6 Dynamic Programming o
Page 169 and 170:
312 Chapter 6 Dynamic Programming E
Page 171 and 172:
316 Chapter 6 Dynamic Programming T
Page 173 and 174:
320 Chapter 6 Dynamic Programming (
Page 175 and 176:
Page 177 and 178:
328 Chapter 6 Dynamic Programming T
Page 179 and 180:
Page 181 and 182:
336 Chapter 6 Dynamic Programming h
Page 183 and 184:
54O Chapter 7 Network Flow define f
Page 185 and 186:
344 Chapter 7 Network Flow This aug
Page 187 and 188:
348 Chapter 7 Network Flow Proof. ~
Page 189 and 190:
352 Chapter 7 Network Flow In the n
Page 191 and 192:
356 Chapter 7 Network Flow (7.19) T
Page 193 and 194:
360 Chapter 7 Network Flow preflow
Page 195 and 196:
364 Chapter 7 Network Flow Heights
Page 197 and 198:
368 Chapter 7 Network Flow ~ The Pr
Page 199 and 200:
372 Chapter 7 Network Flow that are
Page 201 and 202:
376 Chapter 7 Network Flow I~low ar
Page 203 and 204:
380 Chapter 7 Network Flow Figure 7
Page 205 and 206:
384 Chapter 7 Network Flow Lower bo
Page 207 and 208:
388 BOS 6 DCA 7 DCA 8 Chapter 7 Net
Page 209 and 210:
392 Chapter 7 Network Flow natural
Page 211 and 212:
396 Chapter 7 Network Flow 7.11 Pro
Page 213 and 214:
400 Chapter 7 Network Flow 7.12 Bas
Page 215 and 216:
404 Chapter 7 Network Flow to cross
Page 217 and 218:
4O8 Chapter 7 Network Flow Why are
Page 219 and 220:
412 Chapter 7 Network Flow ProoL Th
Page 221 and 222:
416 Chapter 7 Network Flow Figure 7
Page 223 and 224:
420 Chapter 7 Network Flow 11. Now
Page 225 and 226:
424 Figure 7.29 An instance of Cove
Page 227 and 228:
428 Chapter 7 Network Flow 22. This
Page 229 and 230:
432 Chapter 7 Network Flow generall
Page 231 and 232:
436 Chapter 7 Network Flow (a) (b)
Page 233 and 234:
440 Chapter 7 Network Flow going to
Page 235 and 236:
Chapter 7 Network Flow 44. 45. Give
Page 237 and 238:
448 Chapter 7 Network Flow 51. The
Page 239 and 240:
452 Chapter 8 NP and Computational
Page 241 and 242:
Page 243 and 244:
Page 245 and 246:
Page 247 and 248:
Page 249 and 250:
472 Chapter 8 NP and Computation!!
Page 251 and 252:
Page 253 and 254:
Page 255 and 256:
Page 257 and 258:
Page 259 and 260:
Page 261 and 262:
Page 263 and 264:
5OO Chapter 8 NP and Computational
Page 265 and 266:
Page 267 and 268:
Page 269 and 270:
Page 271 and 272:
Page 273 and 274:
Page 275 and 276:
Page 277 and 278:
Page 279 and 280:
532 Chapter 9 PSPACE: A Class of Pr
Page 281 and 282:
Page 283 and 284:
Page 285 and 286:
Page 287 and 288:
Page 289 and 290:
Page 291 and 292:
556 Chapter 10 Extending the Limits
Page 293 and 294:
Page 295 and 296:
Page 297 and 298:
Page 299 and 300:
Page 301 and 302:
Page 303 and 304:
58O Chapter 10 Extending the Limits
Page 305 and 306:
Page 307 and 308: 588 Chapter 10 Extending the Limits
Page 309 and 310: 592 Chapter 10 Extending the Limits
Page 311 and 312: 596 Figure 10.12 A triangulated cyc
Page 313 and 314: 600 Chapter 11 Approximation Algori
Page 327 and 328: 628 Chapter 11 Approximation Mgorit
Page 331 and 332: 636 Chapter !! Approximation Algori
Page 333 and 334: 640 Chapter 11 Approximation Mgofit
Page 341 and 342: 656 Chapter 11 Approximation Mgorit
Page 345 and 346: 664 Chapter 12 Local Search basic p
Page 347 and 348: 668 Chapter 12 Local Search moves b
Page 349 and 350: 672 Chapter 12 Local Search of all
Page 351 and 352: 676 Chapter 12 Local Search 12.4 Ma
Page 353 and 354: 680 Chapter 12 Local Search saw ear
Page 355 and 356: 684 Chapter 12 Local Search utting
Page 357: 688 Chapter 12 Local Search this di
Page 361 and 362: 696 Chapter 12 Local Search Of cour
Page 363 and 364: 7OO Chapter 12 Loca! Search and for
Page 365 and 366: 7O4 Chapter 12 Local Search A previ
Page 367 and 368: 708 Chapter 13 Randomized Algorithm
Page 375 and 376: 724 Chapter 13 Randomized Mgorithms
Page 381 and 382: 736 Chapter 13 Randomized Mgofithms
Page 409 and 410:
792 Chapter !3 Randomized Algorithm
Page 411 and 412:
796 Epilogue: Algorithms That Run F
Page 413 and 414:
800 Epilogue: Algorithms That Run F
Page 415 and 416:
8O4 Epilogue: Algorithms That Run F
Page 417 and 418:
808 References L. R. Ford. Network
Page 419 and 420:
812 References M. Mitzenmacher and
Page 421 and 422:
816 Index Approximation algorithms
Page 423 and 424:
820 Cushions in packet switching, 8
Page 425 and 426:
824 Index Graphs (cont.) queues and
Page 427 and 428:
828 Index Mapping genomes, 279, 521
Page 429 and 430:
832 Index Index 833 Preflow-Push Ma
Page 431 and 432:
836 Index Stale items in randomized
show all

Algorithm Design

Create successful ePaper yourself

Delete template?

Save as template?