Algorithm Design
Algorithm Design
Algorithm Design
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
786<br />
Chapter 13 Randomized <strong>Algorithm</strong>s<br />
natural. This is based on conceptually dividing the cable into n equallength<br />
"edges" el ..... en, where the first edge el runs from the switching<br />
hub to T1, and the i th edge ei (i > 1) runs from Ti_~ to T~. Now we observe<br />
that, while all the towns benefit from e~, only the last town benefits from<br />
en. So they suggest<br />
Proposal C. Divide the cost separately for each edge el. The cost of e i should<br />
be shared equally by the towns T i, Ti+l ..... Tn, since these are the towns<br />
"downstream" of e i.<br />
So now the towns have many different options; which is the fairest?<br />
To resolve this, they turn to the work of Lloyd Shapley, one of the most<br />
famous mathematical economists of the 20th century. He proposed what<br />
is now called the Shapley value as a general mechanism for sharing costs<br />
or benefits among several parties. It can be viewed as determining the<br />
"marginal contrmulx "-- "on" of each part5’, assuming the parties arrive , in a<br />
random order.<br />
Here’s how it would work concretely in our setting. Consider an<br />
.......... s "arrive" in this order<br />
ordering 0 of the towns, and suppose t_hat m~ Lu .... ¯<br />
The marginal cost of town Ti in order 0 is determined as follows. If T~ is<br />
first in the order O, then T~ pays ki, the cost of running the cable all the<br />
way from the switching hub to T~. Otherwise, look at the Set of towns that<br />
come before Ti in the order (9, and let T i be the farthest among these towns<br />
from the switching hub. When T~ arrives, we assume the cable already<br />
reaches out to T i but no farther. So if] > i (T 1 is farther out than T~), then<br />
the marginal cost of T~ is 0, since the cable already runs past T~ on its way<br />
out to T i. On the other hand, if] < i, then the marginal cost of Ti is k(i -]):<br />
the cost of extending the cable from T 1 out to Ti.<br />
(For example, suppose n = 3 and the towns arrive in the order<br />
T~, T3, T2. First T1 pays k when it arrives. Then, when T3 arrives, it only has<br />
to pay 2k to extend the cable from T~. Finally, when T2 arrives, it doesn’t<br />
have to pay anything since the cable already runs past it out to T~.)<br />
Now, let X~ be the random variable equal to the marginal cost of<br />
town T~ when the order 0 is selected uniformly at random from all<br />
permutations of the towns. Under the rules of the Shapley value, the<br />
amount that T~ should contribute to the overall cost of the cable is the<br />
expected value of X~.<br />
The question is: Which of the three proposals above, ff any, gives the<br />
same division of costs as the Shapley value cost-sharing mechanism? Give<br />
a proof for your answer.<br />
Exercises<br />
One of the (many) hard problems that arises in genome mapping can be<br />
formulated in the following abstract way. We are given a set of n markers<br />
~}--these are positions on a chromosome that we are trying to<br />
map--and our goal is to output a linear ordering of these markers. The<br />
output should be consistent with a set of k constraints, each specified by<br />
a triple (/z~,/z~, ~), requiring that ~i lie between ~z~ and/z~ in the total<br />
ordering that we produce. (Note that this constraint does not specify<br />
which of tzi or ~k should come first in the ordering, only that/z1 should<br />
come between them.)<br />
Now it is not always possible to satisfy all constraints simultaneously,<br />
so we wish to produce an ordering that satisfies as many as possible.<br />
Unforttmately, deciding whether there is an ordering that satisfies at least<br />
k’ of the k constraints is an NP-complete problem (you don’t have to prove<br />
this.)<br />
Give a constant a > 0 (independent of n) and an algorithm with the<br />
following property. If it is possible to satisfy k* of the constraints, then<br />
the algorithm produces an ordering of markers satisfying at least<br />
of the constraints. Your algorithm may be randomized; in this case it<br />
should produce an ordering for which the expected number of satisfied<br />
constraints is at least ak*.<br />
In Section 13.4, we designed an approximation algorithm to within ~ f~ctot<br />
of 7/8 for the MAX 3-SAT Problem, where w~ assumed that each clause<br />
has terms associated with three different variables. In this problem, we<br />
wil! consider the analogous MAX SAT Problem: Given a set of clauses<br />
Ck over a set of variables X = {x~ ..... x~}, ~ind a truth assignment<br />
satisfying as many of the clauses as possible: Each clause has at least<br />
one term in it, and all the variables in a single clause are distinct, but<br />
otherwise we do not make any assumptions on the length of the clauses:<br />
There may be clauses that have a lot of variables, and others may have<br />
just a single variable.<br />
(a) First consider the randomized approximation algorithm we used for<br />
IvIAX 3-SAT, setting each variable independently to true or false with<br />
probability 1/2 each. Show that the expected number of clauses<br />
satisfied by this random assignment is at least k/.2, that is, at least half<br />
of the clauses are satisfied in expectation. Give an example to show<br />
that there are MAX SAT instances such that no assignment satisfies<br />
more than half of the clauses.<br />
(b) If we have a clause that consists only of a single term (e.g., a clause<br />
consisting just of Xl, or just of~), then there is only a single way to satisfy<br />
it: We need to set the corresponding variable in the appropriate<br />
787