Planning under Uncertainty in Dynamic Domains - Carnegie Mellon ...

More documents

Recommendations

Info

24 Chapter 3. Planning under Uncertaintyin which (oil-in-barge barge1) is true. There is only one other way that theplan in Figure 3.4 can be expanded, which isbymoving the step (Move-Barge=west-coast =Richmond) to the head plan. However, this candidateplan would include a state loop, the situation where the exact same state is visitedtwice while the head plan is executed. This plan is therefore pruned from the searchspace, since if it leads to a solution there must be another more ecient solution thatavoids the state loop. The plan in Figure 3.4 cannot be expanded by adding a stepto the tail plan because there are no open preconditions in the tail plan.A summary of the prodigy 4.0 algorithm is shown in Table 3.1 (taken fromal. [Veloso et al. 1995]).prodigy 4.0(G,I)1. Current state C := initial state I,Head-Plan := null,Tail-Plan := null. 2. If the goal statement G is satised in the current state C, thenreturn Head-Plan.3. Either(A) Back-Chainer adds an operator to the Tail-Plan, or(B) Operator-Application moves an operator from Tail-Plan to Head-Plan.Decision point: Decide whether to apply an operator or to add an operator to thetail.4. Goto 2.Operator-Application1. Pick an operator op in Tail-Plan such that(A) there is no operator in Tail-Plan ordered before op, and(B) the preconditions of op are satised in the current state C.Decision point: Choose an operator to apply.2. Move op to the end of Head-Plan and update the current state C.Table 3.1: Summary of the prodigy 4.0 planning algorithm.point is a potential backtracking point in the algorithm.Each decision3.1.3 Organization of the search spaceTable 3.1 gives a nondeterministic algorithm to construct a plan for a given planningproblem. prodigy 4.0 maintains an audit trail of the search if performs whileconstructing the plan, called the search tree. A description of the search tree isuseful for explaining some extensions made to prodigy 4.0 to support conditionalplanning and planning with external events. It is also used in Chapter 4 to describehow Weaver interacts with prodigy 4.0.The nodes in the search tree represent the choices made at each decision point ofthe algorithm, and are typically of four types. An applied operator node represents
3.2. A representation for planning under uncertainty 25the choice to move a particular step from the tail plan to the head plan. As analternative tomoving a step to the head plan, a goal node represents the decision tofocus on an open condition in the tail plan (also known as a goal). The children of agoal node must all be operator nodes, which represent the decision to use a particularoperator schema to achieve the goal represented by the parent node. The children ofan operator node must all be bindings nodes, which represent the decision to use aparticular set of bindings to instantiate the operator represented by the parent node.The combination of a goal node, an operator node and a bindings node togetherrepresent the act of prodigy 4.0 adding a step into the tail plan.While the tail plan and the head plan together represent one candidate plan, thesearch tree represents all the candidate plans that have been examined and also allthe ways that new candidate plans can be expanded. A path from the root node toany other node in the search tree corresponds to a candidate partial plan, which canbe reconstructed by following the decisions encoded in the nodes in the path. Forexample, the candidate plan in Figure 3.4 is actually generated by prodigy 4.0 asthe sequence of search tree nodes shown in Table 3.2. In this sequence, each nodeisthe child of the node in the line above 3 , so the candidate plan is produced withoutbacktracking over any choices. Other sequences of search nodes could produce thecandidate plan just as well: in particular the order in which prodigy 4.0 works onthe goals (oil-in-barge barge1) and (at barge1 Richmond) doesn't matter. Thechoice of this particular sequence was largely due to search heuristics that prodigy4.0 uses, which will not be discussed here (but see [Blythe & Veloso 1992] and[Carbonell et al. 1992]).3.2 A representation for planning under uncertaintyIn Section 3.1 I provided a brief description of planning domains, planning problemsand plans in prodigy 4.0. Here I extend those denitions to the versions usedby Weaver, in which planning domains and problems also contain information aboutuncertainty in the domain, and plans can specify a number of contingencies. Inaddition to operators, which can have more than one possible outcome, planningdomains in Weaver also contain exogenous events, which specify ways that the worldcan be changed independently of the actions in a plan.In order to give a precise characterisation of the problems that Weaver can representand solve, I describe Weaver's representation scheme in terms of Markov decisionprocesses. Specically, I describe how to take a planning problem in Weaver's languageand construct a Markov decision process M which is equivalent in the sense thatif there is a non-looping policy for M with an expected value greater than 0 then there3 except n5 which is the child of a special node that prodigy 4.0 uses to add the top-level goalsto its list of goals.
Page 1 and 2: Planning under Uncertainty in Dynam
Page 3: AbstractPlanning, the process of nd
Page 6 and 7: 4.3.1 Analysing the belief net and
Page 8 and 9: viii
Page 10 and 11: 7.4 Weaver's solution to the exampl
Page 12 and 13: 3.12 Reachability graph of literal
Page 14 and 15: 6.1 Operators in the parameterised
Page 16 and 17: xvi
Page 18 and 19: xviii
Page 20 and 21: 2 Chapter 1. Introductionif the pri
Page 22 and 23: 4 Chapter 1. Introductionweather co
Page 24 and 25: 6 Chapter 1. Introductionnet nodes
Page 26 and 27: 8 Chapter 1. Introduction
Page 28 and 29: 10 Chapter 2. Related workIn additi
Page 30 and 31: 12 Chapter 2. Related workmakes use
Page 32 and 33: 14 Chapter 2. Related workall the s
Page 34 and 35: 16 Chapter 2. Related workValue-Ite
Page 36 and 37: 18 Chapter 2. Related workdescent [
Page 38 and 39: 20 Chapter 3. Planning under Uncert
Page 62 and 63: 44 Chapter 4. The Weaver Algorithmi
Page 64 and 65: 46 Chapter 4. The Weaver Algorithmn
Page 66 and 67: 48 Chapter 4. The Weaver AlgorithmB
Page 68 and 69: 50 Chapter 4. The Weaver Algorithm
Page 70 and 71: 52 Chapter 4. The Weaver Algorithm0
Page 72 and 73: 54 Chapter 4. The Weaver AlgorithmI
Page 74 and 75: 56 Chapter 4. The Weaver Algorithmd
Page 76 and 77: 58 Chapter 4. The Weaver Algorithm(
Page 78 and 79: 60 Chapter 4. The Weaver AlgorithmT
Page 80 and 81: 62 Chapter 4. The Weaver Algorithmn
Page 82 and 83: 64 Chapter 4. The Weaver Algorithm4
Page 84 and 85: 66 Chapter 4. The Weaver Algorithml
Page 86 and 87: 68 Chapter 4. The Weaver Algorithmc
Page 88 and 89: 70 Chapter 5. Eciency improvements
Page 90 and 91: 72 Chapter 5. Eciency improvements
Page 92 and 93:
74 Chapter 5. Eciency improvements
Page 94 and 95:
Page 96 and 97:
Page 98 and 99:
Page 100 and 101:
Page 102 and 103:
Page 104 and 105:
Page 106 and 107:
Page 108 and 109:
Page 110 and 111:
Page 112 and 113:
94 Chapter 7. Experimental results
Page 114 and 115:
Page 116 and 117:
Page 118 and 119:
Page 120 and 121:
Page 122 and 123:
Page 124 and 125:
Page 126 and 127:
Page 128 and 129:
Page 130 and 131:
Page 132 and 133:
Page 134 and 135:
Page 136 and 137:
118 Chapter 8. Conclusions The appl
Page 138 and 139:
120 Chapter 8. Conclusions
Page 140 and 141:
122 Appendix A. Proofs of theoremso
Page 142 and 143:
124 Appendix A. Proofs of theoremsN
Page 144 and 145:
126 Appendix B. The Oil-spill domai
Page 146 and 147:
Page 148 and 149:
Page 150 and 151:
Page 152 and 153:
Page 154 and 155:
Page 156 and 157:
Page 158 and 159:
Page 160 and 161:
Page 162 and 163:
Page 164 and 165:
Page 166 and 167:
Page 168 and 169:
Page 170 and 171:
Page 172 and 173:
Page 174 and 175:
Page 176 and 177:
158 BIBLIOGRAPHY[Blythe & Veloso 19
Page 178 and 179:
160 BIBLIOGRAPHY[Drummond & Bresina
Page 180 and 181:
162 BIBLIOGRAPHY[Koenig & Simmons 1
Page 182 and 183:
164 BIBLIOGRAPHY[Schoppers 1989b] S
Page 184:
166 BIBLIOGRAPHY
show all

Planning under Uncertainty in Dynamic Domains - Carnegie Mellon ...

Create successful ePaper yourself

Delete template?

Save as template?