Planning under Uncertainty in Dynamic Domains - Carnegie Mellon ...

More documents

Recommendations

Info

26 Chapter 3. Planning under Uncertaintyn5 Goal (disposed-oil) top-leveln6 Operator unload-oiln7 Bindings n8 Goal (oil-in-barge barge1) for n7n9 Operator pump-oiln10 Bindings n11 Goal (at barge1 west-coast) for n10n12 Operator move-bargen13 Bindings n14 Apply apply n13n16 Goal (at barge1 richmond) for n7n17 Operator move-bargen18 Bindings Table 3.2: An annotated trace of prodigy 4.0 which shows a sequence of searchtree nodes created to produce the candidate plan shown in Figure 3.3. Each nodeexcept n5 isachild of the node in the line above. The text in italics has been addedto the trace output for clarity of presentation.is a plan in the original problem with probability of success equal to that expectedvalue, and conversely if there is a plan then there is such a policy. A description ofMarkov decision processes was given in Section 2.4.Recall that in prodigy 4.0, a planning domain consists of a type hierarchy T ,aset of literals L and a set of operators O. A planning problem consists of a planningdomain, a set of objects belonging to each type in the hierarchy, an initial state anda goal description. Each literal in the domain has a type signature, specifying thenumber of arguments and the type of each argument, which is a member of T. Giventhe objects in the planning problem, the set of all possible ground literals L can beconstructed by lling in the arguments of each literal's type signature with all objectsthat match, which are those objects of the exact typeorany subtype of the type inthe signature.Consider the oil-spill planning problem that has been used as an example in theprevious sections. The domain includes the literal at with signature (at BargePlace). In the planning problem, there are two objects of type Barge, barge1 andbarge2 and two objects belonging to subtypes of the type Place, Richmond of typeDock and west-coast of type Sea-Sector. Therefore there are four ground literalsderived from at in the set L: (at barge1 Richmond), (at barge1 west-coast),(at barge2 Richmond) and (at barge2 west-coast).A state in a planning problem in prodigy 4.0 assigns a value of true or falseto each literal in the ground literals L. Therefore there are 2 jLj possible states. Someof these assignments may not correspond to valid states in the planning domain beingmodelled. For example, (at barge1 Richmond) and (at barge1 west-coast)
3.2. A representation for planning under uncertainty 27cannot both be true at the same time. However since any state that is reachableby a sequence of actions in the domain from a valid initial state will also be valid,these invalid states are not an issue in practice. This state property ofvalidity couldbe partially derived by considering the states reachable by applying actions to anyphysically possible initial state, or it could be enforced by adding domain axioms suchas those used in [Knoblock 1991]. In what follows I will ignore this distinction.In Weaver the planning domain is generalised as follows.1. The operators in the domain include a duration which isaninteger-valued functionof the bindings of the operator (and therefore a integer for an instantiatedaction or step). This integer may represent any time unit, for example secondsor hours, although the unit must be the same for dierent operators in the samedomain.2. Operators may specify a discrete, conditional probability distribution of possibleoutcomes rather than the single possible outcome used in prodigy 4.0. Anexample of this will be described in more detail below.3. A planning domain includes a set of exogenous events E as well as the set ofoperators O. These are syntactically very similar to operators but are used tospecify the way that the world can change independently of the actions taken ina plan, as I describe below. For example, they can be used to model the actionsof other agents or natural processes.4. A total precedence order < is given over the actions and events. This is used toresolve conicts between their eects if more than one action or event produceschanges to a state. An example is given below.Weaver generalises prodigy 4.0's denition of a planning problem by specifyinga probability distribution of possible initial states rather than a single initial state.The problem also includes a threshold probability, , a minimum probability of successthat a plan must equal or exceed to be considered a solution. The objects and goalstatement in the planning problem are unchanged.In the rest of this section I make the semantics of planning domains and problemsin Weaver precise in terms of an underlying Markov decision process M dened bya planning problem. While this denition is needed to prove that Weaver correctlycomputes probabilities for plans and to discuss its coverage, on a casual reading of thethesis it can be skipped and replaced with the following summary: at each time step,several events may take place simultaneously with one action as a plan is executed.When more than one event or action complete in one time step, their results areapplied to the state in parallel. If more than one possible value is specied for someground literal in the state, the value nominated by the event or action that is highestin the pre-specied precedence order is used. Actions are usually higher than eventsin the precedence order.
Page 1 and 2: Planning under Uncertainty in Dynam
Page 3: AbstractPlanning, the process of nd
Page 6 and 7: 4.3.1 Analysing the belief net and
Page 8 and 9: viii
Page 10 and 11: 7.4 Weaver's solution to the exampl
Page 12 and 13: 3.12 Reachability graph of literal
Page 14 and 15: 6.1 Operators in the parameterised
Page 16 and 17: xvi
Page 18 and 19: xviii
Page 20 and 21: 2 Chapter 1. Introductionif the pri
Page 22 and 23: 4 Chapter 1. Introductionweather co
Page 24 and 25: 6 Chapter 1. Introductionnet nodes
Page 26 and 27: 8 Chapter 1. Introduction
Page 28 and 29: 10 Chapter 2. Related workIn additi
Page 30 and 31: 12 Chapter 2. Related workmakes use
Page 32 and 33: 14 Chapter 2. Related workall the s
Page 34 and 35: 16 Chapter 2. Related workValue-Ite
Page 36 and 37: 18 Chapter 2. Related workdescent [
Page 38 and 39: 20 Chapter 3. Planning under Uncert
Page 62 and 63: 44 Chapter 4. The Weaver Algorithmi
Page 64 and 65: 46 Chapter 4. The Weaver Algorithmn
Page 66 and 67: 48 Chapter 4. The Weaver AlgorithmB
Page 68 and 69: 50 Chapter 4. The Weaver Algorithm
Page 70 and 71: 52 Chapter 4. The Weaver Algorithm0
Page 72 and 73: 54 Chapter 4. The Weaver AlgorithmI
Page 74 and 75: 56 Chapter 4. The Weaver Algorithmd
Page 76 and 77: 58 Chapter 4. The Weaver Algorithm(
Page 78 and 79: 60 Chapter 4. The Weaver AlgorithmT
Page 80 and 81: 62 Chapter 4. The Weaver Algorithmn
Page 82 and 83: 64 Chapter 4. The Weaver Algorithm4
Page 84 and 85: 66 Chapter 4. The Weaver Algorithml
Page 86 and 87: 68 Chapter 4. The Weaver Algorithmc
Page 88 and 89: 70 Chapter 5. Eciency improvements
Page 94 and 95:
76 Chapter 5. Eciency improvements
Page 96 and 97:
Page 98 and 99:
Page 100 and 101:
Page 102 and 103:
Page 104 and 105:
Page 106 and 107:
Page 108 and 109:
Page 110 and 111:
Page 112 and 113:
94 Chapter 7. Experimental results
Page 114 and 115:
Page 116 and 117:
Page 118 and 119:
Page 120 and 121:
Page 122 and 123:
Page 124 and 125:
Page 126 and 127:
Page 128 and 129:
Page 130 and 131:
Page 132 and 133:
Page 134 and 135:
Page 136 and 137:
118 Chapter 8. Conclusions The appl
Page 138 and 139:
120 Chapter 8. Conclusions
Page 140 and 141:
122 Appendix A. Proofs of theoremso
Page 142 and 143:
124 Appendix A. Proofs of theoremsN
Page 144 and 145:
126 Appendix B. The Oil-spill domai
Page 146 and 147:
Page 148 and 149:
Page 150 and 151:
Page 152 and 153:
Page 154 and 155:
Page 156 and 157:
Page 158 and 159:
Page 160 and 161:
Page 162 and 163:
Page 164 and 165:
Page 166 and 167:
Page 168 and 169:
Page 170 and 171:
Page 172 and 173:
Page 174 and 175:
Page 176 and 177:
158 BIBLIOGRAPHY[Blythe & Veloso 19
Page 178 and 179:
160 BIBLIOGRAPHY[Drummond & Bresina
Page 180 and 181:
162 BIBLIOGRAPHY[Koenig & Simmons 1
Page 182 and 183:
164 BIBLIOGRAPHY[Schoppers 1989b] S
Page 184:
166 BIBLIOGRAPHY
show all

Planning under Uncertainty in Dynamic Domains - Carnegie Mellon ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?