08.07.2015 Views

Planning under Uncertainty in Dynamic Domains - Carnegie Mellon ...

Planning under Uncertainty in Dynamic Domains - Carnegie Mellon ...

Planning under Uncertainty in Dynamic Domains - Carnegie Mellon ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.3. A Markov decision process model of uncerta<strong>in</strong>ty 33transitions are calculated from the event and action outcome probabilities, which areconditionally <strong>in</strong>dependent given the states <strong>in</strong> which the correspond<strong>in</strong>g pend<strong>in</strong>g eectsbegan.0.50.3750.2810.211σ= (del (at barge1 Richmond))(add (at barge1 west-coast))3: σ0.752: σ0.751: σ0.75(at barge1 west-coast)ε= (del (poor-weather)).05(add (fair-weather))3: σ 1: ε0.1670.252: σ 1: ε0.1250.251: σ1: ε0.0940.250.07(at barge1 west-coast)1: ε1.0(at barge1 Richmond)(operational barge1)(poor-weather)(speed barge1 1)(ready barge1)(distance Richmond west-coast 4)0.1670.253: υ0.250.75(fair-weather)2: σ2: υ0.1670.1870.751: σ1: υ0.292(fair-weather)0.140.750.386(at barge1 west-coast)(fair-weather)0.1(at barge1 west-coast)NOT (operational barge1)υ= (del (at barge1 Richmond))(add (at barge1 west-coast))(del (operational barge1))0.0830.0833: υ 1: ε0.252: υ2: υ1: ε(fair-weather)0.0630.0830.251: υ1: υ1: ε0.0470.146(fair-weather)0.250.04(at barge1 west-coast)NOT (operational barge1)1: ε0.193(at barge1 west-coast)NOT (operational barge1)(fair-weather)Figure 3.9: The possible sequences of states <strong>in</strong> the <strong>under</strong>ly<strong>in</strong>g Markov decisionprocess aris<strong>in</strong>g from execut<strong>in</strong>g the Move-Barge operator <strong>in</strong> the state shown on theleft, when the exogenous event Weather-Brightens can also take place. Only literalsthat are changed from a parent state are shown.Formal description of the <strong>under</strong>ly<strong>in</strong>g MDPEach state s of the <strong>under</strong>ly<strong>in</strong>g mdp M consists of two parts:1. a truth assignment to the ground literals L, correspond<strong>in</strong>g to a state <strong>in</strong> theplann<strong>in</strong>g doma<strong>in</strong> and referred to as the literal state .2. a set of pend<strong>in</strong>g eects .Each element of the pend<strong>in</strong>g eects is a triple (t; a; ) where t is an <strong>in</strong>teger denot<strong>in</strong>ga time <strong>in</strong>terval, a is an operator or exogenous event and is a set of \eects", aset of add and delete statements over ground literals <strong>in</strong> L. The pend<strong>in</strong>g eectsrepresent the events and actions that are currently tak<strong>in</strong>g place <strong>in</strong> the given state ofM. Intuitively, t denotes the time left before the eect \completes" or is applied tochange the state.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!