Consider the cost-to-go of executing a plan from a state
. The resulting cost depends on the sequences of states
that are visited, actions that are applied by the plan, and the
applied nature actions. In Chapter 2 this was
obtained by adding the cost terms, but now there is a dependency on
nature. Both worst-case and expected-case analyses are possible,
which generalize the treatment of Section 9.2 to state
spaces and multiple stages.
Let
denote the set of state-action-nature
histories that could arise from
when applied using
as
the initial state. The cost-to-go,
, under a given
plan
from
can be measured using worst-case
analysis as
An optimal plan using worst-case analysis is any plan for which
is minimized over all possible plans (all ways to
assign actions to the states). In the case of feasible planning,
there are usually numerous equivalent alternatives. Sometimes the
task may be only to find a feasible plan, which means that all
trajectories must reach the goal, but the cost does not need to be
optimized.
Using probabilistic uncertainty, the cost of a plan can be measured using expected-case analysis as
An interesting question now emerges: Can the same plan, , be
optimal from every initial state
, or do we need to
potentially find a different optimal plan for each initial state?
Fortunately, a single plan will suffice to be optimal over all initial
states. Why? This behavior was also observed in Section
8.2.1. If
is optimal from some
, then it
must also be optimal from every other state that is potentially
visited by executing
from
. Let
denote some visited
state. If
was not optimal from
, then a better plan would
exist, and the goal could be reached from
with lower cost. This
contradicts the optimality of
because solutions must travel
through
. Let
denote a plan that is optimal from every
initial state.
Steven M LaValle 2020-08-14