This section defines the I-map 
 from Figure
11.3, which converts each history I-state into a
probability distribution over
 from Figure
11.3, which converts each history I-state into a
probability distribution over  .  A Markov, probabilistic model is
assumed in the sense that the actions of nature only depend on the
current state and action, as opposed to state or action histories.  The
set union and intersection of (11.30) and
(11.31) are replaced in this section by marginalization
and Bayes' rule, respectively.  In a sense, these are the probabilistic
equivalents of union and intersection.  It will be very helpful to
compare the expressions from this section to those of Section
11.2.2.
.  A Markov, probabilistic model is
assumed in the sense that the actions of nature only depend on the
current state and action, as opposed to state or action histories.  The
set union and intersection of (11.30) and
(11.31) are replaced in this section by marginalization
and Bayes' rule, respectively.  In a sense, these are the probabilistic
equivalents of union and intersection.  It will be very helpful to
compare the expressions from this section to those of Section
11.2.2.  
Rather than write 
 , standard probability notation
will be applied to obtain
, standard probability notation
will be applied to obtain 
 .  Most expressions in this
section of the form
.  Most expressions in this
section of the form 
 have an analogous expression in
Section 11.2.2 of the form
 have an analogous expression in
Section 11.2.2 of the form 
 .  It is helpful to
recognize the similarities.
.  It is helpful to
recognize the similarities.
The first step is to construct probabilistic versions of  and
 and  .
These are
.
These are 
 and
 and 
 , respectively.  The
latter term was given in Section 10.1.1.  To obtain
, respectively.  The
latter term was given in Section 10.1.1.  To obtain
 , recall from Section 11.1.1 that
, recall from Section 11.1.1 that 
 is easily derived from
is easily derived from 
 .  To obtain
.  To obtain 
 ,
Bayes' rule is applied:
,
Bayes' rule is applied:
 was rewritten using marginalization,
(9.8).  In this case
 was rewritten using marginalization,
(9.8).  In this case  appears as the sum index;
therefore, the denominator is only a function of
 appears as the sum index;
therefore, the denominator is only a function of  , as required.
Bayes' rule requires knowing the prior,
, as required.
Bayes' rule requires knowing the prior,  .  In the coming
expressions, this will be replaced by a probabilistic I-state.
.  In the coming
expressions, this will be replaced by a probabilistic I-state.
Now consider defining probabilistic I-states.  Each is a probability
distribution over  and is written as
 and is written as 
 .  The initial
condition produces
.  The initial
condition produces  .  As for the nondeterministic case,
probabilistic I-states can be computed inductively.  For the base
case, the only new piece of information is
.  As for the nondeterministic case,
probabilistic I-states can be computed inductively.  For the base
case, the only new piece of information is  .  Thus, the
probabilistic I-state,
.  Thus, the
probabilistic I-state, 
 , is
, is 
 .  This is
computed by letting
.  This is
computed by letting  in (11.35) to yield
 in (11.35) to yield
Now consider the inductive step by assuming that 
 is
given.  The task is to determine
 is
given.  The task is to determine 
 , which is
equivalent to
, which is
equivalent to 
 .  As in Section
11.2.2, this will proceed in two parts by first considering
the effect of
.  As in Section
11.2.2, this will proceed in two parts by first considering
the effect of  , followed by
, followed by  .  The first step is to
determine
.  The first step is to
determine 
 from
 from 
 .  First, note
that
.  First, note
that
 contains no additional information regarding the
prediction of
 contains no additional information regarding the
prediction of  once
 once  is given.
Marginalization, (9.8), can
be used to eliminate
 is given.
Marginalization, (9.8), can
be used to eliminate  from
 from 
 .  This must be
eliminated because it is not given.  Putting these steps together
yields
.  This must be
eliminated because it is not given.  Putting these steps together
yields
 in terms of given quantities.
Equation (11.38) can be considered as the probabilistic
counterpart of (11.30).
 in terms of given quantities.
Equation (11.38) can be considered as the probabilistic
counterpart of (11.30).
The next step is to take into account the observation  .
This is accomplished by making a version of (11.35) that is
conditioned on the information accumulated so far:
.
This is accomplished by making a version of (11.35) that is
conditioned on the information accumulated so far:  and
 and  .
Also,
.
Also,  is replaced with
 is replaced with  .  The result is
.  The result is
 , which is the probabilistic
I-state for stage
, which is the probabilistic
I-state for stage  , as desired.  There are two different kinds of
terms on the right.  The expression for
, as desired.  There are two different kinds of
terms on the right.  The expression for 
 is
given in (11.38).  Therefore, the only remaining term to
calculate is
 is
given in (11.38).  Therefore, the only remaining term to
calculate is 
 .  Note that
.  Note that
 is specified
as part of the sensor model, we have now determined how to obtain
 is specified
as part of the sensor model, we have now determined how to obtain
 from
 from 
 ,
,  , and
, and  .
Thus,
.
Thus, 
 is another I-space that can be treated as just
another state space.
 is another I-space that can be treated as just
another state space.
The probabilistic I-space 
 (shown in Figure
11.3) is the set of all probability distributions over
 (shown in Figure
11.3) is the set of all probability distributions over
 .  The update expressions, (11.38) and
(11.39), establish that the I-map
.  The update expressions, (11.38) and
(11.39), establish that the I-map 
 is
sufficient, which means that the planning problem can be expressed
entirely in terms of
 is
sufficient, which means that the planning problem can be expressed
entirely in terms of 
 , instead of maintaining histories.  A
goal region can be specified as constraints on the probabilities.  For
example, from some particular
, instead of maintaining histories.  A
goal region can be specified as constraints on the probabilities.  For
example, from some particular  , the goal might be to reach
any probabilistic I-state for which
, the goal might be to reach
any probabilistic I-state for which 
 .
.
|  | 
 denote the probability
that the current state is
 denote the probability
that the current state is  .  Any probabilistic I-state can be
expressed as
.  Any probabilistic I-state can be
expressed as 
 .  This implies that the I-space
can be nicely embedded in
.  This implies that the I-space
can be nicely embedded in 
 .  By the axioms of probability
(given in Section 9.1.2),
.  By the axioms of probability
(given in Section 9.1.2), 
 , which can
be interpreted as a plane equation in
, which can
be interpreted as a plane equation in 
 that restricts
 that restricts
 to a 2D set.  Also following the axioms of probability, for
each
 to a 2D set.  Also following the axioms of probability, for
each 
 ,
, 
 .  This means that
.  This means that
 is restricted to a triangular region in
 is restricted to a triangular region in 
 .  The
vertices of this triangular region are
.  The
vertices of this triangular region are  ,
,  , and
, and
 ; these correspond to the three different ways to have
perfect state information.  In a sense, the distance away from these
points corresponds to the amount of uncertainty in the state.  The
uniform probability distribution
; these correspond to the three different ways to have
perfect state information.  In a sense, the distance away from these
points corresponds to the amount of uncertainty in the state.  The
uniform probability distribution 
 is equidistant from
the three vertices.  A projection of the triangular region into
 is equidistant from
the three vertices.  A projection of the triangular region into
 is shown in Figure 11.6.  The interpretation in
this case is that
 is shown in Figure 11.6.  The interpretation in
this case is that  and
 and  specify a point in
 specify a point in 
 , and
, and
 is automatically determined from
 is automatically determined from 
 .
.
The triangular region in 
 is an uncountably infinite set, even
though the history I-space is countably infinite for a fixed initial
condition.  This may seem strange, but there is no mistake because for
a fixed initial condition, it is generally impossible to reach all of
the points in
 is an uncountably infinite set, even
though the history I-space is countably infinite for a fixed initial
condition.  This may seem strange, but there is no mistake because for
a fixed initial condition, it is generally impossible to reach all of
the points in 
 .  If the initial condition can be any point
in
.  If the initial condition can be any point
in 
 , then all of the probabilistic I-space is covered
because
, then all of the probabilistic I-space is covered
because 
 , in which
, in which 
 is the initial condition
space..
 is the initial condition
space..
 
 
Steven M LaValle 2020-08-14