11.2.3 Probabilistic Information Spaces

This section defines the I-map ${{\kappa}_{prob}}$ from Figure 11.3, which converts each history I-state into a probability distribution over

. A Markov, probabilistic model is assumed in the sense that the actions of nature only depend on the current state and action, as opposed to state or action histories. The set union and intersection of (11.30) and (11.31) are replaced in this section by marginalization and Bayes' rule, respectively. In a sense, these are the probabilistic equivalents of union and intersection. It will be very helpful to compare the expressions from this section to those of Section 11.2.2.

Rather than write ${{\kappa}_{prob}}({\eta})$ , standard probability notation will be applied to obtain $P(x\vert{\eta})$ . Most expressions in this section of the form $P(x_k\vert\cdot)$ have an analogous expression in Section 11.2.2 of the form $X_k(\cdot)$ . It is helpful to recognize the similarities.

The first step is to construct probabilistic versions of

and

. These are $P(x_k\vert y_k)$ and $P(x_{k+1}\vert x_k,u_k)$ , respectively. The latter term was given in Section 10.1.1. To obtain $P(x_k\vert y_k)$ , recall from Section 11.1.1 that $P(y_k\vert x_k)$ is easily derived from $P(\psi_k\vert x_k)$ . To obtain $P(x_k\vert y_k)$ , Bayes' rule is applied:

Now consider defining probabilistic I-states. Each is a probability distribution over

and is written as $P(x_k\vert{\eta}_k)$ . The initial condition produces

. As for the nondeterministic case, probabilistic I-states can be computed inductively. For the base case, the only new piece of information is

. Thus, the probabilistic I-state, $P(x_1\vert{\eta}_1)$ , is $P(x_1\vert y_1)$ . This is computed by letting

in (11.35) to yield

Now consider the inductive step by assuming that $P(x_k\vert{\eta}_k)$ is given. The task is to determine $P(x_{k+1} \vert {\eta}_{k+1})$ , which is equivalent to $P(x_{k+1} \vert {\eta}_k,u_k,y_{k+1})$ . As in Section 11.2.2, this will proceed in two parts by first considering the effect of

, followed by $y_{k+1}$ . The first step is to determine $P(x_{k+1}\vert{\eta}_k,u_k)$ from $P(x_k\vert{\eta}_k)$ . First, note that

The next step is to take into account the observation $y_{k+1}$ . This is accomplished by making a version of (11.35) that is conditioned on the information accumulated so far: ${\eta}_k$ and

. Also,

is replaced with

. The result is

The probabilistic I-space ${\cal I}_{prob}$ (shown in Figure 11.3) is the set of all probability distributions over

. The update expressions, (11.38) and (11.39), establish that the I-map ${{\kappa}_{prob}}$ is sufficient, which means that the planning problem can be expressed entirely in terms of ${\cal I}_{prob}$ , instead of maintaining histories. A goal region can be specified as constraints on the probabilities. For example, from some particular $x \in X$ , the goal might be to reach any probabilistic I-state for which $P(x_k \vert {\eta}_k) > 1/2$ .

**Figure 11.6:** The probabilistic I-space for the three-state example is a 2-simplex embedded in ${\mathbb{R}}^3$ . This simplex can be projected into ${\mathbb{R}}^2$ to yield the depicted triangular region in ${\mathbb{R}}^2$ .
$\begin{figure}\centerline{\psfig{figure=figs/psimplex.eps,width=1.7truein} }\end{figure}$

Example 11..14 (Three-State Example Revisited) Now return to Example 11.13, but this time use probabilistic models. For a probabilistic I-state, let

denote the probability that the current state is $i \in X$ . Any probabilistic I-state can be expressed as $(p_0,p_1,p_2) \in {\mathbb{R}}^3$ . This implies that the I-space can be nicely embedded in ${\mathbb{R}}^3$ . By the axioms of probability (given in Section 9.1.2),

, which can be interpreted as a plane equation in ${\mathbb{R}}^3$ that restricts ${\cal I}_{prob}$ to a 2D set. Also following the axioms of probability, for each $i \in \{0,1,2\}$ , $0 \leq p_i \leq 1$ . This means that ${\cal I}_{prob}$ is restricted to a triangular region in ${\mathbb{R}}^3$ . The vertices of this triangular region are

, and

; these correspond to the three different ways to have perfect state information. In a sense, the distance away from these points corresponds to the amount of uncertainty in the state. The uniform probability distribution

is equidistant from the three vertices. A projection of the triangular region into ${\mathbb{R}}^2$ is shown in Figure 11.6. The interpretation in this case is that

and

specify a point in ${\mathbb{R}}^2$ , and

is automatically determined from

The triangular region in ${\mathbb{R}}^3$ is an uncountably infinite set, even though the history I-space is countably infinite for a fixed initial condition. This may seem strange, but there is no mistake because for a fixed initial condition, it is generally impossible to reach all of the points in ${\cal I}_{prob}$ . If the initial condition can be any point in ${\cal I}_{prob}$ , then all of the probabilistic I-space is covered because ${{\cal I}_0}= {\cal I}_{prob}$ , in which ${{\cal I}_0}$ is the initial condition space.. $\blacksquare$