Probabilistic forward projections

The probabilistic forward projection can be considered as a Markov process because the ``decision'' part is removed once the actions are given. Suppose that $ x_k$ is given and $ u_k$ is applied. What is the probability distribution over $ x_{k+1}$? This was already specified in (10.6) and is the one-stage forward projection. Now consider the two-stage probabilistic forward projection, $ P(x_{k+2}\vert x_k,u_k,u_{k+1})$. This can be computed by marginalization as

$\displaystyle P(x_{k+2}\vert x_k,u_k,u_{k+1}) = \sum_{x_{k+1} \in X} P(x_{k+2}\vert x_{k+1},u_{k+1}) P(x_{k+1}\vert x_k,u_k) .$ (10.13)
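The marginalization in (10.13) can be sketched numerically. As an illustrative assumption, take the transition model of Example 10.1 with uniform nature actions: $ x_{k+1} = x_k + u_k + \theta$, with $ \theta \in \{-1,0,1\}$, each having probability $ 1/3$.

```python
# Hypothetical one-stage model (Example 10.1 with uniform nature actions):
# x_{k+1} = x_k + u + theta, theta in {-1, 0, 1}, each with probability 1/3.
def p_one_stage(x_next, x, u):
    return 1.0 / 3.0 if x_next - x - u in (-1, 0, 1) else 0.0

def p_two_stage(x2, x0, u0, u1, X):
    # Marginalize over the intermediate state x_{k+1}, as in (10.13).
    return sum(p_one_stage(x2, x1, u1) * p_one_stage(x1, x0, u0) for x1 in X)
```

For instance, `p_two_stage(4, 0, 2, 2, range(-5, 10))` gives $ 3/9$, which matches the two-stage projection computed in Example 10.4.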

Computing further forward projections requires nested summations, which marginalize all of the intermediate states. For example, the three-stage forward projection is

\begin{displaymath}\begin{split}P(x_{k+3}\vert x_k,u_k,& u_{k+1},u_{k+2}) = \\ & \sum_{x_{k+2} \in X} P(x_{k+3}\vert x_{k+2},u_{k+2}) \sum_{x_{k+1} \in X} P(x_{k+2}\vert x_{k+1},u_{k+1}) P(x_{k+1}\vert x_k,u_k) . \end{split}\end{displaymath} (10.14)

A convenient expression of the probabilistic forward projections can be obtained by borrowing nice algebraic properties from linear algebra. For each action $ u \in U$, let its state transition matrix $ M_u$ be an $ n \times n$ matrix of probabilities, in which $ n = \vert X\vert$. The matrix is defined as

$\displaystyle M_u = \begin{pmatrix}m_{1,1} & m_{1,2} & \cdots & m_{1,n} \\ m_{2,1} & m_{2,2} & \cdots & m_{2,n} \\ \vdots & \vdots & & \vdots \\ m_{n,1} & m_{n,2} & \cdots & m_{n,n} \end{pmatrix},$ (10.15)

in which

$\displaystyle m_{i,j} = P(x_{k+1} = i \; \vert \; x_k = j, \;u) .$ (10.16)

For each $ j$, the $ j$th column of $ M_u$ must sum to one and can be interpreted as the probability distribution over $ X$ that is obtained if $ u$ is applied from state $ x_k = j$.
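As a sketch of (10.15) and (10.16), the matrix for the model of Example 10.1 (with nature actions made uniform, an illustrative assumption) can be built and checked as follows; transitions that would leave a finite $ X = \{1,\ldots,n\}$ are clipped to the boundary here, an assumption made only so that every column still sums to one:

```python
import numpy as np

def transition_matrix(n, u):
    # m_{i,j} = P(x_{k+1} = i | x_k = j, u), as in (10.16), for
    # X = {1, ..., n} and the model x_{k+1} = x_k + u + theta with
    # theta uniform on {-1, 0, 1}; transitions leaving X are clipped.
    M = np.zeros((n, n))
    for j in range(1, n + 1):                  # column: current state x_k = j
        for theta in (-1, 0, 1):
            i = min(max(j + u + theta, 1), n)  # row: next state x_{k+1} = i
            M[i - 1, j - 1] += 1.0 / 3.0
    return M

M = transition_matrix(6, 2)
assert np.allclose(M.sum(axis=0), 1.0)  # every column is a distribution over X
```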

Let $ v$ denote an $ n$-dimensional column vector that represents any probability distribution over $ X$. The product $ M_u v$ yields a column vector that represents the probability distribution over $ X$ that is obtained after starting with $ v$ and applying $ u$. The matrix multiplication performs $ n$ inner products, each of which is a marginalization as shown in (10.13). The forward projection at any stage, $ k$, can now be expressed using a product of $ k-1$ state transition matrices. Suppose that $ {\tilde{u}}_{k-1}$ is fixed. Let $ v = [0 \; 0 \; \cdots \; 0 \; 1 \; 0 \; \cdots \; 0]$, which indicates that $ x_1$ is known (with probability one). The forward projection can be computed as

$\displaystyle v^\prime = M_{u_{k-1}} M_{u_{k-2}} \cdots M_{u_2} M_{u_1} v .$ (10.17)

The $ i$th element of $ v^\prime$ is $ P(x_k = i \;\vert\; x_1, {\tilde{u}}_{k-1})$.
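A minimal sketch of (10.17), using a made-up two-state column-stochastic matrix rather than any model from the text:

```python
import numpy as np

def forward_projection(matrices, v):
    # v' = M_{u_{k-1}} ... M_{u_1} v, as in (10.17); the matrix for the
    # earliest action is applied first, so iterate in chronological order.
    for M in matrices:   # ordered as M_{u_1}, M_{u_2}, ..., M_{u_{k-1}}
        v = M @ v
    return v

# Made-up column-stochastic matrix for a two-state illustration.
M = np.array([[0.5, 0.2],
              [0.5, 0.8]])
v = np.array([1.0, 0.0])                  # x_1 known with probability one
v_prime = forward_projection([M, M], v)   # distribution over x_3
```

Since each $ M_u$ is column-stochastic, the result remains a probability distribution at every stage.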

Example 10.4 (Probabilistic Forward Projections)   Once again, use the first model from Example 10.1; however, now assign probability $ 1/3$ to each nature action. Assume that, initially, $ x_1 = 0$, and $ u=2$ is applied in every stage. The one-stage forward projection yields probabilities

$\displaystyle [1/3 \;\; 1/3 \;\; 1/3]$ (10.18)

over the sequence of states $ (1,2,3)$. The two-stage forward projection yields

$\displaystyle [1/9 \;\; 2/9 \;\; 3/9 \;\; 2/9 \;\; 1/9]$ (10.19)

over $ (2,3,4,5,6)$. $ \blacksquare$
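The numbers in this example can be checked by brute-force enumeration of the nature-action sequences (an informal check, not code from the text):

```python
from collections import Counter
from itertools import product

def projection(stages):
    # Enumerate all nature-action sequences theta in {-1, 0, 1}^stages for
    # x_1 = 0 and u = 2 at every stage; the final state is
    # 2*stages + sum(thetas).
    counts = Counter()
    for thetas in product((-1, 0, 1), repeat=stages):
        counts[2 * stages + sum(thetas)] += 1
    total = 3 ** stages
    return {x: c / total for x, c in sorted(counts.items())}

print(projection(1))  # uniform (1/3 each) over states 1, 2, 3
print(projection(2))  # 1/9, 2/9, 3/9, 2/9, 1/9 over states 2, 3, 4, 5, 6
```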

Steven M LaValle 2020-08-14