11.3.3 The Probabilistic Case: POMDPs

Example 11.14 generalizes nicely to the case of states. In operations research and artificial intelligence literature, these are generally referred to as partially observable Markov decision processes or POMDPs (pronounced ``pom dee peez''). For the case of three states, the probabilistic I-space, ${\cal I}_{prob}$ , is a -simplex embedded in ${\mathbb{R}}^3$ . In general, if $\vert X\vert = n$ , then ${\cal I}_{prob}$ is an -simplex embedded in ${\mathbb{R}}^n$ . The coordinates of a point are expressed as $(p_0,p_1,\ldots,p_{n-1}) \in {\mathbb{R}}^n$ . By the axioms of probability, $p_0 + \cdots + p_{n-1} = 1$ , which implies that ${\cal I}_{prob}$ is an -dimensional subspace of ${\mathbb{R}}^n$ . The vertices of the simplex correspond to the cases in which the state is known; hence, their coordinates are $(0,0,\ldots,0,1)$ , $(0,0,\ldots,0,1,0)$ , $\ldots$ , $(1,0,\ldots,0)$ . For convenience, the simplex can be projected into ${\mathbb{R}}^{n-1}$ by specifying a point in ${\mathbb{R}}^{n-1}$ for which $p_1 + \cdots + p_{n-2} \leq 1$ and then choosing the final coordinate as $p_{n-1} = 1 - p_1 + \cdots + p_{n-2}$ . Section 12.1.3 presents algorithms for planning for POMDPs.

Steven M LaValle 2020-08-14