11.5.4 Gaining Information Without Sensors

For some problems, it is remarkable that uncertainty may be reduced without even using sensors. Recall Example 11.17. This is counterintuitive because it seems that information regarding the state can only be gained from sensing. It is possible, however, to also gain information from the knowledge that some actions have been executed and the effect that should have in terms of the state transitions. The example presented in this section is inspired by work on sensorless manipulation planning [321,396], which is covered in more detail in Section 12.5.2. This topic underscores the advantages of reasoning in terms of an I-space, as opposed to requiring that accurate state estimates can be made.

Figure 11.28: A top view of a tray that must be tilted to roll the ball into the desired corner.
\begin{figure}\centerline{\psfig{figure=figs/ptilt1.eps,width=2.5truein}}\end{figure}

Figure 11.29: A plan is shown that places the ball in the desired location using a sequence of six tilts, regardless of its initial position and in spite of the fact that there are no sensors. The thickened black lines and black dots indicate the possible locations for the ball: the nondeterministic I-states. Under each picture, the direction that the ball rolls due to the action is written.
\begin{figure}\begin{center}
\begin{tabular}{ccc}
\psfig{figure=figs/ptilt3.eps,...
...\\
4: up & 5: up-left & 6: down-left \\
\end{tabular}\end{center}
\end{figure}

Example 11..24 (Tray Tilting)   The state space, $ X \subset {\mathbb{R}}^2$, indicates the position of a ball that rolls on a flat surface, as shown Figure 11.28. The ball is confined to roll within the polygonal region shown in the figure. It can be imagined that the ball rolls in a tray on which several barriers have been glued to confine its motion (try this experiment at home!). If the tray is tilted, it is assumed that the ball rolls in a direction induced by gravity (in the same way that a ball rolls to the bottom of a pinball machine).

The tilt of the tray is considered as an action that can be chosen by the robot. It is assumed that the initial position of the ball (initial state) is unknown and there are no sensors that can be used to estimate the state. The task is to find some tilting motions that are guaranteed to place the ball in the position shown in Figure 11.28, regardless of its initial position.

The problem could be modeled with continuous time, but this complicates the design. If the tray is tilted in a particular orientation, it is assumed that the ball rolls in a direction, possibly following the boundary, until it comes to rest. This can be considered as a discrete-stage transition: The ball is in some rest state, a tilt action is applied, and a then it enters another rest state. Thus, a discrete-stage state transition equation, $ x_{k+1} = f(x_k,u_k)$, is used.

To describe the tilting actions, we can formally pick directions for the upward normal vector to the tray from the upper half of $ {\mathbb{S}}^2$; however, this can be reduced to a one-dimensional set because the steepness of the tilt is not important, as long as the ball rolls to its new equilibrium state. Therefore, the set of actions can be considered as $ U = {\mathbb{S}}^1$, in which a direction $ u \in {\mathbb{S}}^1$ indicates the direction that the ball rolls due to gravity. Before any action is applied, it is assumed that the tray is initially level (its normal is parallel to the direction of gravity). In practice, one should be more careful and model the motion of the tray between a pair of actions; this is neglected here because the example is only for illustrative purposes. This extra level of detail could be achieved by introducing new state variables that indicate the orientation of the tray or by using continuous-time actions. In the latter case, the action is essentially providing the needed state information, which means that the action function would have to be continuous. Here it is simply assumed that a sequence of actions from $ {\mathbb{S}}^1$ is applied.

The initial condition is $ X_1 = X$ and the history I-state is

$\displaystyle {\eta}_k = (X_1,u_1,u_2,\ldots,u_{k-1}) .$ (11.74)

Since there are no observations, the path through the I-space is predictable. Therefore, a plan, $ \pi $, is simply an action sequence, $ \pi = (u_1,u_2,\ldots,u_K)$, for any desired $ K$.

It is surprisingly simple to solve this task by reasoning in terms of nondeterministic I-states, each of which corresponds to a set of possible locations for the ball. A sequence of six actions, as shown in Figure 11.29, is sufficient to guarantee that the ball will come to rest at the goal position, regardless of its initial position. $ \blacksquare$

Steven M LaValle 2020-08-14