First consider adding probabilities to the discrete grid problem of Section 12.2.1. A state is once again expressed as . The initial condition is a probability distribution, , over . One reasonable choice is to make a uniform probability distribution, which makes each direction and position equally likely. The robot is once again given four actions, but now assume that nature interferes with state transitions. For example, if , then perhaps with high probability the robot moves forward, but with low probability it may move right, left, or possibly not move at all, even if it is not blocked.
The sensor mapping from Section 12.2.1 indicated whether the robot moved. In the current setting, nature can interfere with this measurement. With low probability, it may incorrectly indicate that the robot moved, when in fact it remained stationary. Conversely, it may also indicate that the robot remained still, when in fact it moved. Since the sensor depends on the previous two states, the mapping is expressed as
To solve the passive localization problem, the expressions from Section 11.2.3 for computing the derived I-states are applied. If the sensor mapping used only the current state, then (11.36), (11.38), and (11.39) would apply without modification. However, since depends on both and , some modifications are needed. Recall that the observations start with for this sensor. Therefore, , instead of applying (11.36).
After each stage, is computed from by first applying (11.38) to take into account the action . Equation (11.39) takes into account the sensor observation, , but is not given because the sensor mapping also depends on . It reduces using marginalization as
(12.21) |
Solving the active localization problem is substantially harder because a search occurs on . The same choices exist as for the discrete localization problem. Computing an information-feedback plan over the whole I-space is theoretically possible but impractical for most environments. The search-based idea that was applied to incrementally grow a directed graph in Section 12.2.1 could also be applied here. The success of the method depends on clever search heuristics developed for this particular problem.
Steven M LaValle 2020-08-14