14.5.1 Problem Definition

Formulation 14.1 assumed that feedback is not necessary. If the initial state is given, then the solution takes the form of an action trajectory, which upon integration yields a time-parametrized path through $ {X_{free}}$. This extended the Piano Mover's Problem of Section 4.3.1 to include phase spaces and differential constraints. Now suppose that feedback is required. The reasons may be that the initial state is not given or the plan execution might not be predictable due to disturbances or errors in the system model. Recall the motivation from Section 8.1.

With little effort, the feedback motion planning framework from Chapter 8 can be extended to handle differential constraints. Compare Formulations 8.2 and 14.1. Feedback motion planning under differential constraints is obtained by making the following adjustments to Formulation 8.2:

  1. In Formulation 8.2, $ X = {\cal C}_{free}$, which automatically removed $ {\cal C}_{obs}$ from $ {\cal C}$ by definition. Now let $ X$ be any C-space or phase space, and let $ {X_{obs}}$ be defined as in Formulation 8.2. This leads to $ {X_{free}}$, as defined in Formulation 14.1.
  2. In Formulation 8.2, the state transition equation was $ {\dot x}= u$, which directly specified velocities in the tangent space $ T_x(X)$. Now let any system, $ {\dot x}=
f(x,u)$, be used instead. In this case, $ U(x)$ is no longer a subset of $ T_x(X)$. It still includes the special termination action $ u_T$.
  3. Formulation 14.1 includes $ {x_{I}}$, which is now removed for the feedback case to be consistent with Formulation 8.2.
  4. A feedback plan is now defined as a function $ \pi : {X_{free}}
\rightarrow U$. For a given state $ x \in {X_{free}}$, an action $ \pi (x)$ is produced. Composing $ \pi $ with $ f$ yields a velocity in $ T_x(X)$ given by $ {\dot x}= f(x,\pi (x))$. Therefore, $ \pi $ defines a vector field on $ {X_{free}}$.
Let $ t_F$ denote the time at which $ u_T$ is applied. Both feasible and optimal planning can be defined using a cost functional,

$\displaystyle L({\tilde{x}}_{t_F},{\tilde{u}}_{t_F}) = \int_0^{t_F} l(x(t),u(t)) dt + l_F(x(t_F)) ,$ (14.26)

which is identical to that given in Section 8.4.1. This now specifies the problem of feedback motion planning under differential constraints.

The most important difference with respect to Chapter 8 is that $ {\dot x}= u$ is replaced with $ {\dot x}=
f(x,u)$, which allows complicated differential models of Chapter 13 to be used. The vector field that results from $ \pi $ must satisfy the differential constraints imposed by $ {\dot x}=
f(x,u)$. In Section 8.4.4, simple constraints on the allowable vector fields were imposed, such as velocity bounds or smoothness; however, these constraints were not as severe as the models in Chapter 13. For example, the Dubins car does not allow motions in the reverse direction, whereas the constraints in Section 8.4.4 permit motions in any direction.

Steven M LaValle 2020-08-14