13.2.1.1 The scalar case

To make the discussion concrete, consider the following differential equation:

$\displaystyle {\ddot y}- 3 {\dot y}+ y = 0 ,$ (13.29)

in which $ y$ is a scalar variable, $ y \in {\mathbb{R}}$. This is a second-order differential equation because of $ {\ddot y}$. A phase space can be defined as follows. Let $ x = (x_1,x_2)$ denote a two-dimensional phase vector, which is defined by assigning $ x_1 = y$ and $ x_2 = {\dot y}$. The terms state space and state vector will be used interchangeably with phase space and phase vector, respectively, in contexts in which the phase space is defined. Substituting the equations into (13.29) yields

$\displaystyle {\ddot y}- 3 x_2 + x_1 = 0 .$ (13.30)

So far, this does not seem to have helped. However, $ {\ddot y}$ can be expressed as either $ {\dot x}_2$ or $ {\ddot x}_1$. The first choice is better because it is a lower order derivative. Using $ {\dot x}_2 =
{\ddot y}$, the differential equation becomes

$\displaystyle {\dot x}_2 - 3 x_2 + x_1 = 0 .$ (13.31)

Is this expression equivalent to (13.29)? By itself it is not. There is one more constraint, $ x_2 = {\dot x}_1$. In implicit form, $ {\dot x}_1 - x_2 = 0$. The key to making the phase space approach work correctly is to relate some of the phase variables by derivatives.

Using the phase space, we just converted the second-order differential equation (13.29) into two first-order differential equations,

\begin{displaymath}\begin{split}{\dot x}_1 &= x_2  {\dot x}_2 &= 3 x_2 - x_1 , \end{split}\end{displaymath} (13.32)

which are obtained by solving for $ {\dot x}_1$ and $ {\dot x}_2$. Note that (13.32) can be expressed as $ {\dot x}= f(x)$, in which $ f$ is a function that maps from $ {\mathbb{R}}^2$ into $ {\mathbb{R}}^2$.

The same approach can be used for any differential equation in implicit form, $ g({\ddot y},{\dot y},y) = 0$. Let $ x_1 = y$, $ x_2 = {\dot y}$, and $ {\dot x}_2 =
{\ddot y}$. This results in the implicit equations $ g({\dot x}_2,x_2,x_1) = 0$ and $ {\dot x}_1 = x_2$. Now suppose that there is a scalar action $ u \in U = {\mathbb{R}}$ represented in the differential equations. Once again, the same approach applies. In implicit form, $ g({\ddot y},{\dot y},y,u) = 0$ can be expressed as $ g({\dot x}_2,x_2,x_1,u) =
0$.

Suppose that a given acceleration constraint is expressed in parametric form as $ {\ddot y}= h({\dot y},y,u)$. This often occurs in the dynamics models of Section 13.3. This can be converted into a phase transition equation or state transition equation of the form $ {\dot x}=
f(x,u)$, in which $ f: {\mathbb{R}}^2 \times {\mathbb{R}}
\rightarrow {\mathbb{R}}^2$. The expression is

\begin{displaymath}\begin{split}{\dot x}_1 &= x_2 \\ {\dot x}_2 &= h(x_2,x_1,u) . \end{split}\end{displaymath} (13.33)

For a second-order differential equation, two initial conditions are usually given. The values of $ y(0)$ and $ {\dot y}(0)$ are needed to determine the exact position $ y(t)$ for any $ t \geq 0$. Using the phase space representation, no higher order initial conditions are needed because any point in phase space indicates both $ y$ and $ {\dot y}$. Thus, given an initial point in the phase and $ u(t)$ for all $ t \geq 0$, $ y(t)$ can be determined.

Example 13..3 (Double Integrator)   The double integrator is a simple yet important example that nicely illustrates the phase space. Suppose that a second-order differential equation is given as $ {\ddot q}
= u$, in which $ q$ and $ u$ are chosen from $ {\mathbb{R}}$. In words, this means that the action directly specifies acceleration. Integrating13.5 once yields the velocity $ {\dot q}$ and performing a double integration yields the position $ q$. If $ q(0)$ and $ {\dot q}(0)$ are given, and $ u(t')$ is specified for all $ t' \in [0,t)$, then $ {\dot q}(t)$ and $ q(t)$ can be determined for any $ t > 0$.

A two-dimensional phase space $ X = {\mathbb{R}}^2$ is defined in which

$\displaystyle x = (x_1,x_2) = (q,{\dot q}).$ (13.34)

The state (or phase) transition equation $ {\dot x}=
f(x,u)$ is

\begin{displaymath}\begin{split}{\dot x}_1 &= x_2  {\dot x}_2 &= u . \end{split}\end{displaymath} (13.35)

To determine the state trajectory, initial values $ x_1(0) = q_0$ (position) and $ x_2(0) = {\dot q}_0$ (velocity) must be given in addition to the action history. If $ u$ is constant, then the state trajectory is quadratic because it is obtained by two integrations of a constant function. $ \blacksquare$

Steven M LaValle 2020-08-14