Tangent spaces on manifolds

Now consider defining tangent spaces on manifolds. Intuitively, the tangent space $ T_p(M)$ at a point $ p$ on an $ n$-dimensional manifold $ M$ is an $ n$-dimensional hyperplane in $ {\mathbb{R}}^m$ that best approximates $ M$ around $ p$, when the hyperplane origin is translated to $ p$. This is depicted in Figure 8.8. The notion of a tangent was actually used in Section 7.4.1 to describe local motions for motion planning of closed kinematic chains (see Figure 7.22).

To define a tangent space on a manifold, we first consider a more complicated definition of the tangent space at a point in $ {\mathbb{R}}^n$, in comparison to what was given in Section 8.3.1. Suppose that $ M = {\mathbb{R}}^2$, and consider taking directional derivatives of a smooth function $ f : {\mathbb{R}}^2
\rightarrow {\mathbb{R}}$ at a point $ p \in {\mathbb{R}}^2$. For some (unnormalized) direction vector, $ v \in {\mathbb{R}}^2$, the directional derivative of $ f$ at $ p$ can be defined as

$\displaystyle \nabla_v(f)\Big\vert_p = v_1 \frac{\partial f}{\partial x_1}\Big\vert_p + v_2 \frac{\partial f}{\partial x_2}\Big\vert_p .$ (8.31)

The directional derivative used here does not normalize the direction vector (contrary to basic calculus). Hence, $ \nabla_v(f) = \nabla(f)
\cdot v$, in which ``$ \cdot$'' denotes the inner product or dot product, and $ \nabla(f)$ denotes the gradient of $ f$. The set of all possible direction vectors that can be used in this construction forms a two-dimensional vector space that happens to be the tangent space $ T_p({\mathbb{R}}^2)$, as defined previously. This can be generalized to $ n$ dimensions to obtain

$\displaystyle \nabla_v(f)\Big\vert_p = \sum_{i=1}^n v_i \frac{\partial f}{\partial x_i}\Big\vert_p ,$ (8.32)

for which all possible direction vectors represent the tangent space $ T_p({\mathbb{R}}^n)$. The set of all directions can be interpreted for our purposes as the set of possible velocity vectors.

Now consider taking (unnormalized) directional derivatives of a smooth function, $ f: M \rightarrow {\mathbb{R}}$, on a manifold. For an $ n$-dimensional manifold, the tangent space $ T_p(M)$ at a point $ p \in M$ can be considered once again as the set of all unnormalized directions. These directions must intuitively be tangent to the manifold, as depicted in Figure 8.8. There exists a clever way to define them without even referring to specific coordinate neighborhoods. This leads to a definition of $ T_p(M)$ that is intrinsic to the manifold.

At this point, you may accept that $ T_p(M)$ is an $ n$-dimensional vector space that is affixed to $ M$ at $ p$ and oriented as shown in Figure 8.8. For the sake of completeness, however, a technical definition of $ T_p(M)$ from differential geometry will be given; more details appear in [133,872]. The construction is based on characterizing the set of all possible directional derivative operators. Let $ C^\infty(p)$ denote the set of all smooth functions that have domains that include $ p$. Now make the following identification. Any two functions $ f,g \in C^\infty(p)$ are defined to be equivalent if there exists an open set $ U \subset M$ such that for any $ p
\in U$, $ f(p) = g(p)$. There is no need to distinguish equivalent functions because their derivatives must be the same at $ p$. Let $ {{\tilde C}^\infty}(p)$ denote $ C^\infty$ under this identification. A directional derivative operator at $ p$ can be considered as a function that maps from $ {{\tilde C}^\infty}(p)$ to $ {\mathbb{R}}$ for some direction. In the case of $ {\mathbb{R}}^n$, the operator appears as $ \nabla_v$ for each direction $ v$. Think about the set of all directional derivative operators that can be made. Each one must assign a real value to every function in $ {{\tilde C}^\infty}(p)$, and it must obey two axioms from calculus regarding directional derivatives. Let $ \nabla_v$ denote a directional derivative operator at some $ p \in M$ (be careful, however, because here $ v$ is not explicitly represented since there are no coordinates). The directional derivative operator must satisfy two axioms:

  1. Linearity: For any $ \alpha,\beta \in {\mathbb{R}}$ and $ f,g\in
{{\tilde C}^\infty}(p)$,

    $\displaystyle \nabla_v(\alpha f + \beta g) = \alpha \nabla_v f + \beta \nabla_v g.$ (8.33)

  2. Leibniz Rule (or Derivation): For any $ f,g\in
{{\tilde C}^\infty}(p)$,

    $\displaystyle \nabla_v(fg) = \nabla_v f \;g(p) + f(p) \nabla_v g.$ (8.34)

You may recall these axioms from standard vector calculus as properties of the directional derivative. It can be shown that the set of all possible operators that satisfy these axioms forms an $ n$-dimensional vector space [133]. This vector space is called the tangent space, $ T_p(M)$, at $ p$. This completes the definition of the tangent space without referring to coordinates.

It is helpful, however, to have an explicit way to express vectors in $ T_p(M)$. A basis for the tangent space can be obtained by using coordinate neighborhoods. An important theorem from differential geometry states that if $ F : M \rightarrow N$ is a diffeomorphism onto an open set $ U \subset N$, then the tangent space, $ T_p(M)$, is isomorphic to $ T_{F(p)}(N)$. This means that by using a parameterization (the inverse of a coordinate neighborhood), there is a bijection between velocity vectors in $ T_p(M)$ and velocity vectors in $ T_{F(p)}(N)$. Small perturbations in the parameters cause motions in the tangent directions on the manifold $ N$. Imagine, for example, making a small perturbation to three quaternion parameters that are used to represent $ SO(3)$. If the perturbation is small enough, motions that are tangent to $ SO(3)$ occur. In other words, the perturbed matrices will lie very close to $ SO(3)$ (they will not lie in $ SO(3)$ because $ SO(3)$ is defined by nonlinear constraints on $ {\mathbb{R}}^9$, as discussed in Section 4.1.2).

Example 8..16 (The Tangent Space for $ {\mathbb{S}}^2$)   The discussion can be made more concrete by developing the tangent space for $ {\mathbb{S}}^2$, which is embedded in $ {\mathbb{R}}^3$ as the set of all points $ (x,y,z) \in
{\mathbb{R}}^3$ for which $ x^2+y^2+z^2 = 1$. A coordinate neighborhood can be defined that covers most of $ {\mathbb{S}}^2$ by using standard spherical coordinates. Let $ f$ denote the coordinate function, which maps from $ (x,y,z)$ to angles $ (\theta,\phi)$. The domain of $ f$ is the open set defined by $ \theta \in (0,2\pi)$ and $ \phi \in (0,\pi)$ (this excludes the poles). The standard formulas are $ \theta = \atan2(y,x)$ and $ \phi = \cos^{-1}z$. The inverse, $ f^{-1}$, yields a parameterization, which is $ x = \cos \theta
\sin\phi$, $ y = \sin\theta \sin\phi$, and $ z = \cos\phi$.

Now consider different ways to express the tangent space at some point $ p \in {\mathbb{S}}^2$, other than the poles (a change of coordinates is needed to cover these). Using the coordinates $ (\theta,\phi)$, velocities can be defined as vectors in $ {\mathbb{R}}^2$. We can imagine moving in the plane defined by $ \theta $ and $ \phi $, provided that the limits $ \theta \in (0,2\pi)$ and $ \phi \in (0,\pi)$ are respected.

We can also use the parameterization to derive basis vectors for the tangent space as vectors in $ {\mathbb{R}}^3$. Since the tangent space has only two dimensions, we must obtain a plane that is ``tangent'' to the sphere at $ p$. These can be found by taking derivatives. Let $ f^{-1}$ be denoted as $ x(\theta,\phi)$, $ y(\theta,\phi)$, and $ z(\theta,\phi)$. Two basis vectors for the tangent plane at $ p$ are

$\displaystyle \left[\frac{dx(\theta,\phi)}{d\theta} \;\;\; \frac{dy(\theta,\phi)}{d\theta} \;\;\; \frac{dz(\theta,\phi)}{d\theta} \right]$ (8.35)

and

$\displaystyle \left[\frac{dx(\theta,\phi)}{d\phi} \;\;\; \frac{dy(\theta,\phi)}{d\phi} \;\;\; \frac{dz(\theta,\phi)}{d\phi} \right] .$ (8.36)

Computing these derivatives and normalizing yields the vectors $ [-\sin\theta \;\; \cos\theta \;\; 0]$ and $ [\cos\theta \cos\phi \;\;
\sin\theta\cos\phi \;\; -\sin\phi]$. These can be imagined as the result of making small perturbations of $ \theta $ and $ \phi $ at $ p$. The vector space obtained by taking all linear combinations of these vectors is the tangent space at $ {\mathbb{R}}^2$. Note that the direction of the basis vectors depends on $ p \in {\mathbb{S}}^2$, as expected.

The tangent vectors can now be imagined as lying in a plane that is tangent to the surface, as shown in Figure 8.8. The normal vector to a surface specified as $ g(x,y,z) = 0$ is $ \nabla g$, which yields $ [x \;\; y \;\; z]$ after normalizing. This could alternatively be obtained by taking the cross product of the two vectors above and using the parameterization $ f^{-1}$ to express it in terms of $ x$, $ y$, and $ z$. For a point $ p = (x_0,y_0,z_0)$, the plane equation is

$\displaystyle x_0 (x - x_0) + y_0 (y - y_0) + z_0 (z - z_0) = 0 .$ (8.37)

$ \blacksquare$

Steven M LaValle 2020-08-14