Section A.1 Differential Geometry in Brief
We present here a very brief summary of the basic building blocks of differential geometry.
A differentiable manifold \(M\) is a toplogical space with smooth local coordinates. A coordinate chart \(x=(x^\alpha)\) on an open subset \(U\subset M\) is a map
that is a topological homeomorphism (takes open sets to open sets); \(n\) is the dimension of \(M\text{,}\) and \(\alpha=1,...,n\text{.}\) We will rarely need to worry about the need to use overlapping coordinate charts to cover the entire manifold, but all such overlaps will be assumed to be smooth (\(C^\infty\)) as maps from \(\RR^n\) to itself. A function \(f\) on \(M\) is a map
which will normally be assumed to be smooth in the sense that \(f\circ x^{-1}\) is smooth as a map from \(\RR^n\) to \(\RR\text{.}\) This setup is illustrated in Figure A.1.1.
Also shown in Figure A.1.1 is a vector \(X\) at a point \(P\text{,}\) sometimes written as \(X_P\text{.}\) Vectors are derivative operators; they act on functions by taking a directional derivative at the given point. A vector field, also written as \(X\text{,}\) is then a vector \(X_P\) at every point \(P\in M\text{.}\) More formally,
for some functions \(X^\alpha\text{,}\) the components of \(X\) in the coordinates \(\{x^\alpha\}\text{,}\) and where there is an implicit sum over \(\alpha\text{.}\) This derivative is often written simply as \(X^\alpha\frac{\partial f}{\partial x^\alpha}\text{.}\) A vector field is smooth if its components are smooth functions.
Example A.1.2. The Sphere.
Consider the sphere \(\SS^2\subset\RR^3\) defined by the equation
where \(a\) is a constant, and which can be described parametrically as
Away from the poles, we can use the coordinates \(x^1=\theta\text{,}\) \(x^2=\phi\) (with suitable domains) to map the sphere into \(\RR^2\text{.}\) Any vector field \(X\) on \(\SS^2\) can then be written (away from the poles) as
This intrinsic description of the sphere makes no explicit use of the rectangular coordinates used in its construction. We can instead give an extrinsic description of the sphere that has the flavor of vector calculus.
Writing \(x\text{,}\) \(y\text{,}\) \(z\) as functions of \(\theta\) and \(\phi\text{,}\) we can express the sphere embedded in \(\RR^3\) in vector parametric form as
A basis for the tangent vectors to the sphere at any point (away from the poles) can then be obtained by differentiating \(\rr\text{;}\) the general tangent vector has the form
If we now define
then, for instance,
that is, this action of vectors on functions is precisely the directional derivative. Noting further that \(\xhat=\frac{\partial\rr}{\partial x}\text{,}\) we have related the intuitive notion of vector (“\(x\)-direction”) to a derivative of the position vector \(\rr\) (“\(x\)-derivative of position”) to a derivative operator (“\(x\)-derivative operator”). These three descriptions are equivalent representations of tangent vectors on manifolds, in this case \(\RR^3\text{.}\)
Returning to the sphere, if we now compute, say,
then
A map \(F\) between two manifolds \(M\) and \(N\) not only maps points from one manifold to the other, but also vectors. This situation is shown in Figure A.1.3, where \(y\) represents a coordinate chart on \(N\text{.}\) It is not necessary for \(M\) and \(N\) to have the same dimension.
Given a function \(g\) on \(N\text{,}\) we automatically get a function \(g\circ F\) on \(M\text{.}\) Given a vector \(X\) at \(P\in M\text{,}\) we can therefore define a vector \(F_*X\) at \(F(P)\in N\) to be the result of acting on \(g\circ F\) with \(X\text{.}\) Explicitly,
where the last term is the Jacobian matrix of \(F\text{,}\) often denoted simply \(\frac{\partial y^\beta}{\partial x^\alpha}\text{.}\) Comparing (A.1.15) with (A.1.3) leads to
which relates the components of \(F_*X\) to those of \(X\text{.}\) Thus, the components of \(F_*X\) are given by multiplying the components of \(X\) by the Jacobian matrix.
Example A.1.4. Rotating the sphere.
Consider the case where \(F:\SS^2\longrightarrow\SS^2\) is a rotation of the sphere about the \(z\)-axis by an angle \(\psi\text{.}\) This transformation is given by
and it is straightforward to show that the Jacobian matrix of \(F\) is the \(2\times2\) identity matrix. Thus, \(F_*\) takes a vector of the form (A.1.8) at a given point \(P\in\SS^2\) to a vector with the same components at \(F(P)\text{.}\)
However, if \(F\) were a rotation about some other axis, it is not obvious what it does to the coordinates \((\theta,\phi)\text{,}\) and therefore not obvious what the Jacobian matrix is.
Switching from the intrinsic description to the extrinsic description, any rotation can be expressed as a \(3\times3\) matrix \(M\) acting on the position vector, thought of as the column vector
that is, any rotation can be described as
Since \(M\) is a constant, it pulls through the computation of the Jacobian matrix! Thus, if we expand a tangent vector \(\vv\) in the form (A.1.10) as a vector in \(\RR^3\), so that
again thought of as a column vector, namely
then it is straightforward to show that
which can be converted to either of the forms (A.1.10) or (A.1.8).
This example shows one of the advantages of working in the extrinsic description. This approach underlies much of the computation elsewhere in this book, with \(\RR^3\) replaced by \(\RR^{n\times n}\text{,}\) the space of \(n\times n\) matrices, and \(\SS^2\) replaced by a matrix Lie group.