
Section 1.2 \(\bmit{\SO(3)}\)

Subsection 1.2.1 Representations

The rotation group in three Euclidean dimensions is known as \(\SO(3)\text{.}\) Let's try to apply the same reasoning in three dimensions that we did in two dimensions.

There is certainly a matrix representation of \(\SO(3)\text{.}\) If we ignore the \(z\)-direction entirely, we can surely embed the \(xy\)-rotations of \(\SO(2)\) in \(\SO(3)\text{.}\) Thus, we expect matrices of the form

\begin{equation} R_z(\alpha) = \begin{pmatrix} \cos\alpha \amp -\sin\alpha \amp 0 \\ \sin\alpha \amp \cos\alpha \amp 0 \\ 0 \amp 0 \amp 1 \end{pmatrix}\tag{1.2.1} \end{equation}

to be in \(\SO(3)\text{.}\) Similarly, we can rotate about the \(x\)- or \(y\)-axis, rather than the \(z\)-axis, yielding

\begin{equation} R_x(\alpha) = \begin{pmatrix} 1 \amp 0 \amp 0 \\ 0 \amp \cos\alpha \amp -\sin\alpha \\ 0 \amp \sin\alpha \amp \cos\alpha \end{pmatrix}, \quad R_y(\alpha) = \begin{pmatrix} \cos\alpha \amp 0 \amp \sin\alpha \\ 0 \amp 1 \amp 0 \\ -\sin\alpha \amp 0 \amp \cos\alpha \end{pmatrix} .\tag{1.2.2} \end{equation}
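
These claims are easy to check by machine. Here is a minimal sympy sketch (Python; the helper names \(\mathtt{Rz}\text{,}\) \(\mathtt{Rx}\text{,}\) \(\mathtt{Ry}\) are ours) verifying that all three matrices are orthogonal with unit determinant:

\begin{verbatim}
import sympy as sp

alpha = sp.symbols('alpha')

# The three basic rotations (1.2.1) and (1.2.2).
Rz = sp.Matrix([[sp.cos(alpha), -sp.sin(alpha), 0],
                [sp.sin(alpha),  sp.cos(alpha), 0],
                [0,              0,             1]])
Rx = sp.Matrix([[1, 0,              0],
                [0, sp.cos(alpha), -sp.sin(alpha)],
                [0, sp.sin(alpha),  sp.cos(alpha)]])
Ry = sp.Matrix([[ sp.cos(alpha), 0, sp.sin(alpha)],
                [ 0,             1, 0],
                [-sp.sin(alpha), 0, sp.cos(alpha)]])

for R in (Rz, Rx, Ry):
    assert sp.simplify(R.T * R) == sp.eye(3)  # orthogonal: R^T R = 1
    assert sp.simplify(R.det()) == 1          # special: det R = 1
\end{verbatim}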

But what does the general element of \(\SO(3)\) look like?

It turns out that all rotations in three dimensions can be represented as a single rotation about an arbitrary axis. (This statement fails in higher dimensions. Why?) Thus, one description of \(\SO(3)\) involves choosing an axis, then rotating about that axis. The choice of axis is equivalent to the choice of a point on the 2-sphere \(\SS^2\text{,}\) which can be described by its colatitude \(\theta\) and its longitude \(\phi\text{.}\)
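
The fixed axis is easy to extract in practice: it is the eigenvector of the rotation matrix with eigenvalue \(1\text{.}\) A minimal numerical sketch, assuming Python with numpy (the helper names and sample angles are arbitrary):

\begin{verbatim}
import numpy as np

def rot_z(a):
    return np.array([[np.cos(a), -np.sin(a), 0],
                     [np.sin(a),  np.cos(a), 0],
                     [0,          0,         1]])

def rot_y(a):
    return np.array([[ np.cos(a), 0, np.sin(a)],
                     [ 0,         1, 0],
                     [-np.sin(a), 0, np.cos(a)]])

# An arbitrary composition of rotations...
R = rot_z(0.7) @ rot_y(1.1) @ rot_z(-0.3)

# ...still fixes a single axis: the eigenvector with eigenvalue 1.
w, V = np.linalg.eig(R)
axis = np.real(V[:, np.argmin(np.abs(w - 1))])
print(np.allclose(R @ axis, axis))  # True: R rotates about this axis
\end{verbatim}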

Rotate the sphere so that the north pole points in this direction. One possibility is to first rotate the sphere about the \(y\)-axis by an angle \(\theta\text{,}\) thus bringing the north pole to colatitude \(\theta\) (while keeping the longitude zero). Rotating the sphere about the \(z\)-axis by \(\phi\) then brings the north pole to the longitude \(\phi\text{.}\)

But how do we then rotate the sphere about this new axis? Easy; do that rotation first. In other words, before moving the north pole, rotate the sphere about the \(z\)-axis by the desired angle \(\psi\text{,}\) then move the north pole to the desired location.

Thus, the general element of \(\SO(3)\) can be expressed in terms of the Euler angles \((\theta,\phi,\psi)\) as

\begin{align} \amp R(\theta,\phi,\psi)\notag\\ \amp= R_z(\phi) R_y(\theta) R_z(\psi)\notag\\ \amp= \begin{pmatrix} \cos\psi\cos\theta\cos\phi-\sin\psi\sin\phi \amp -\sin\psi\cos\theta\cos\phi-\cos\psi\sin\phi \amp \sin\theta\cos\phi \\ \cos\psi\cos\theta\sin\phi+\sin\psi\cos\phi \amp -\sin\psi\cos\theta\sin\phi+\cos\psi\cos\phi \amp \sin\theta\sin\phi \\ -\cos\psi\sin\theta \amp \sin\psi\sin\theta \amp \cos\theta \end{pmatrix} .\tag{1.2.3} \end{align}

What a mess!
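
Messy, but mechanical: a short sympy sketch (with the same hypothetical helpers \(\mathtt{Rz}\) and \(\mathtt{Ry}\) as above) reproduces (1.2.3) entry by entry:

\begin{verbatim}
import sympy as sp

theta, phi, psi = sp.symbols('theta phi psi')

def Rz(t):
    return sp.Matrix([[sp.cos(t), -sp.sin(t), 0],
                      [sp.sin(t),  sp.cos(t), 0],
                      [0,          0,         1]])

def Ry(t):
    return sp.Matrix([[ sp.cos(t), 0, sp.sin(t)],
                      [ 0,         1, 0],
                      [-sp.sin(t), 0, sp.cos(t)]])

R = Rz(phi) * Ry(theta) * Rz(psi)
sp.pprint(R)  # entry by entry, this is exactly (1.2.3); for instance:
assert R[2, 2] == sp.cos(theta)
assert sp.simplify(R[0, 0] - (sp.cos(psi)*sp.cos(theta)*sp.cos(phi)
                              - sp.sin(psi)*sp.sin(phi))) == 0
\end{verbatim}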

Subsection 1.2.2 Properties

It seems clear from the above discussion that \(\SO(3)\) is a 3-dimensional manifold. But which one?

First of all, all three Euler angles are periodic. So we can think of an element of \(\SO(3)\) as a basepoint (the axis of rotation, determined by \((\theta,\phi)\)), together with a rotation about that axis (determined by \(\psi\)). So we might suspect that \(\SO(3)\cong\SS^2\times\SS^1\text{.}\) As we will see later, that's not quite right; \(\SO(3)\) does indeed have the local structure of \(\SS^2\times\SS^1\text{,}\) but turns out to be a fibre bundle rather than a direct product. This fibre bundle is the well-known Hopf fibration of \(\SS^3\) over \(\SS^2\text{,}\) whose fibres are circles, so we now suspect that \(\SO(3)\cong\SS^3\text{.}\) But that's still not quite right, since rotations about antipodal directions are equivalent. We must therefore identify antipodal points on \(\SS^3\text{,}\) and we finally conclude that \(\SO(3)\cong\RP^3\text{.}\)

Again, what a mess!

Subsection 1.2.3 Derivatives

Let's try again. First of all, since \(\SO(3)\) preserves the length of a vector \(v\in\RR^3\text{,}\) by the same argument used for \(\SO(2)\) we must have \(M^TM=1\) for every \(M\in\SO(3)\text{.}\) Again, the “S” tells us that \(|M|=1\text{.}\)
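
Both conditions can also be confirmed symbolically for the general element (1.2.3); a minimal sympy sketch, reusing the same hypothetical helpers as before:

\begin{verbatim}
import sympy as sp

theta, phi, psi = sp.symbols('theta phi psi')

def Rz(t):
    return sp.Matrix([[sp.cos(t), -sp.sin(t), 0],
                      [sp.sin(t),  sp.cos(t), 0],
                      [0,          0,         1]])

def Ry(t):
    return sp.Matrix([[ sp.cos(t), 0, sp.sin(t)],
                      [ 0,         1, 0],
                      [-sp.sin(t), 0, sp.cos(t)]])

M = Rz(phi) * Ry(theta) * Rz(psi)
assert sp.simplify(M.T * M) == sp.eye(3)  # M^T M = 1
assert sp.simplify(M.det()) == 1          # |M| = 1
\end{verbatim}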

What if we look at derivatives of \(M(\theta,\phi,\psi)\text{,}\) rather than the group elements themselves? But which derivatives?

It is straightforward to compute

\begin{equation} \begin{aligned} r_z \amp= \dot{R}_z = R_z'(0) = \begin{pmatrix} 0 \amp -1 \amp 0 \\ 1 \amp 0 \amp 0 \\ 0 \amp 0 \amp 0 \end{pmatrix} ,\\ r_x \amp= \dot{R}_x = R_x'(0) = \begin{pmatrix} 0 \amp 0 \amp 0 \\ 0 \amp 0 \amp -1 \\ 0 \amp 1 \amp 0 \end{pmatrix} ,\\ r_y \amp= \dot{R}_y = R_y'(0) = \begin{pmatrix} 0 \amp 0 \amp 1 \\ 0 \amp 0 \amp 0 \\ -1 \amp 0 \amp 0 \end{pmatrix} .\end{aligned}\tag{1.2.4} \end{equation}

These matrices live in the tangent space to \(\SO(3)\) at the identity, and are linearly independent. They must therefore span this 3-dimensional vector space.
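
In code, the generators are one derivative away; a sympy sketch, with the same conventions as before:

\begin{verbatim}
import sympy as sp

alpha = sp.symbols('alpha')

Rz = sp.Matrix([[sp.cos(alpha), -sp.sin(alpha), 0],
                [sp.sin(alpha),  sp.cos(alpha), 0],
                [0,              0,             1]])
Rx = sp.Matrix([[1, 0,              0],
                [0, sp.cos(alpha), -sp.sin(alpha)],
                [0, sp.sin(alpha),  sp.cos(alpha)]])
Ry = sp.Matrix([[ sp.cos(alpha), 0, sp.sin(alpha)],
                [ 0,             1, 0],
                [-sp.sin(alpha), 0, sp.cos(alpha)]])

# Differentiate each rotation and evaluate at the identity (alpha = 0).
rz, rx, ry = (R.diff(alpha).subs(alpha, 0) for R in (Rz, Rx, Ry))
print(rz, rx, ry)  # reproduces the three matrices in (1.2.4)
\end{verbatim}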

Alternatively, we have

\begin{equation} \begin{aligned} R_z(\alpha) \amp= M(0,0,\alpha) ,\\ R_x(\alpha) \amp= M\left(\alpha,-\frac\pi2,\frac\pi2\right) ,\\ R_y(\alpha) \amp= M(\alpha,0,0),\end{aligned}\tag{1.2.5} \end{equation}

which suggests that the \(r_m\) can be associated with the derivative operators \(\partial_\theta\text{,}\) \(\partial_\phi\text{,}\) and \(\partial_\psi\)—evaluated at the identity element of \(\SO(3)\text{.}\)
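
Each relation in (1.2.5) is again a one-line check; a sympy sketch, where \(\mathtt{M}\) is our name for the Euler parametrization:

\begin{verbatim}
import sympy as sp

alpha = sp.symbols('alpha')

def Rz(t):
    return sp.Matrix([[sp.cos(t), -sp.sin(t), 0],
                      [sp.sin(t),  sp.cos(t), 0],
                      [0,          0,         1]])

def Ry(t):
    return sp.Matrix([[ sp.cos(t), 0, sp.sin(t)],
                      [ 0,         1, 0],
                      [-sp.sin(t), 0, sp.cos(t)]])

def Rx(t):
    return sp.Matrix([[1, 0,          0],
                      [0, sp.cos(t), -sp.sin(t)],
                      [0, sp.sin(t),  sp.cos(t)]])

def M(theta, phi, psi):
    return Rz(phi) * Ry(theta) * Rz(psi)    # as in (1.2.3)

assert sp.simplify(M(0, 0, alpha)) == Rz(alpha)
assert sp.simplify(M(alpha, -sp.pi/2, sp.pi/2)) == Rx(alpha)
assert sp.simplify(M(alpha, 0, 0)) == Ry(alpha)
\end{verbatim}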

Recall that for \(\SO(2)\) there was a correspondence between \(M'(\alpha)\) and the vector field \(\partial_\phi\text{.}\) Since \(R_z\) is just a 3-dimensional version of \(M\text{,}\) we still expect this correspondence to hold. Furthermore, since \(\{r_z,r_x,r_y\}\) are obtained from each other by cyclic permutations of the coordinates, the same relationship should hold for the corresponding vector fields. In ordinary spherical coordinates, we have

\begin{equation} \begin{aligned} \partial_\phi \amp= x\,\partial_y - y\,\partial_x ,\\ -\sin\phi \,\partial_\theta - \cot\theta\cos\phi \,\partial_\phi \amp= y\,\partial_z - z\,\partial_y ,\\ \cos\phi \,\partial_\theta - \cot\theta\sin\phi \,\partial_\phi \amp= z\,\partial_x - x\,\partial_z .\end{aligned}\tag{1.2.6} \end{equation}

Since these vector fields have been defined cyclically, they should correspond to \(\{r_z,r_x,r_y\}\text{,}\) respectively.
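
Although we state (1.2.6) without derivation, checking it is mechanical: two derivations that agree on the coordinate functions \(x\text{,}\) \(y\text{,}\) \(z\) must agree everywhere. A sympy sketch on the unit sphere (the helper names are ours):

\begin{verbatim}
import sympy as sp

theta, phi = sp.symbols('theta phi')

# Cartesian coordinates on the unit sphere.
x = sp.sin(theta) * sp.cos(phi)
y = sp.sin(theta) * sp.sin(phi)
z = sp.cos(theta)

def lhs_x(g):  # candidate for y d_z - z d_y
    return -sp.sin(phi)*g.diff(theta) - sp.cot(theta)*sp.cos(phi)*g.diff(phi)

def lhs_y(g):  # candidate for z d_x - x d_z
    return sp.cos(phi)*g.diff(theta) - sp.cot(theta)*sp.sin(phi)*g.diff(phi)

# y d_z - z d_y sends (x, y, z) to (0, -z, y);
# z d_x - x d_z sends (x, y, z) to (z, 0, -x).
for g, target in ((x, 0), (y, -z), (z, y)):
    assert sp.simplify(lhs_x(g) - target) == 0
for g, target in ((x, z), (y, 0), (z, -x)):
    assert sp.simplify(lhs_y(g) - target) == 0
\end{verbatim}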

Wait a minute. These vector fields live on \(\SS^2\text{!}\) But we really want vector fields that are tangent to \(\SO(3)\text{,}\) which is locally \(\SS^3\text{...}\)

Although we won't go through the details, extending the vector fields above to \(\SS^3\) is straightforward. The \(\phi\)- and \(\psi\)-directions turn out not to be orthogonal; using Gram-Schmidt orthogonalization corrects this deficiency by adding appropriate \(\partial_\psi\) terms. The resulting correspondence is:

\begin{equation} \begin{aligned} R_z'(\alpha) \amp\longleftrightarrow \partial_\phi ,\\ R_x'(\alpha) \amp\longleftrightarrow -\sin\phi \,\partial_\theta - \cot\theta\cos\phi \,\partial_\phi + \csc\theta\cos\phi \,\partial_\psi ,\\ R_y'(\alpha) \amp\longleftrightarrow \cos\phi \,\partial_\theta - \cot\theta\sin\phi \,\partial_\phi + \csc\theta\sin\phi \,\partial_\psi .\end{aligned}\tag{1.2.7} \end{equation}

However, it is difficult to evaluate these expressions at the identity due to the coordinate singularities there.
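
Away from those singular points, however, the commutation structure of these fields can still be checked by machine; the following sympy sketch verifies one commutator (the others follow by cyclic symmetry), and exhibits an overall minus sign to which we return in the next subsection:

\begin{verbatim}
import sympy as sp

theta, phi, psi = sp.symbols('theta phi psi')
f = sp.Function('f')(theta, phi, psi)

# The three vector fields of (1.2.7), as operators on functions.
def Xz(g):
    return g.diff(phi)

def Xx(g):
    return (-sp.sin(phi)*g.diff(theta)
            - sp.cot(theta)*sp.cos(phi)*g.diff(phi)
            + sp.csc(theta)*sp.cos(phi)*g.diff(psi))

def Xy(g):
    return (sp.cos(phi)*g.diff(theta)
            - sp.cot(theta)*sp.sin(phi)*g.diff(phi)
            + sp.csc(theta)*sp.sin(phi)*g.diff(psi))

# [X_x, X_y] f = X_x(X_y(f)) - X_y(X_x(f)) = -X_z(f): note the sign.
assert sp.simplify(Xx(Xy(f)) - Xy(Xx(f)) + Xz(f)) == 0
\end{verbatim}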

What do these several expressions have in common?

Subsection 1.2.4 Commutators

The answer lies in the commutation relations between different elements. The matrix commutator is straightforward, and is defined by

\begin{equation} [A,B] = AB - BA\tag{1.2.8} \end{equation}

for two (\(n\times n\)) matrices \(A\) and \(B\text{.}\) Direct computation shows that

\begin{equation} [r_x,r_y] = r_z , \qquad [r_y,r_z] = r_x , \qquad [r_z,r_x] = r_y ,\tag{1.2.9} \end{equation}

which may look familiar to those who have studied quantum mechanics. There is also a commutator of vector fields, defined for two vector fields \(X\) and \(Y\) by

\begin{equation} [X,Y](f) = X(Y(f)) - Y(X(f)) ,\tag{1.2.10} \end{equation}

and it can now be checked that the two sets of vector fields given above both share the commutation structure of the matrices \(r_m\) (up to an annoying but conventional sign). For instance,

\begin{equation} [y\,\partial_z - z\,\partial_y , z\,\partial_x - x\,\partial_z] f = - (x\,\partial_y - y\,\partial_x) f ,\tag{1.2.11} \end{equation}

since all other terms cancel by reversing the order of differentiation. Thus,

\begin{equation} [y\,\partial_z - z\,\partial_y , z\,\partial_x - x\,\partial_z] = -(x\,\partial_y - y\,\partial_x) .\tag{1.2.12} \end{equation}
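
Both computations are easy to automate; a sketch using numpy for the matrix commutators (1.2.9) and sympy for the vector-field commutator (1.2.12):

\begin{verbatim}
import numpy as np
import sympy as sp

# The matrix commutators (1.2.9).
rz = np.array([[0, -1, 0], [1, 0, 0], [0, 0, 0]])
rx = np.array([[0, 0, 0], [0, 0, -1], [0, 1, 0]])
ry = np.array([[0, 0, 1], [0, 0, 0], [-1, 0, 0]])

def comm(A, B):
    return A @ B - B @ A  # the matrix commutator (1.2.8)

assert (comm(rx, ry) == rz).all()
assert (comm(ry, rz) == rx).all()
assert (comm(rz, rx) == ry).all()

# The vector-field commutator (1.2.12), applied to an arbitrary f(x,y,z).
x, y, z = sp.symbols('x y z')
f = sp.Function('f')(x, y, z)

def Lx(g): return y*g.diff(z) - z*g.diff(y)
def Ly(g): return z*g.diff(x) - x*g.diff(z)
def Lz(g): return x*g.diff(y) - y*g.diff(x)

assert sp.simplify(Lx(Ly(f)) - Ly(Lx(f)) + Lz(f)) == 0  # note the sign
\end{verbatim}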

The point is that these commutators are constant, that they tell us something about the structure of the group, and that, best of all, we can determine them using the matrices \(r_m\) without worrying about vector fields at all.

Which brings us to one final point: Even though the tangent spaces to \(\SS^2\) are, of course, only 2-dimensional, the three vector fields given above are nonetheless independent. How can this be? The space of vector fields is a module over the ring of functions, not merely a vector space over the constants: at each point the three fields span only the 2-dimensional tangent space, so they are linearly dependent over the functions, yet no nonzero constant linear combination of them vanishes everywhere. On a Lie group, however, the vectors at the identity naturally extend to vector fields everywhere, and so long as we take constant linear combinations of these vector fields, we still obtain a vector space. In this sense, the 3-dimensional vector field machinery on \(\SO(3)\) can be successfully rewritten in terms of vector fields on \(\SS^2\text{,}\) as we have done above, even though the latter is only 2-dimensional.
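
To see the dependence over functions concretely: with coefficients \(x\text{,}\) \(y\text{,}\) \(z\) (functions, not constants!), the three rotation fields sum to zero, even though no nonzero constant combination of them vanishes. A final sympy sketch:

\begin{verbatim}
import sympy as sp

x, y, z = sp.symbols('x y z')
f = sp.Function('f')(x, y, z)

def Lx(g): return y*g.diff(z) - z*g.diff(y)
def Ly(g): return z*g.diff(x) - x*g.diff(z)
def Lz(g): return x*g.diff(y) - y*g.diff(x)

# Function coefficients: this combination vanishes identically...
assert sp.simplify(x*Lx(f) + y*Ly(f) + z*Lz(f)) == 0
# ...but a constant combination a*Lx + b*Ly + c*Lz annihilates every f
# only when a = b = c = 0, so the three fields span a 3-dimensional
# vector space over the constants.
\end{verbatim}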

Be warned that our derivatives \(r_m\) are antisymmetric, whereas physicists normally work with the Hermitian matrices \(-ir_m\text{.}\)