Skip to main content

Section 10.1 The Geometry of Gradient

How do you compute a derivative of a quantity that depends on a single variable? By taking the ratio of small changes in the quantity to small changes in the variable. But what if the quantity depends on several variables, such as the temperature in the room? Use the same strategy — but the result will depend on which direction you go.

Suppose therefore that you are given a quantity \(f\) that depends on position in space. We could express that position in terms of rectangular coordinates \((x,y,z)\text{,}\) but let's save that for later. To find the derivative of \(f\) at a given point and in a given direction, we must first specify the point and the direction. We know how to do that!

A point in space can be described in terms of the position vector \(\rr\) from the origin to the given point. And the direction is determined by considering any curve through the given point; a small change in position along the curve is, of course, described by \(d\rr\text{.}\)

So consider the small change \(df\) in \(f\) along the curve. As \(ds=|d\rr|\) shrinks to \(0\text{,}\) so does \(df\text{;}\) the derivative of \(f\) along the curve is the (limiting value of the) ratio of the small quantities \(df\) and \(ds\text{.}\)

Now consider computing this derivative in all possible directions. There will be one direction in which the derivative is as large as possible. We define the gradient of \(f\text{,}\) written \(\grad f\text{,}\) to be the vector whose direction is the direction in which \(f\) increases the fastest, and whose magnitude is the derivative of \(f\) in that direction. This construction yields the gradient of \(f\) at a given point, and we can repeat the process at any point; the gradient of \(f\) is a vector field.

How much does \(f\) change in an arbitrary direction? Suppose we have a curve through the given point that goes in this new direction. Since derivatives are linear, the rate of change along this new curve is just the projection of \(\grad f\) along the curve, namely

\begin{equation} \frac{df}{ds} = \grad f \cdot \frac{d\rr}{|d\rr|}\tag{10.1.1} \end{equation}

where \(d\rr\) and \(ds\) now refer to the new curve, and are therefore different than before. Remembering that \(|d\rr|=ds\text{,}\) we can rewrite this expression as

\begin{equation} df = \grad f \cdot d\rr\tag{10.1.2} \end{equation}

which we refer to as the Master Formula, and which can also be taken as the geometric definition of the gradient.

Recall that \(df\) represents the infinitesimal change in \(f\) when moving to a “nearby” point. What information do you need in order to know how \(f\) changes? You must know something about how \(f\) behaves, where you started, and which way you went. The Master Formula organizes this information into two geometrically different pieces, namely the gradient, containing generic information about how \(f\) changes, and the vector differential \(d\rr\text{,}\) containing information about the particular change in position being made.

We refer to (10.1.2) as the Master Formula, because it contains all of the information needed to determine the gradient, and does so without relying on a particular coordinate system.