PARTIAL DERIVATIVE, TOTAL DIFFERENTIAL, TOTAL DERIVATIVE, CHAIN RULE, DIRECTIONAL DERIVATIVE
Let us consider a function
1) u = f(x, y, z, p, q, ... )
of several variables. Such a function can be studied by holding all variables except one constant and observing its variation with respect to one single selected variable. If we consider all the variables except x to be constant, then
represents the partial derivative of f(x, y, z, p, q, ... ) with respect to x (the hats indicating variables held fixed). The variables held fixed are viewed as parameters.
Def. Partial derivative. The partial derivative of a function of two or more variables with respect to one of its variables is the ordinary derivative of the function with respect to that variable, considering the other variables as constants.
Example. The partial derivative of 3x2y + 2y2 with respect to x is 6xy. Its partial derivative with respect to y is 3x2 + 4y.
The partial derivative of a function z = f(x, y, ...) with respect to the variable x is commonly written in any of the following ways:
Its derivative with respect to any other variable is written in a similar fashion.

Geometric interpretation. The geometric interpretation of a partial derivative is the same as that for an ordinary derivative. It represents the slope of the tangent to that curve represented by the function at a particular point P. In the case of a function of two variables
z = f(x, y)
Fig. 1 shows the interpretation of
and
.
corresponds to the slope of
the tangent to the curve APB at point P (where curve APB is the intersection of the surface with a
plane through P perpendicular to the y axis). Similarly,
corresponds to the slope of the
tangent to the curve CPD at point P (where curve CPD is the intersection of the surface with a
plane through P perpendicular to the x axis).
Total differential. In the case of a function of a single variable the differential of the function y = f(x) is the quantity

dy = f '(x) Δx .
This quantity is used to compute the approximate change in the value of f(x) due to a change Δx in x. As is shown in Fig. 2,
Δy = CB = f(x + Δx) - f(x)
while
dy = CT = f '(x)Δx .

When Δx is small the approximation is close. Line AT represents the tangent to the curve at point A.
In the case of a function of two variables the situation is analogous. Let us start at point A(x1, y1, z1) on the surface
z = f(x, y)
shown in Fig. 3 and let x and y change by small amounts Δx and Δy, respectively. The change produced in the value of the function z is
Δz = CB = f(x1 + Δx, y1 + Δy) - f(x1, y1) .
An approximation to Δz is given by
When Δx and Δy are small the approximation is close. Point T lies in that plane tangent to the surface at point A.
The quantity
is called the total differential of the function z = f(x, y). Because is customary to denote increments Δx and Δy by dx and dy, the total differential of a function z = f(x, y) is defined as
The total differential of three or more variables is defined similarly. For a function z = f(x, y, .. , u) the total differential is defined as
Each of the terms represents a partial differential. For example, the term
is the partial differential of z with respect to x. The total differential is the sum of the partial differentials.
Total derivative. Chain rule for functions of functions. If w = f(x, y, z, ...)
is a continuous function of n variables x, y, z, ..., with continuous partial derivatives
, ... and if x, y, z, ... are differentiable functions x = x(t), y = y(t) , z =
z(t), etc. of a variable t, then the total derivative of w with respect to t is given by
2)
This rule is called the chain rule for the partial derivatives of functions of functions.
Similarly, if w = f(x, y, z, ...) is a continuous function of n variables x, y, z, ..., with continuous
partial derivatives
, ... and if x, y, z, ... are differentiable functions of
m independent variables r, s, t ... , then
Note the similarity between total differentials and total derivatives. The total derivative 2) above can be obtained by dividing the total differential
by dt.
As a special application of the chain rule let us consider the relation defined by the two equations
z = f(x, y); y = g(x)
Here, z is a function of x and y while y in turn is a function of x. Thus z is really a function of the single variable x. If we apply the chain rule we get
which is the total derivative of z with respect to x.
Def. Scalar point function. A scalar point function is a function that assigns a real number (i.e. a scalar) to each point of some region of space. If to each point (x, y, z) of a region R in space there is assigned a real number u = Φ(x, y, z), then Φ is called a scalar point function.
Examples. 1. The temperature distribution within some body at a particular point in time. 2. The density distribution within some fluid at a particular point in time.
Directional derivatives. Let Φ(x, y, z) be a scalar point function defined over some region R of space. The function Φ(x, y, z) could, for example, represent the temperature distribution within some body. At some specified point P(x, y, z) of R we wish to know the rate of change of Φ in a particular direction. The rate of change of a function Φ at a particular point P, in a specified direction, is called the directional derivative of Φ at P in that direction. We specify the direction by supplying the direction angles or direction cosines of a unit vector e pointing in the desired direction.
Theorem. The rate of change of a function Φ(x, y, z) in the direction of a vector with direction angles (α, β, γ) is given by
3)
where s corresponds to distance in the metric of the coordinate system. That direction for which the function Φ at point P has its maximum value is called the gradient of Φ at P.

We shall prove the theorem shortly. First let us consider the same problem for two dimensional space.
Let Φ(x, y) be a scalar point function defined over some region R of the plane. At some specified point P(x, y) of R we wish to know the rate of change of Φ in a particular direction. We specify the direction by supplying the angle α that a unit vector e pointing in the desired direction makes with the positive x direction. See Fig. 4. The rate of change of function Φ at point P in the direction of e corresponding to angle α is given by
where s corresponds to distance in the metric of the coordinate system. We show this as follows:
Let
T = f(x, y)

where T is the temperature at any point of the plate shown in Fig. 5. We wish to derive expression 4) above. In other words, we wish to derive the expression for the rate of change of T with respect to the distance moved in any selected direction. Suppose we move from point P to point P'. This represents a displacement Δx in the x-direction and Δy in the y-direction. The distance moved along the plate is
The direction is given by the angle α that PP' makes with the positive x-direction. The change in the value of T corresponding to the displacement from P to P' is
where ε is a quantity that approaches 0 when Δx and Δy approach 0.
If we divide ΔT by the distance moved along the plate, we have

From Fig. 5 we observe that Δx/Δs = cos α and Δy/Δs = sin α . Making these substitutions and letting P' approach P along line PP', we have
This is the directional derivative of T in the direction α.
A geometric interpretation of a directional derivative in the case of a function z = f(x, y) is that of a tangent to the surface at point P as shown in Fig. 6.
Def. Directional derivative. The directional derivative of a scalar point function Φ(x, y, z) is the rate of change of the function Φ(x, y, z) at a particular point P(x, y, z) as measured in a specified direction.
Tech. Let Φ(x, y, z) be a scalar point function possessing first partial derivatives throughout some region R of space. Let P(x0, y0, z0) be some point in R at which we wish to compute the directional derivative and let P'(x1, y1, z1) be a neighboring point. Let the distance from P to P' be Δs. Then the directional derivative of Φ in the direction PP' is given by
5)
where P' approaches P along the line PP' and Δs approaches 0.
Using this definition, let us now derive 3) above. In moving from P to P' the function Φ will change by an amount
where ε1, ε2, ε3 are higher order infinitesimals which approach zero as P' approaches P i.e. as Δx, Δy and Δz approach zero. If we divide the change ΔΦ by the distance Δs we obtain a measure of the rate at which Φ changes as we move from P to P':
6)
We now observe that Δx/Δs, Δy/Δs, Δz/Δs are the direction cosines of the line segment PP'. They are also the direction cosines of a unit vector e located at P pointing in the direction of P'. If the direction angles of e are α, β, γ, then Δx/Δs, Δy/Δs, Δz/Δs are equal to cos α, cos β, and cos γ, respectively. Thus 6) becomes
and
7)
Let us note that 7) can be written in vector form as the following dot product:
8)
The vector
is called the gradient of Φ. Thus the directional derivative of Φ is equal to the dot product of the gradient of Φ and the vector e. In other words,
where
is the directional derivative of Φ in the direction of unit vector e.
If the vector e is pointed in the same direction as the gradient of Φ then the directional derivative of Φ is equal to the gradient of Φ.
Order of differentiation. For most functions that one meets
However, in some cases it is not true. Under what circumstances is it true? It is true if both functions fyx and fxy are continuous at the point where the partials are being taken.
Theorem. Let the function f(x, y) be defined in some neighborhood of the point (a, b). Let the partial derivatives fx, fy, fxy, and fyx also be defined in this neighborhood. Then if fxy and fyx are both continuous at (a, b), fxy(a, b) = fyx(a, b).