Website owner: James Miller
PARTIAL DERIVATIVE. TOTAL DIFFERENTIAL. TOTAL DERIVATIVE. CHAIN RULE. DIRECTIONAL DERIVATIVE. DIFFERENTIATION UNDER THE INTEGRAL SIGN. LEIBNITZ’S RULE.
Let us consider a function
1) u = f(x, y, z, p, q, ... )
of several variables. Such a function can be studied by holding all variables except one constant and observing its variation with respect to one single selected variable. If we consider all the variables except x to be constant, then
represents the partial derivative of f(x, y, z, p, q, ... ) with respect to x (the hats indicating variables held fixed). The variables held fixed are viewed as parameters.
Def. Partial derivative. The partial derivative of a function of two or more variables with respect to one of its variables is the ordinary derivative of the function with respect to that variable, considering the other variables as constants.
Example. The partial derivative of 3x2y + 2y2 with respect to x is 6xy. Its partial derivative with respect to y is 3x2 + 4y.
The partial derivative of a function z = f(x, y, ...) with respect to the variable x is commonly written in any of the following ways:
Its derivative with respect to any other variable is written in a similar fashion.
Geometric interpretation. The geometric interpretation of a partial derivative is the same as that for an ordinary derivative. It represents the slope of the tangent to that curve represented by the function at a particular point P. In the case of a function of two variables
z = f(x, y)
Fig. 1 shows the interpretation of and . corresponds to the slope of the tangent to the curve APB at point P (where curve APB is the intersection of the surface with a plane through P perpendicular to the y axis). Similarly, corresponds to the slope of the tangent to the curve CPD at point P (where curve CPD is the intersection of the surface with a plane through P perpendicular to the x axis).
Total differential. In the case of a function of a single variable the differential of the function y = f(x) is the quantity
dy = f '(x) Δx .
This quantity is used to compute the approximate change in the value of f(x) due to a change Δx in x. As is shown in Fig. 2,
Δy = CB = f(x + Δx) - f(x)
dy = CT = f '(x)Δx .
When Δx is small the approximation is close. Line AT represents the tangent to the curve at point A.
In the case of a function of two variables the situation is analogous. Let us start at point A(x1, y1, z1) on the surface
z = f(x, y)
shown in Fig. 3 and let x and y change by small amounts Δx and Δy, respectively. The change produced in the value of the function z is
Δz = CB = f(x1 + Δx, y1 + Δy) - f(x1, y1) .
An approximation to Δz is given by
When Δx and Δy are small the approximation is close. Point T lies in that plane tangent to the surface at point A.
is called the total differential of the function z = f(x, y). Because is customary to denote increments Δx and Δy by dx and dy, the total differential of a function z = f(x, y) is defined as
The total differential of three or more variables is defined similarly. For a function z = f(x, y, .. , u) the total differential is defined as
Each of the terms represents a partial differential. For example, the term
is the partial differential of z with respect to x. The total differential is the sum of the partial differentials.
Total derivative. Chain rule for functions of functions. If w = f(x, y, z, ...) is a continuous function of n variables x, y, z, ..., with continuous partial derivatives , ... and if x, y, z, ... are differentiable functions x = x(t), y = y(t) , z = z(t), etc. of a variable t, then the total derivative of w with respect to t is given by
This rule is called the chain rule for the partial derivatives of functions of functions.
Similarly, if w = f(x, y, z, ...) is a continuous function of n variables x, y, z, ..., with continuous partial derivatives , ... and if x, y, z, ... are differentiable functions of m independent variables r, s, t ... , then
Note the similarity between total differentials and total derivatives. The total derivative 2) above can be obtained by dividing the total differential
As a special application of the chain rule let us consider the relation defined by the two equations
z = f(x, y); y = g(x)
Here, z is a function of x and y while y in turn is a function of x. Thus z is really a function of the single variable x. If we apply the chain rule we get
which is the total derivative of z with respect to x. Using 2a) we can derive yet another result. Suppose we are given the equation f(x, y) = 0 and wish to solve for y in terms of x. Using 2a) we obtain
Def. Scalar point function. A scalar point function is a function that assigns a real number (i.e. a scalar) to each point of some region of space. If to each point (x, y, z) of a region R in space there is assigned a real number u = Φ(x, y, z), then Φ is called a scalar point function.
Examples. 1. The temperature distribution within some body at a particular point in time. 2. The density distribution within some fluid at a particular point in time.
Directional derivatives. Let Φ(x, y, z) be a scalar point function defined over some region R of space. The function Φ(x, y, z) could, for example, represent the temperature distribution within some body. At some specified point P(x, y, z) of R we wish to know the rate of change of Φ in a particular direction. The rate of change of a function Φ at a particular point P, in a specified direction, is called the directional derivative of Φ at P in that direction. We specify the direction by supplying the direction angles or direction cosines of a unit vector e pointing in the desired direction.
Theorem. The rate of change of a function Φ(x, y, z) in the direction of a vector with direction angles (α, β, γ) is given by
where s corresponds to distance in the metric of the coordinate system. That direction for which the function Φ at point P has its maximum value is called the gradient of Φ at P.
We shall prove the theorem shortly. First let us consider the same problem for two dimensional space.
Let Φ(x, y) be a scalar point function defined over some region R of the plane. At some specified point P(x, y) of R we wish to know the rate of change of Φ in a particular direction. We specify the direction by supplying the angle α that a unit vector e pointing in the desired direction makes with the positive x direction. See Fig. 4. The rate of change of function Φ at point P in the direction of e corresponding to angle α is given by
where s corresponds to distance in the metric of the coordinate system. We show this as follows:
T = f(x, y)
where T is the temperature at any point of the plate shown in Fig. 5. We wish to derive expression 4) above. In other words, we wish to derive the expression for the rate of change of T with respect to the distance moved in any selected direction. Suppose we move from point P to point P'. This represents a displacement Δx in the x-direction and Δy in the y-direction. The distance moved along the plate is
The direction is given by the angle α that PP' makes with the positive x-direction. The change in the value of T corresponding to the displacement from P to P' is
where ε is a quantity that approaches 0 when Δx and Δy approach 0.
If we divide ΔT by the distance moved along the plate, we have
From Fig. 5 we observe that Δx/Δs = cos α and Δy/Δs = sin α . Making these substitutions and letting P' approach P along line PP', we have
This is the directional derivative of T in the direction α.
A geometric interpretation of a directional derivative in the case of a function z = f(x, y) is that of a tangent to the surface at point P as shown in Fig. 6.
Def. Directional derivative. The directional derivative of a scalar point function Φ(x, y, z) is the rate of change of the function Φ(x, y, z) at a particular point P(x, y, z) as measured in a specified direction.
Tech. Let Φ(x, y, z) be a scalar point function possessing first partial derivatives throughout some region R of space. Let P(x0, y0, z0) be some point in R at which we wish to compute the directional derivative and let P'(x1, y1, z1) be a neighboring point. Let the distance from P to P' be Δs. Then the directional derivative of Φ in the direction PP' is given by
where P' approaches P along the line PP' and Δs approaches 0.
Using this definition, let us now derive 3) above. In moving from P to P' the function Φ will change by an amount
where ε1, ε2, ε3 are higher order infinitesimals which approach zero as P' approaches P i.e. as Δx, Δy and Δz approach zero. If we divide the change ΔΦ by the distance Δs we obtain a measure of the rate at which Φ changes as we move from P to P':
We now observe that Δx/Δs, Δy/Δs, Δz/Δs are the direction cosines of the line segment PP'. They are also the direction cosines of a unit vector e located at P pointing in the direction of P'. If the direction angles of e are α, β, γ, then Δx/Δs, Δy/Δs, Δz/Δs are equal to cos α, cos β, and cos γ, respectively. Thus 6) becomes
Let us note that 7) can be written in vector form as the following dot product:
is called the gradient of Φ. Thus the directional derivative of Φ is equal to the dot product of the gradient of Φ and the vector e. In other words,
is the directional derivative of Φ in the direction of unit vector e.
If the vector e is pointed in the same direction as the gradient of Φ then the directional derivative of Φ is equal to the gradient of Φ.
Differentiation under the integral sign. Leibnitz’s rule. We now consider differentiation with respect to a parameter that occurs under an integral sign, or in the limits of integration, or in both places.
Theorem 1. Let
where a x b and f is assumed to be integrable on [a, b]. Then the function F(x) is continuous and = f(x) at each point where f(x) is continuous.
Theorem 2. Let f(x, α) and ∂f/∂α be continuous in some region R: (a x b, c α d) of the x-α plane. Let
Theorem 3. Leibnitz’s rule. Let
where u1 and u2 are functions of the parameter α i.e.
u1 = u1(α)
u2 = u2(α).
Let f(x, α) and ∂f/∂α be continuous in both x and α in a region R of the x-α plane that includes the region u1 x u2, c α d. Let u1 and u2 be continuous and have continuous derivatives for c α d. Then
where f(u1, α) is the expression obtained by substituting the expression u1(α) for x in f(x, α). Similarly for f(u2, α). The quantities f(u1, α) and f(u2, α) correspond to ∂G/u1 and ∂G/u2 respectively and 12) represents the chain rule.
Order of differentiation. For most functions that one meets
However, in some cases it is not true. Under what circumstances is it true? It is true if both functions fyx and fxy are continuous at the point where the partials are being taken.
Theorem. Let the function f(x, y) be defined in some neighborhood of the point (a, b). Let the partial derivatives fx, fy, fxy, and fyx also be defined in this neighborhood. Then if fxy and fyx are both continuous at (a, b), fxy(a, b) = fyx(a, b).
James & James. Mathematics Dictionary
Middlemiss. Differential and Integral Calculus
Spiegel. Advanced Calculus
Taylor. Advanced Calculus
Spiegel. Vector Analysis