This MedLibrary.org supplementary page on Chain rule is provided directly from the open source Wikipedia as a service to our readers. Please see the note below on authorship of this content, as well as the Wikipedia usage guidelines. To search for other content from our encyclopedia supplement, please use the form below:
Related Sponsors
In calculus, the chain rule is a formula for the derivative of the composite of two functions.
In intuitive terms, if a variable, y, depends on a second variable, u, which in turn depends on a third variable, x, then the rate of change of y with respect to x can be computed as the rate of change of y with respect to u multiplied by the rate of change of u with respect to x.
Contents |
Informal discussion
- For an explanation of notation used in this section, see Function composition.
The chain rule states that, under appropriate conditions,
which in short form is written as
Alternatively, in the Leibniz notation, the chain rule is
In integration, the counterpart to the chain rule is the substitution rule.
Theorem
The chain rule in one variable may be stated more completely as follows.[1] Let f be a real-valued function on (a,b) which is differentiable at c ∈ (a,b); and g a real-valued function defined on an interval I containing the range of f and f(c) as an interior point. If g is differentiable at f(c), then
is differentiable at x = c, and
Examples
Example I
Suppose that a mountain climber ascends at a rate of 0.5 kilometers per hour. The temperature is lower at higher elevations; suppose the rate by which it decreases is 6 °C per kilometer. If one multiplies 6 °C per kilometer by 0.5 kilometer per hour, one obtains 3 °C per hour. This calculation is a typical chain rule application.
Example II
Consider the function f(x) = (x2 + 1)3. Since f(x) = h(g(x)) where g(x) = x2 + 1 and h(x) = x3 it follows from the chain rule that
In order to differentiate the trigonometric function
one can write f(x) = h(g(x)) with h(x) = sin x and g(x) = x2. The chain rule then yields
since h′(g(x)) = cos(x2) and g′(x) = 2x.
Example III
Differentiate arctan(sin x).
Thus, by the chain rule,
and in particular,
Chain rule for several variables
The chain rule works for functions of more than one variable. Consider the function z = f(x, y) where x = g(t) and y = h(t), and g(t) and h(t) are differentiable with respect to t, then
Suppose that each argument of z = f(u, v) is a two-variable function such that u = h(x, y) and v = g(x, y), and that these functions are all differentiable. Then the chain rule would look like:
If we considered
above as a vector function, we can use vector notation to write the above equivalently as the dot product of the gradient of f and a derivative of
:
More generally, for functions of vectors to vectors, the chain rule says that the Jacobian matrix of a composite function is the product of the Jacobian matrices of the two functions:
Proof of the chain rule
Let f and g be functions and let x be a number such that f is differentiable at g(x) and g is differentiable at x. Then by the definition of differentiability,
where ε(δ) → 0 as δ → 0. Similarly,
where η(α) → 0 as α → 0.
Now
where
Observe that as δ → 0, αδ/δ → g′(x) and αδ → 0, and thus η(αδ) → 0. It follows that
The fundamental chain rule
The chain rule is a fundamental property of all definitions of derivative and is therefore valid in much more general contexts. For instance, if E, F and G are Banach spaces (which includes Euclidean space) and f : E → F and g : F → G are functions, and if x is an element of E such that f is differentiable at x and g is differentiable at f(x), then the derivative (the Fréchet derivative) of the composition g o f at the point x is given by
Note that the derivatives here are linear maps and not numbers. If the linear maps are represented as matrices (namely Jacobians), the composition on the right hand side turns into a matrix multiplication.
A particularly clear formulation of the chain rule can be achieved in the most general setting: let M, N and P be Ck manifolds (or even Banach-manifolds) and let
- f : M → N and g : N → P
be differentiable maps. The derivative of f, denoted by df, is then a map from the tangent bundle of M to the tangent bundle of N, and we may write
In this way, the formation of derivatives and tangent bundles is seen as a functor on the category of C∞ manifolds with C∞ maps as morphisms.
Tensors and the chain rule
See tensor field for an advanced explanation of the fundamental role the chain rule plays in the geometric nature of tensors.
Higher derivatives
Faà di Bruno's formula generalizes the chain rule to higher derivatives. The first few derivatives are
See also
- Inverse chain rule
- Triple product rule
- Derivative
- Leibniz integral rule
- Leibniz rule (generalized product rule)
References
- ^ Apostol, Tom (1974). Mathematical analysis, 2nd ed., Addison Wesley, Theorem 5.5.
External links
Wikipedia content modification information:
- This page was last modified on 20 August 2008, at 14:16.
Wikipedia Authorship and Review
Wikipedia content provided here is not reviewed directly by MedLibrary.org. Wikipedia content is authored by an open community of volunteers and is not produced by or in any way affiliated with MedLibrary.org.
Wikipedia Usage Guidelines
This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article on "Chain rule".
The URL for this specific entry is:
All Wikipedia text is available under the terms of the GNU Free Documentation License. (See Copyrights for details). Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc.
































