Proof of The Chain Rule of Derivatives

The Chain Rule is one of the major tools used in Differential Calculus (or Calculus I) derivation applications. It is very essential for the derivative of compositions of at least two different types of functions.

However, as easy as it seems to just use a standard formula to derive the composition of functions, it is significant to learn the concepts and foundations behind the creation of this standard formula that will satisfy the principles of the chain rule. Hence, in this chapter, we will focus mainly on the proofs of the chain rule formula.

CALCULUS
Chain rule formulas

Relevant for

Learning how to prove the chain rule using limits.

See proofs

CALCULUS
Chain rule formulas

Relevant for

Learning how to prove the chain rule using limits.

See proof

What is the Chain Rule?

The chain rule is defined as the derivative of the composition of at least two different types of functions. This rule can be used to derive a composition of functions such as but not limited to:

$$y’ = \frac{d}{dx}[f \left( g(x) \right)]$$

where g(x) is a domain of function f. In this composition, functions f and g must be two different types of function, which cannot be algebraically evaluated into a single type of function.

But how exactly do we derive that given function using the chain rule?

The Chain Rule states that the derivative of a composition of at least two different types of functions is equal to the derivative of the outside function f, and then multiplied by the derivative of its inner function g. The function g will be the domain of the derivative of the outside function f. 

To better illustrate, we have

$$\frac{d}{dx} (f(g(x))) = \frac{d}{dx} \left(f(g(x)) \right) \cdot \frac{d}{x}(g(x))$$

where you derive the outside function f by using the function f‘s derivative method while composing the original form of function g as its domain and then multiply the whole quantity by the derivative of the inner function g or g(x).

We can also illustrate the chain rule formula as:

$$\frac{dy}{dx} = \frac{dy}{du} \cdot \frac{du}{dx}$$

where

  • $latex f(u) =$ the outer function 
  • $latex u = g(x)$, the domain of outer function $latex f(u)$
  • $latex \frac{dy}{du}=$ the derivative of the outer function $latex f(u)$ in terms of $latex u$
  • $latex \frac{du}{dx} =$ the derivative of the inner function $latex g(x)$ in terms of $latex x$

Most of the time, this form of formula is used for beginners. Although, it has more steps, but it is proven to be more convenient and less confusing. This form uses a substitution method in the chain rule formula to derive the outside function/s.

Somewhat easy, right? But we should not take this formula superficially if we aim to be able to derive any composition of functions. In order to learn and understand the concepts behind the development of this chain rule formula, we need to be familiarized with any proof which would satisfy the statement of the chain rule.


Proof of The Chain Rule

To understand this proof, you are highly recommended to be familiarized with the topics, The Slope of a Tangent Line and Derivatives Using Limits.

We can recall that a derivative can be expressed in terms of limits in the following way:

$$\frac{d}{dx} f(x) = \lim \limits_{h \to 0} {\frac{f(x+h)-f(x)}{h}}$$

Now, let’s suppose we have the following function:

$latex H(x) = f(g(x))$

Then, if we get the derivative, we have,

$$H'(x) = \frac{d}{dx} \left( f(g(x)) \right)$

We can use the derivation in terms of limits:

$$H'(x) = \lim \limits_{h \to 0} {\frac{H(x+h)-H(x)}{h}}$$

By substituting the equation $latex H(x) = f(g(x))$, we have

$$H'(x) = \lim \limits_{h \to 0} {\frac{f(g(x+h)) – f(g(x))}{h}}$$

What manipulations can we do in this limit to arrive at the chain rule formula?

In our limit, we can multiply the function by

$$\frac{g(x+h)-g(x)}{g(x+h)-g(x)}$$

which is basically just equal to one. Thus, it won’t change our limit at all. By doing so, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{h \to 0} \left( {\frac{f(g(x+h)) – f(g(x))}{h}} \cdot \frac{g(x+h)-g(x)}{g(x+h)-g(x)} \right)$$

Since we have a product of two fractions in our limit equation, we can apply the commutative property of multiplication in their denominators. By doing so, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{h \to 0} \left( {\frac{f(g(x+h)) – f(g(x))}{g(x+h)-g(x)}} \cdot \frac{g(x+h)-g(x)}{h} \right)$$

By applying the properties of limits, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{h \to 0} \left(\frac{f(g(x+h)) – f(g(x))}{g(x+h)-g(x)} \right) \cdot \lim \limits_{h \to 0} \left(\frac{g(x+h)-g(x)}{h} \right)$$

The second part of the right-hand side is simply the derivative of g(x) written in limis. Then, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{h \to 0} \left(\frac{f(g(x+h)) – f(g(x))}{g(x+h)-g(x)} \right) \cdot \frac{d}{dx}(g(x))$$

We can use the laws of limits to rearrange our limit like this:

$$ \frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{h \to 0} \left(\frac{1}{g(x+h)-g(x)} \right) \cdot \lim \limits_{h \to 0} \Big( f(g(x+h)) – f(g(x)) \Big) \cdot \frac{d}{dx}(g(x))$$

Let’s mark this equation as $latex EQ.01$

But before evaluating $latex EQ.01$, let

$latex \gamma = g(x+h)-g(x)$

Let’s mark this equation as $latex EQ.02$

Equating $latex EQ.02$ in terms of $latex g(x+h)$, we have

$latex g(x+h) = g(x) + \gamma$

Let’s mark this equation as $latex EQ.03$

If we evaluate

$$\lim \limits_{h \to 0} \left( \frac{1}{g(x+h)-g(x)} \right)$$

Let’s mark this expression as $latex ex.01$

from $latex EQ.01$, we have

$$\lim \limits_{h \to 0} \left( \frac{1}{g(x+h)-g(x)} \right) = \lim \limits_{h \to 0} \left( \frac{1}{\gamma} \right) $$

$$\lim \limits_{h \to 0} \left( \lim \limits_{h \to 0} \left( \frac{1}{\gamma} \right)\right) = \lim \limits_{h \to 0} \left( \lim \limits_{h \to 0} \left( \frac{1}{0} \right)\right) $$

And if evaluate the limit of $latex ex.01$ in terms of $latex \gamma \to 0$, we have

$$\lim \limits_{\gamma \to 0} \left( \lim \limits_{h \to 0} \left( \frac{1}{\gamma} \right) \right)= \lim \limits_{\gamma \to 0} \left( \lim \limits_{h \to 0} \left( \frac{1}{0} \right) \right)$$

Because of this, we can conclude that

$latex h \to 0 = \gamma \to 0$

Therefore, we can now evaluate $latex EQ.01$ in terms of $latex \gamma \to 0$ instead of $latex h \to 0$ to further solve our remaining limits. Thus, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{\gamma \to 0} \left( \frac{1}{g(x+h)-g(x)} \right) \cdot \lim \limits_{\gamma \to 0} \Big( f(g(x+h))– f(g(x)) \Big) \cdot \frac{d}{dx}(g(x))$$

Let’s mark this equation as $latex EQ.04$

By substituting $latex EQ.02$ and $latex EQ.03$ into $latex EQ.04$, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{\gamma \to 0} \left( \frac{1}{\gamma} \right) \cdot \lim \limits_{\gamma \to 0} \Big( f(g(x) + \gamma) – f(g(x)) \Big) \cdot \frac{d}{dx}(g(x))$$

Multiplying the equation above, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \lim \limits_{\gamma \to 0} \left( {\frac{f(g(x) + \gamma) – f(g(x))}{\gamma}} \right) \cdot \frac{d}{dx}(g(x))$$

By solving the remaining limit, we have

$$\frac{d}{dx} \left( f(g(x)) \right) = \frac{d}{dx} \left( f(g(x)) \right) \cdot \frac{d}{dx}(g(x))$$

or it can be simply illustrated as

$$H'(x) = \left( f(g(x)) \right)’ \cdot g'(x)$$

which is now The Chain Rule Formula.

Finally, we now have proven the chain rule formula by applying the concepts of limits.


See also

Interested in learning more about the chain rule? Take a look at these pages:

Profile picture for author Jefferson Huera Guzman

Jefferson Huera Guzman

Jefferson is the lead author and administrator of Neurochispas.com. The interactive Mathematics and Physics content that I have created has helped many students.

Learn mathematics with our additional resources in different topics

LEARN MORE