derivation of some differentiation rules f '(x) = lim f (x + h) − f (x) h

Derivation of Some Differentiation Rules These notes are intended to provide methods of deriving some of the formulas used in differentiation which are different from those described in the textbook. We will be making use of the limit definition ( “h-definition” ) of the derivative of a function,

€

f ' ( x ) = limh→ 0

f ( x + h ) − f ( x )h

. Since we can (and will often need to) construct new functions by combining simpler functions through arithmetic operations or by composition of one function on another, we will need to know how to differentiate these newly-created functions. Product Rule We can create a function F ( x ) = f ( x ) · g ( x ) through multiplication of two simpler functions. In calculating its derivative F ’( x ) , it will be convenient to define a symbol for the change in a function by Δf = f ( x + h ) – f ( x ) , in order to save a bit of writing in places. So we will have f ( x + h ) = f ( x ) + Δf and we will need to apply binomial multiplication:

€

F ' ( x ) = [ f ( x ) g( x ) ]' = limh→ 0

f ( x + h ) g( x + h ) − f ( x ) g( x )h

€

= limh→ 0

[ f ( x ) + Δ f ] [ g( x ) + Δg ] − f ( x ) g( x )h

€

= limh→ 0

[ f ( x ) g( x ) + f ( x )Δg + g( x )Δ f + Δ f Δg ] − f ( x ) g( x )h

€

= limh→ 0

f ( x )Δg + g( x )Δ f + Δ f Δgh .

At this point, we can now express this result as the sum of three separate limits and write out explicitly the changes in the functions f and g :

€

[ f ( x ) g( x ) ]' = limh→ 0

f ( x )Δgh

+ limh→ 0

g( x )Δ fh

+ limh→ 0

Δ f Δgh

€

= limh→ 0

f ( x )[ g( x + h ) − g( x ) ]h

+ limh→ 0

g( x )[ f ( x + h ) − f ( x ) ]h

€

+ limh→ 0

[ f ( x + h ) − f ( x ) ][ g( x + h ) − g( x ) ]h

(continued)

€

= f ( x ) ⋅ limh→ 0

g( x + h ) − g( x )h

+ g( x ) ⋅ limh→ 0

f ( x + h ) − f ( x )h

€

+ limh→ 0

f ( x + h ) − f ( x )h

⋅ [ g( x + h ) − g( x ) ] ,

where we have extracted the factor which does not depend on h in the first two of these limit terms, and have simply separated one factor in the third limit. By applying the limit definition of a derivative function, we at last have

€

[ f ( x ) g( x ) ]' = f ( x ) ⋅g ' ( x ) + g( x ) ⋅ f ' ( x )

€

+ limh→ 0

f ( x + h ) − f ( x )h

⎡

⎣ ⎢ ⎤

⎦ ⎥ ⋅ [ lim

h→ 0g( x + h ) − g( x ) ]

“the limit of a product is the product of the limits”

€

= f ( x ) ⋅ g ' ( x ) + g( x ) ⋅ f ' ( x ) + f ' ( x ) ⋅ [ g( x + 0) − g( x ) ]*

€

= f ( x ) ⋅g ' ( x ) + g( x ) ⋅ f ' ( x ) + f ' ( x ) ⋅ 0 = f ( x ) ⋅ g ' ( x ) + g( x ) ⋅ f ' ( x ) . * We would obtain a similar result if we separated out the factor involving f instead. More simply, the Product Rule is often expressed as ( f g ) ’ = f ‘ · g + f · g ’ . Notice that this Rule can be repeatedly applied to work out the derivative for a product of more than two functions; for three functions, for instance,

€

( f ⋅ g ⋅ h ) ' = ( f ⋅ g ) ' ⋅ h + ( f ⋅ g ) ⋅ h '

€

= ( f ' ⋅ g + f ⋅ g ' ) ⋅ h + ( f ⋅ g ) ⋅ h ' = f ' ⋅ g h + f ⋅ g ' ⋅ h + f g ⋅ h ' . In other words, the derivative of any product of functions can be expressed by a set of terms in which each function is differentiated in turn and multiplied by all the other functions in the set. The Product Rule applies at those values of x for which every one of the functions in the product is continuous. Quotient Rule We take a similar approach here with a new function defined by the ratio of two

functions,

€

G ( x ) =f ( x )g( x ) . Naturally, we expect the algebra here to be a little more

complicated.

€

G ' ( x ) =f ( x )g( x )⎡ ⎣ ⎢

⎤ ⎦ ⎥

'= lim

h→ 0

f ( x + h )g( x + h ) −

f ( x )g( x )

h= lim

h→ 0

g( x ) ⋅ f ( x + h ) − g( x + h ) ⋅ f ( x )g( x + h ) ⋅ g( x )

⎡ ⎣ ⎢

⎤ ⎦ ⎥

h subtracting fractions in the numerator

€

= limh→ 0

g( x ) ⋅ [ f ( x ) + Δ f ] − f ( x ) ⋅ [ g( x ) + Δg ]h ⋅ g( x + h ) ⋅ g( x ) (continued)

€

=1

g( x )⋅ limh→ 0

f ( x ) g( x ) + g( x )Δ f − f ( x ) g( x ) − f ( x )Δgh ⋅ g( x + h )

using our expression for f ( x + h ) and g ( x + h ) and extracting a factor which does not involve h

€

=1

g( x )⋅ limh→ 0

g( x )Δ f − f ( x )Δgh ⋅ g( x + h )

.

We will now write out the changes in the functions f and g again, so that we can apply the limit definition of derivative:

€

f ( x )g( x )⎡ ⎣ ⎢

⎤ ⎦ ⎥

'=

1g( x )

⋅ limh→ 0

1g( x + h )

⋅g( x ) ⋅ [ f ( x + h ) − f ( x ) ] − f ( x )[ g( x + h ) − g( x ) ]

h⎧ ⎨ ⎩

⎫ ⎬ ⎭

⎡

⎣ ⎢

⎤

⎦ ⎥

€

=1

g( x )⋅ limh→ 0

1g( x + h )

⎡

⎣ ⎢

⎤

⎦ ⎥ ⋅ lim

h→ 0g( x ) ⋅ f ( x + h ) − f ( x )

h− lim

h→ 0f ( x ) ⋅ g( x + h ) − g( x )

h⎡

⎣ ⎢

⎤

⎦ ⎥

€

=1

g( x )⋅ limh→ 0

1g( x + h )

⎡

⎣ ⎢

⎤

⎦ ⎥ ⋅ g( x ) ⋅ lim

h→ 0

f ( x + h ) − f ( x )h

− f ( x ) ⋅ limh→ 0

g( x + h ) − g( x )h

⎡

⎣ ⎢

⎤

⎦ ⎥

extracting factors which do not involve h

€

=1

g( x )⋅

1g( x + 0)⎡

⎣ ⎢

⎤

⎦ ⎥ ⋅ g( x ) ⋅ f ' ( x ) − f ( x ) ⋅ g ' ( x )[ ] =

g( x ) ⋅ f ' ( x ) − f ( x ) ⋅ g ' ( x )[ g( x ) ]2

.

The Quotient Rule applies at those values of x for which both f ( x ) and g ( x ) are

continuous and where g ( x ) ≠ 0 (that is, where f ( x )/g( x ) is defined and thus

continuous) .

Chain Rule It is a bit more of a challenge to differentiate a composite function, which is formed by taking the result of one function and subjecting it to the operation of a second function. So we need to be somewhat careful about what the changes in the two functions mean. Applying the limit definition of derivative to the composite function H ( x ) = f ( g ( x ) ) , we have

€

H ' ( x ) = [ f (g( x ) ) ] ' = limh→ 0

f ( g( x + h ) ) − f (g( x ) )h

€

= limh→ 0

f ( g( x ) + Δg ) − f (g( x ) )h .

We write the last expression in this way as a reminder that the change in the composite function f ( g ( x ) ) is connected to the change in the function g ( x ) . When we then use our way of showing the shift in the value of the first term of the numerator to write

(continued)

€

[ f (g( x ) ) ] ' = limh→ 0

[ f ( g( x ) ) + Δ f ] − f (g( x ) )h

,

it is then perhaps easier to keep in mind that this change in the function f , Δf , is dependent upon the change in the function g , Δg (whereas in our derivations of the Product and Quotient Rules above, these changes were not connected). We can now say

€

[ f (g( x ) ) ] ' = limh→ 0

f ( g( x ) ) + Δ f − f (g( x ) )h

= limh→ 0

Δ fh

€

= limh→ 0

Δ fΔg

⋅Δgh

= limh→ 0

Δ fΔg

⋅ limh→ 0

Δgh

€

= limh→ 0

Δ fΔg

⋅ limh→ 0

g( x + h ) − g( x )h

= limh→ 0

Δ fΔg

⋅ g ' ( x ) .

What remains to be understood is this first limit term. Since it is certainly the case that Δg approaches zero as h approaches zero, we can think of this limit as

€

limh→ 0

Δ fΔg

= limΔg→ 0

Δ fΔg

= limΔg→ 0

f ( g( x ) + Δg ) − f (g( x ) )Δg

,

reverting the numerator to a form it had earlier. But this resembles the limit definition

for f ‘ ( x ) ,

€

limh→ 0

f ( x + h ) − f ( x )h , with Δg standing in for h and g ( x ) in place

of x . This limit in question then gives the derivative function f ‘ ( u ) evaluated at the value u = g ( x ) . This permits us to write the Chain Rule for differentiation of a composite function,

€

[ f (g( x ) ) ] ' = limh→ 0

Δ fΔg

⋅ g' ( x ) = f ' (u ) u = g(x) ⋅ g' ( x ) ,

or, as it is often more simply written,

€

[ f (g( x ) ) ] ' = f ' (g( x ) ) ⋅ g' ( x ) . The Chain Rule applies at those values of x for which both g ( x ) and f ( g ( x ) ) are continuous.

Derivatives of f ( x ) = sin x and g ( x ) = cos x These are the first of the elementary functions we encounter where something more than simple algebra is required in order to work out their derivative functions. We will need to construct a couple of new “limit laws” for the purpose.

The first of these is to find the value for

€

limx→ 0

sin xx . One method of

calculating this is provided in the textbook (Stewart, 6th ed., pp. 190-191). A couple of others are shown here to offer alternative approaches.

For any of these methods, we must consider a wedge of a circle of radius 1 , with center at point O and the angle ∠AOB having measure (size) θ . The area of this

wedge is Aw = ½ · r2 · θ = ½ · 12 · θ = ½ θ .

We can extend a line downward from point A which is perpendicular to the line OB and meets it at point C to form the right triangle ΔOCA . From trigonometry, we know that, since the hypotenuse OA is a radius of the circle and so has a length of 1 , then OC has length cos θ and AC has length sin θ . The segments OC and AC are

the base and altitude of the right triangle ΔOCA , so its area is AOCA

= ½ · cos θ · sin θ . We can then also extend a line upward from point B which is perpendicular to the line OB , and we will also extend the segment OA . These lines meet at a point D , allowing us to make another right triangle ΔOBD . Since OB is a radius of the circle, it has a length of 1 . Again, from trigonometry, the altitude of this triangle BD has a height h , thus h / 1 = tan θ ⇒ h = tan θ . As the segments OB and BD are the

base and altitude of this right triangle, its area is AOBD

= ½ · 1 · tan θ = ½ tan θ . The wedge of the circle is enclosed between these two right triangles, so we can write the inequality for the areas of these geometrical figures as

€

AOCA < Aw < AOBD ⇒ 12 cosθ sinθ < 1

2θ < 12 tanθ .

If we now divide the inequality through by ½ sin θ and take the limit of the terms as the angle approaches zero, we have

(continued)

€

12 cosθ sinθ12 sinθ

<12θ

12 sinθ

<12 tanθ12 sinθ

⇒ cosθ <θsinθ

<1

cosθ

€

⇒ limθ → 0+

cosθ < limθ → 0+

θsinθ

< limθ → 0+

1cosθ

.

Upon evaluating the limits at each end of the inequality, we find

€

1 < limθ → 0+

θsinθ

<11 ,

and therefore, by the “Squeeze Theorem”,

€

limθ → 0+

θsinθ

= 1 . By another of the limit

laws, we can now write

€

limθ → 0+

sinθθ

= 1lim

θ → 0+

θsinθ

=11

= 1 ,

giving us our new trigonometric limit law.

Another method involves lengths of lines and arcs, rather than the areas of wedges and triangles. We start once again with the wedge of the unit circle, OAB . Since the angle ∠AOB has measure θ , the length of the arc AB is s

w = θ · r = θ · 1 = θ .

We again “drop” a perpendicular line from point A to the line OB to form the right triangle ΔOCA . This time, we are interested in the length of this line, which is

the altitude of the triangle we earlier found to be LAC

= sin θ . We will now make a new circular wedge using the segment OC as the radius. The angle ∠DOC must also have measure θ . We know that OC has length cos θ , so

the length of the arc CD is sCD

= θ · rCD

= θ · cos θ . The way in which the altitude of the right triangle falls between the arcs of the two wedges gives us the inequality

€

sCD < LAC < sw ⇒ θ cosθ < sinθ < θ . (continued)

We divide this inequality through by the angle θ and take the limits of the terms as this angle approaches zero:

€

θ cosθθ

<sinθθ

<θθ

⇒ limθ → 0+

cosθ < limθ → 0+

sinθθ

< limθ → 0+

1 ⇒ 1 < limθ → 0+

sinθθ

< 1 ,

which gives us

€

limθ → 0+

sinθθ

= 1 by the “Squeeze Theorem”.

We can proceed from this result to the other limit law we will need. We can make a product of certain limits and then use the already known limit laws to write

€

limθ → 0+

sinθθ

⋅ limθ → 0+

sinθ ⋅ limθ → 0+

11+ cosθ

= 1 ⋅ 0 ⋅ 11+ 1⎛

⎝ ⎜

⎞

⎠ ⎟ = 0

€

⇒ limθ → 0+

sinθθ

⋅ sinθ ⋅1

1+ cosθ= lim

θ → 0+

sin2θθ (1+ cosθ )

= 0

€

⇒ limθ → 0+

(1 − cos2θ )θ (1+ cosθ )

= 0 ⇒ limθ → 0+

(1− cosθ ) ⋅ (1+ cosθ )θ (1+ cosθ )

= 0 applying the Pythagorean Identity factoring difference of two squares

€

⇒ limθ → 0+

(1− cosθ )θ

= 0 . safe to divide through, since

€

limθ → 0

(1+ cosθ ) ≠ 0

We now have the trigonometric limit laws we need to calculate the derivative functions for sin x and cos x . Using the “angle-addition formulas” for sine and cosine (discussed in another Note), we have the limits

€

[ sin x ]' = limh→ 0

sin ( x + h ) − sin xh

= limh→ 0

(sin x cos h + cos x sinh ) − sin xh

€

= limh→ 0

(sin x cos h − sin x )h

+ limh→ 0

(cos x sinh )h

€

= limh→ 0

sin x ⋅( cos h − 1)

h+ lim

h→ 0cos x ⋅

sinhh

€

= limh→ 0

sin x ⋅ limh→ 0

(cos h − 1)h

⎡

⎣ ⎢ ⎤

⎦ ⎥ + lim

h→ 0cos x ⋅ lim

h→ 0

sinhh

⎡

⎣ ⎢ ⎤

⎦ ⎥

€

= (sin x ⋅ 0) + ( cos x ⋅ 1) = cos x and

€

[ cos x ]' = limh→ 0

cos ( x + h ) − cos xh

= limh→ 0

(cos x cos h − sin x sinh ) − cos xh

€

= limh→ 0

(cos x cos h − cos x )h

+ limh→ 0

(− sin x sinh )h

€

= limh→ 0

cos x ⋅( cos h − 1)

h− lim

h→ 0sin x ⋅

sinhh

€

= limh→ 0

cos x ⋅ limh→ 0

(cos h − 1)h

⎡

⎣ ⎢ ⎤

⎦ ⎥ − lim

h→ 0sin x ⋅ lim

h→ 0

sinhh

⎡

⎣ ⎢ ⎤

⎦ ⎥

€

= ( cos x ⋅ 0) − (sin x ⋅ 1) = − sin x . Derivative of the general exponential function This is another function that requires some specific handling and also touches upon topics beyond the scope of Calculus I. We can apply the limit definition of derivative to the general exponential function f ( x ) = ax , with a > 0 , to obtain

€

[ ax ]' = limh→ 0

ax + h − ax

h= lim

h→ 0

(ax ⋅ ah ) − ax

h= lim

h→ 0ax ⋅ a

h − 1h

applying properties of exponents

€

= ax ⋅ limh→ 0

ah − 1h

. extracting factor which does not involve h

We are not in a position to evaluate this last limit (we will know how to do that in Calculus II), but we can recognize that this is the point derivative for our function, f ‘ ( 0 ) , the slope of the tangent line to the exponential function y = ax at x = 0 (as discussed in Stewart, 6th ed., pp. 178-79). By experimenting with different values of a > 0 , we find that this limit has a value which depends upon the value of a . Mathematicians basically assign a name to the value at which this limit is exactly 1 ; that number is called ‘ e ‘ . (This is to say

that we don’t prove that e is the number for which this limit is 1 ; instead, we prove that there must be such a number and the value at which this occurs is approximately 2.718281828… , which is designated as the constant ‘ e ‘ .) So we can say that

€

limh→ 0

eh − 1h

= 1 and thus [ ex ]' = ex ⋅ limh→ 0

eh − 1h

= ex ⋅ 1 = ex .

The function ex is thus a function which is its own derivative function; in fact, it is the only (non-constant) function for which that is the case. Because it emerges directly from the structure of mathematics, ex is called the “natural exponential function”.

We can take this a bit further by looking at the function g ( x ) = ekx , for which the limit definition of derivative yields

€

[ ekx ]' = limh→ 0

ek ( x + h ) − ekx

h= ekx ⋅ lim

h→ 0

ekh − 1h

, following the argument we used above for ax . Now if k is a positive integer, we can

write the numerator of the ratio in the limit expression as ( eh )k – 1 , and apply the so-called “geometric expression”,

€

xk − 1 = ( x − 1) ⋅ ( xk −1 + xk − 2 + K + x 2 + x + 1k terms

1 2 4 4 4 4 4 4 3 4 4 4 4 4 4 ) ,

to re-write the derivative function as

€

[ ekx ]' = ekx ⋅ limh→ 0

(eh )k − 1h

€

= ekx ⋅ limh→ 0

(eh − 1) ⋅ ([ eh ]k −1 + [ eh ]k − 2 + K + [ eh ]2 + eh + 1)h

€

= ekx ⋅ limh→ 0

(eh − 1)h

⋅ ([ eh ]k −1 + [ eh ]k − 2 + K + [ eh ]2 + eh + 1k terms

1 2 4 4 4 4 4 4 4 4 4 3 4 4 4 4 4 4 4 4 4 )

€

= ekx ⋅ 1 ⋅ ([1]k −1 + [1]k − 2 + K + [1]2 + 1 + 1k terms

1 2 4 4 4 4 4 4 4 3 4 4 4 4 4 4 4 ) = ekx ⋅ 1 ⋅ k .

Hence, we have shown that

€

[ ekx ]' = k ⋅ ekx , at least when k is a positive integer. This is akin to the proof we’ve given earlier in the course that

€

[ xn ]' = n ⋅ xn −1 , where n is a positive integer (see, for example, Stewart, p. 174).

We can now show immediately that for a = ek , with k being a positive integer , that

€

[ ax ]' = k ⋅ ax . But from what we’ve learned prior to this course,

a = ek ⇒ k = ln a . So we can argue plausibly that

€

[ ax ]' = ( ln a ) ⋅ ax , even though we have really only so far shown it to be true when ln a is a positive integer. We will be able to demonstrate (elsewhere) the derivative rule for ax more generally using the Chain Rule. From the discussion earlier, we have also shown that the slope of

the tangent line to f ( x ) = ax at x = 0 is

€

f ' ( 0) = limh→ 0

ah − 1h = ln a .

-- G. Ruffa May – June 2010

derivation of some differentiation rules f '(x) = lim f (x + h) − f (x) h

Documents