Dif F
Dif F
Differentiation *
Leonid Pankratov
But just as much as it is easy to find the differential [derivative] of a given quantity, so it is difficult
to find the integral of a given differential. Moreover, sometimes we cannot say with certainty whether
the integral of a given quantity can be found or not.
Johann Bernoulli
Science is the Differential Calculus of the mind. Art the Integral Calculus; they may be beautiful
when apart, but are greatest only when combined. Ronald Ross
* File :DIF-F.tex
1
Contents
1 Definition and Basic Properties of the Derivative 4
3 Higher-Order Derivatives 13
7 L’Hôspital’s Rule 24
8 Taylor’s Theorem 31
2
Table of derivatives
1. C ′ = 0
1
2. x′ = 1 12. (cot x)′ = − , x ̸= kπ, k ∈ Z
sin2 x
√ ′ 1
3. x = √ , x>0 1
2 x 13. (arcsin x)′ = √ , |x| < 1
1 − x2
4. (ax )′ = ax ln a, a > 0 1
14. (arccos x)′ = − √ , |x| < 1
5. (xα )′ = αxα−1 , x > 0 1 − x2
6. (ex )′ = ex 1
15. (arctan x)′ =
1 1 + x2
7. (loga x)′ = , x, a > 0 1
x ln a 16. (arccotx)′ = −
1 1 + x2
8. (ln x)′ = , x > 0 17. (sinh x)′ = cosh x
x
′
9. (sin x) = cos x 18. (cosh x)′ = sinh x
10. (cos x)′ = − sin x 19. (tanh x)′ =
1
1 cosh2 x
11. (tan x)′ = , 1
cos2 x 20. (cotanhx)′ = − , x ̸= 0
π
x ̸= + kπ, k ∈ Z sinh2 x
2
3
In this chapter, we discuss basic properties of the derivative of a function and several major theo-
rems, including the Mean Value Theorem and L’Hôpital’s Rule. The applications of derivatives will
be discussed in the next chapter.
f (x) − f (a)
x−a
is defined on G \ {a}. If we think about x as time variable, and f (x) the distance from the start point,
then the last quotient represents the average velocity.
Since G is an open set, then a is a limit point of G \ {a}, that is a point each neighborhood of
which contains at least one point of G \ {a} different from a. To introduce the instantaneous velocity,
we consider the limit
f (x) − f (a)
lim .
x→a x−a
Definition 1.1 (The Derivative at a Point) Let G be an open subset of R and let a ∈ G. The function
f defined on G is said to have a finite derivative f ′ (x) at a if the limit
f (x) − f (a)
lim
x→a x−a
Definition 1.2 (One-sided derivative) A generalization of the concept of a derivative, in which the
ordinary limit is replaced by a one-sided limit. If the following limit exists for a function f of a real
variable x :
f (x) − f (a) f (x) − f (a)
lim or lim
x→a+0 x−a x→a−0 x−a
then it is called the right (respectively, left) derivative of f at the point a.
Notice that if the one-sided derivatives are equal, then the function has an ordinary derivative at a.
Definition 1.3 (The Differentiability) Function f (x) defined in some neighborhood of x0 is referred
to as differentiable at x0 , if its increment at this point can be represented as
4
where
lim α(∆x) = 0,
∆x→0
i.e.
∆f (x0 ) = A∆x + o(∆x), ∆x → 0.
Theorem 1.4 Function f is differentiable at x0 if and only if finite f ′ (x0 ) exists and, moreover,
A = f ′ (x0 ).
and
f (x0 + ∆x) − f (x0 )
= A + α(∆x),
∆x
i.e.,
f (x0 + ∆x) − f (x0 )
f ′ (x0 ) = lim = A.
∆x→0 ∆x
Let now
f (x0 + ∆x) − f (x0 )
lim = A.
∆x→0 ∆x
Then
f (x0 + ∆x) − f (x0 )
= A + α(∆x), with lim α(∆x) = 0.
∆x ∆x→0
Example 1.6 In this example we consider some simple functions for which the derivative can be
obtained directly from the definition.
5
1. Let f : R → R be defined by f (x) = x and let a ∈ R. Then
= lim (x + a) = 2a.
x→a
and
f (x) − f (0) |x| −x
lim− = lim− = lim+ = −1.
x→0 x−0 x→0 x x→0 x
This means that the limit
f (x) − f (0)
lim
x→0 x−0
does not exist and f = |x| is not differentiable at 0.
The next result follows directly from the definition of continuity and Definition 1.1.
Theorem 1.8 Let G be an open subset of R and let f, g : G → R. Suppose that the functions f, g
are differentiable at a ∈ G. Then we have
6
2. Let C be a constant. The function Cf is differentiable at a and
(C f )′ = C f ′ (a).
4. Suppose now that g(a) ̸= 0. Then the quotient (f /g) is differentiable at a and
′
f f ′ (a)g(a) − f (a)g ′ (a)
(a) = .
g (g(a))2
Proof of Theorem 1.8. The proofs of statements (1), (2) are evident and we leave them to the reader.
1 f (a)
(f (x) − f (a)) + (g(a) − g(x))
g(x) g(x)g(a)
=
x−a
1 f (x) − f (a) g(x) − g(a)
= g(a) − f (a) .
g(x)g(a) x−a x−a
Passing to the limit as x → a, we obtain statement (4). This completes the proof of Theorem 1.8.
7
Example 1.9 Let f : R → R be defined by f (x) = x2 and let a ∈ R. Using the previous example
and Theorem 1.8, statement (3), compute f ′ (a).
Using the method of mathematical induction, one obtains the derivative of g(x) = xn for n ∈ N:
(xn )′ = n xn−1 . (1.2)
Moreover, if we make use of this fact and Theorem 1.8 we obtain the well known formula for the
derivative of a polynomial p(x) = an xn + ... + a1 x + a0 . It is given by p ′ (x) = nan xn−1 + ... + a1 .
The following result is useful for studying the differentiability of the composition of two functions
f ◦g :A→C
g:A→B f :B→C
x g(x) f (g(x))
A B C
Lemma 1.1 Let G be an open subset of R and f : G → R and let f be differentiable at a. Then
there is a function u : G → R such that
8
The function f is differentiable at a. Then
f (x) − f (a)
lim u(x) = lim − f ′ (a) = f ′ (a) − f ′ (a) = 0.
x→a x→a x−a
Thus, the function u satisfies all the conditions of the lemma.
Theorem 1.10 (Chain Rule) Let f : G1 → R and let g : G2 → R, where G1 , G2 are open subsets
of R with such that f (G1 ) ⊂ G2 . Let f be differentiable at a and g be differentiable at f (a). Then the
composition of functions g ◦ f is differentiable at a and
Proof of Theorem 1.10. We observe that the function f is differentiable at a. Then by Lemma 1.1,
there is a function u defined on G1 such that
f (x) − f (a) = f ′ (a) + u(x) (x − a) ∀x ∈ G1 , lim u(x) = 0.
(1.6)
x→a
In a similar way, due to the differentiability of g at f (a), there is a function v defined on G2 with
g(t) − g(f (a)) = g ′ (f (a)) + v(t) t − f (a) ∀t ∈ G2 , lim v(t) = 0.
(1.7)
t→f (a)
Therefore
g(f (x)) − g(f (a))
lim = g ′ (f (a))f ′ (a).
x→a x−a
This completes the proof of Theorem 1.10
Find h′ (x).
9
Solution to Example 1.11. The function h(x) is a polynomial and we can find h′ (x) by expanding
the power and using the previous example. However, Theorem 1.10 suggests a different approach.
We define the functions f, g : R → R as follows:
Then the function h can be defined as h = g ◦ f . Given a ∈ R from Theorem 1.10 we have that
Example 1.12 By iterating the chain rule, generalize the result of Theorem 1.10 to a composition of
more than two functions.
Solution to Example 1.12. For the sake of simplicity, we study the case of three functions. Consider
the functions f : G1 → R, g : G2 → R, and h : G3 → R such that f (G1 ) ⊂ G2 , g(G2 ) ⊂ G3 , the
function f is differentiable at a, g is differentiable at f (a), and h is differentiable at g(f (a)). Then
we have that h ◦ g ◦ f is differentiable at a and
′
h ◦ g ◦ f = h′ g(f (a)) g ′ f (a) f ′ (a).
Proof of Theorem 2.1. Since the functions f and f −1 are mutually inverse, then
are both nonzero if x ̸= x0 . Moreover, it follows from the continuity of the function f at the point x0
and f −1 at the point y0 that
(X ∋ x → x0 ) ⇔ (Y ∋ y → y0 ).
10
Applying now the theorem on the limit of a composite function and the arithmetic properties of the
limit, we get
f −1 (y) − f −1 (y0 ) x − x0 1
lim = lim = lim f (x)−f (x0 )
=
Y ∋y→y0 y − y0 X∋x→x0 f (x) − f (x0 ) X∋x→x0
x−x0
1
= .
f ′ (x 0)
This means that the function f −1 : Y → X has a derivative at the point y0 and
′ 1
f −1 (y0 ) = ′ .
f (x0 )
This completes the proof of Theorem 2.1.
f −1 ◦ f (x) = x
11
Example 2.2 Let us show (see (13) in the table of derivatives) that
1
arcsin′ y = p |y| < 1.
1 − y2
Solution to Example 2.2. First, we consider the function arcsin x. The sine function sin x is not
one-to-one, and the inverse cannot be defined for all x. Indeed, sin 0 = sin π = 0. We should restrict
the sine in order to have an inverse. The arcsin x is defined as the inverse of sin y for −1 ⩽ x ⩽ 1
and −π/2 ⩽ y ⩽ π/2 so that
sin y = x, arcsin x = sin−1 x = y.
The arcsine-function returns the angle whose sine is a given number. Notice that sin−1 x means the
inverse sin and does not mean 1/ sin.
For example,
π
arcsin 1 = sin−1 1 = = 90◦ .
2
The functions
sin : [−π/2, π/2] → [1, 1] and arcsin : [1, 1] → [−π/2, π/2]
are mutually inverse and continuous. Moreover
sin′ (x) = cos x ̸= 0 if |x| < π/2.
If |x| < π/2, then |y| < 1 for y = sin x. Therefore, by Theorem 2.1
1 1 1 1
arcsin′ y = ′ = =p =p .
sin x cos x 1 − sin2 x 1 − y2
Notice that the sign in front of the radical is taken due to the inequality
cos x > 0 for |x| < π/2.
12
Example 2.4 Show that
1
arctan′ y = y ∈ R.
1 + y2
Solution to Example 2.4. Consider the function arctan x. The arctan x is defined as the inverse
tan x function for x ∈ R when tan y = x, −π/2 < y < π/2. Then the arctangent is such that
arctan x = tan−1 x = y. The derivative is
1 1 cos2 x
arctan′ y = = = cos2 x =
tan′ x 1 cos2 x + sin2 x
cos2 x
1 1
= 2
= .
1 + tan x 1 + y2
The desired result is then proved.
3 Higher-Order Derivatives
Suppose that a function f : E → R (E is an open set) is differentiable at every point x ∈ E. Then
the function f ′ : E → R can be defined and its value at x ∈ E is the derivative f ′ of f at this point.
The function f ′ : E → R can also have a derivative (f ′ )′ : E → R. This derivative is said to be
the second derivative of the function f and is usually denoted as
d2 f (x)
f ′′ (x) or .
dx2
In order to generalize this notion, we give the following definition.
Definition 3.1 If the derivative f (n−1) (x) of order (n − 1) of f is defined, then the derivative of order
n is given by
′
f (n) (x) = f (n−1) (x)
The following notation are used in the mathematical literature for the derivative of order n
dn f (x)
f (n) (x) or .
dxn
def
By definition, f (0) (x) = f (x).
Theorem 3.2 (The Leibniz formula) Let u(x) and v(x) be functions with the derivatives up to order
n on a set E. Then Leibniz formula is valid for the n-th derivative of their product. Namely,
n
X
(n)
(uv) = Cnk u(n−k) v (k) , (3.1)
k=0
13
where the binomial coefficients are
n!
Cnk = .
(n − k)!k!
Proof of Theorem 3.2. The proof of (3.1) will be done by the method of mathematical induction.
For n = 1 formula (3.1) is exactly the formula for the derivative of a product of two functions.
If the functions u and v have derivatives up to order (n + 1) then we assume that (3.1) holds for order
n. We differentiate the left-hand and right-hand sides to find
n
X n
X
(n+1)
(uv) = Cnk u (n−k+1) (k)
v + Cnk u(n−k) v (k+1) =
k=0 k=0
n
X
(n+1) (0)
m
Cn + Cnm−1 u(n+1−m) v (m) + u(0) v (n+1) .
=u v + (3.2)
m=1
are parametric equations defining the circle centered at the origin of radius R.
14
Let us find the derivative of a parametric function. To this end we suppose that the functions x = x(t),
y = y(t) are differentiable in the interval α < t < β and x′ (t) ̸= 0. Moreover, we assume that
x = x(t) has an inverse function t = t(x). By the inverse function theorem,
dt 1
= t′x = ′ . (4.2)
dx xt
1 yt′
yx′ = yt′ · t′x = yt′ · = . (4.4)
x′t x′t
This formula enables to find the derivative of a parametric function y(x). In this case we do not
need the explicit form of this function.
Let us consider some examples.
x = t2 and y = t3 .
Solution to Example 4.1. First, we find the derivatives of the functions x, y with respect to the
parameter t. We have ′ ′
x′t = t2 = 2t, yt′ = t3 = 3t2 .
Then
dy y′ 3t2 3t
= yx′ = t′ = = (t ̸= 0) .
dx xt 2t 2
Solution to Example 4.2. In this case, as in the previous example, we have that
′ ′
x′t = e2t = 2e2t , yt′ = e3t = 3e3t .
Therefore
dy y′ 3e3t 3 3
= yx′ = t′ = 2t = e3t−2t = et .
dx xt 2e 2 2
15
Now we turn to the high-order derivatives of functions defined parametrically. Namely, let us
consider a function y = f (x) given by
x = x(t), y = y(t). (4.5)
Formula (4.4) implies that the first derivative of y is given by
yt′
y ′ = yx′ = .
x′t
We differentiate this relation with respect to x in order to obtain the second derivative of the function
y(x). We have that ′ ′ ′ ′
′′ yt ′ yt 1
yxx = ′
· tx = ′
· ′.
xt t xt t xt
Thus
′′ x′ y ′′ − y ′ x′′
yxx = t t ′ 3t t . (4.6)
(xt )
Find the first and the second derivatives of y(x) with respect to x.
Solution to Example 4.3. By (4.4), we have
yt′ sin t 2 sin(t/2) cos(t/2)
yx′ = ′ = = = cot(t/2).
xt 1 − cos t 2 sin2 (t/2)
Now we differentiate this relation with respect to x to get
′ ′
′′ t ′ t 1 1 1
yxx = cot · tx = cot · ′ =− 2 · =
2 t 2 t xt 2 sin (t/2) 1 − cos t
1
=− 4 .
4 sin (t/2)
16
Definition 5.1 Let D be a nonempty subset of R and let f : D → R. The function f is said to have
a local (or relative) minimum at c ∈ D if there is δ > 0 such that
In a similar way, the function f is said to have a local (or relative) maximum at a c ∈ D if there
is δ > 0 such that
f (x) ⩽ f (c) for all x ∈ B(c, d) ∩ D.
Theorem 5.2 (Fermat’s Rule) Let I be an open interval and f : I → R. If the function f has a
local minimum (maximum) at c ∈ I and it is differentiable at c, then
f ′ (c) = 0.
y f ′ (b) = 0
y = f (x)
a
b x
f ′ (a) = 0
Proof of Theorem 5.2. Let us assume that the function f has a local minimum at the point c (the case
of a local maximum is similar). Then there is δ > 0 sufficiently small such that
f (x) − f (c)
⩾0 for all x ∈ B+ (c, d).
x−c
Now it follows from the differentiability of f at c that
17
In a similar way
f (x) − f (c)
⩽0 for all x ∈ B− (c, d)
x−c
and
f (x) − f (c) f (x) − f (c)
f ′ (c) = lim = lim− ⩽ 0.
x→c x−c x→c x−c
Therefore, f ′ (c) = 0.
The proof is similar in the case when f has a local maximum at c. Theorem 5.2 is proved.
Theorem 5.3 (Rolle’s Theorem) Let a, b ∈ R with a < b and f : [a, b] → R. We assume that
3. f (a) = f (b).
f ′ (c) = 0
y = f (x)
a c b x
f (a) = f (b)
Proof of Theorem 5.1. First, we observe that the function f is continuous on the compact set [a, b].
e1 ∈ [a, b] and x
Then by the Extreme Value Theorem, there exist x e2 ∈ [a, b] such that
Then
f (e
x1 ) ⩽ f (x) ⩽ f (e
x2 ) for all x ∈ [a, b]. (5.2)
18
e1 ∈ (a, b) or x
If x e2 ∈ (a, b), then f has a local minimum at x e1 or f has a local maximum at x e2 . By
Theorem 5.2, f ′ (ex1 ) = 0 or f ′ (e
x2 ) = 0, and (5.1) holds for c = xe1 or c = xe2 .
If x
e1 , x
e2 are the endpoints of the interval [a, b], then f (e x1 ) = f (e
x2 ) because f (a) = f (b). By
inequality (5.2), f is a constant function, so f ′ (c) = 0 for any c ∈ (a, b). This completes the proof of
Theorem 5.3.
Example 5.4 (Counterexamples to Theorem 5.3) Let us show that all the three conditions of Rolle’s
theorem are necessary for the theorem to be true.
Counterexample to condition (1). Consider the fractional part f (x) = {x}, on the interval [0, 1].
The derivative of {x} on the open interval (0, 1) is 1 everywhere. In this case, Rolle’s theorem is not
true since f (x) has a discontinuity at x = 1, i.e. f (x) is not continuous everywhere on [0, 1].
Figure 6: Absolute value of x (on the left) and the linear function y(x) = x (on the right).
19
Counterexample to condition (2). Consider the absolute value function f (x) = |x| (see Figure 6)
on the interval [−1, 1]. We have shown that f (x) is not differentiable at x = 0. The function f (x) is
continuous on the closed interval [−1, 1], but there is no point inside the interval (−1, 1) where the
derivative equals zero. The Rolle theorem fails because f (x) is not differentiable over (−1, 1).
Counterexample to condition (3). The linear function f (x) = x is continuous on the closed interval
[0, 1] and differentiable on (0, 1). The derivative of the function is 1 everywhere on the interval. The
Rolle’s theorem is not valid because the 3-rd condition is not satisfied (since f (0) ̸= f (1).)
Now we will make use of Rolle’s Theorem to prove the Mean Value Theorem.
Theorem 5.5 (Mean Value Theorem (Lagrange)) Let a, b ∈ R with a < b and f : [a, b] → R. Let
f be continuous on [a, b] and differentiable on (a, b). Then there is c ∈ (a, b) such that
f (b) − f (a)
f ′ (c) = . (5.3)
b−a
y
f (b)
f (c)
f (b) − f (a)
f ′ (c) = y = f (x)
b−a
a c b x
f (a)
Proof of Theorem 5.5. Consider the linear function passing through the points (a, f (a)) and (b, f (b)):
f (b) − f (a)
g(x) = · (x − a) + f (a).
b−a
Now we introduce the function
f (b) − f (a)
h(x) = f (x) − g(x) = f (x) − (x − a) + f (a) for x ∈ [a, b].
b−a
20
Then
f (b) − f (a)
h(a) = f (a) − · 0 + f (a) = 0,
b−a
and
f (b) − f (a)
h(b) = f (b) − (b − a) + f (a) =
b−a
= f (b) − [f (b) − f (a) + f (a)] = 0.
Thus, h(a) = h(b), and the function h satisfies the assumptions of Theorem 5.3. Therefore, there is
c ∈ (a, b) such that h′ (c) = 0. The derivative of the function h(x) is given by
f (b) − f (a)
h′ (x) = f ′ (x) − .
b−a
Then we have that
f (b) − f (a)
f ′ (c) − = 0.
b−a
This means that (5.3) holds true. The proof of Theorem 5.5 is completed.
Theorem 5.5 which is also called the First Mean Value Theorem enables to describe the increment
of a function on an interval in terms of the derivative of this function at an intermediate point of the
interval.
Geometric interpretation. Lagrange’s Theorem has a clear geometrical interpretation. The chord
which goes through the points of the graph corresponding to the ends of the interval a, b has the slope
given by the formula
f (b) − f (a)
k = tan α = .
b−a
Then there is x = c inside [a, b], where the tangent to the graph is parallel to the chord (see Fig. 7).
Corollary 5.6 In the case when the values of f (x) at the endpoints of [a, b] are equal, i.e. f (a) =
f (b), the Lagrange Theorem implies that there exists c ∈ (a, b) such that
f (b) − f (a)
f ′ (c) = = 0,
b−a
which is the statement of Rolle’s theorem. In this way, the last one is a particular case of Lagrange’s
Theorem.
Corollary 5.7 If the derivative f ′ (x) = 0 at all points of [a, b], then f is constant on [a, b]. In fact,
for any two points x1 and x2 in [a, b], there is a point c ∈ (a, b) such that
21
Remark 2 It is important to note that the fact that f ′ = 0 for x ∈ [a, b] ∪ [c, d] does not imply that f
is (the same) constant on these two intervals. In fact, f might be equal to different constants on these
two intervals.
Solution to Example 5.8. Consider the function f (x) = sin x, x ∈ R. Then f ′ (x) = cos x. Now, we
fix x ∈ R and consider two cases.
Case 1: x > 0. By the Mean Value Theorem applied to f on [0, x], there is c ∈ (0, x) such that
sin x − sin 0
= cos c
x−0
and, consequently,
sin x
= | cos c|.
x
Since | cos c| ⩽ 1 we have that | sin x| ⩽ |x| for all x > 0.
Case 2: x < 0. Applying once more the Mean Value Theorem one can show that there is c ∈ (x, 0)
such that
sin 0 − sin x
= cos c.
0−x
Then
sin x
= | cos c| ⩽ 1
x
and one can conclude that | sin x| ⩽ |x| for all x < 0.
Finally, we observe that the equality holds for x = 0 and we get that | sin x| ⩽ |x| for all x ∈ R.
√ (5 + 2x)
1 + 4x < , x > 2. (5.4)
3
√
Solution to Example 5.9. Consider the function f (x) = 1 + 4x for x ⩾ 2. We compute the
derivative of this function:
4 2
f ′ (x) = √ =√ .
2 1 + 4x 1 + 4x
Let us fix x ∈ R such that x > 2 and apply the Mean Value Theorem to f on [2, x]. Since
f (2) = 3, there is c ∈ (2, x) such that
√
1 + 4x − 3 = f ′ (c)(x − 2).
22
We observe that f ′ (2) = 2/3 and f ′ (c) < f ′ (2) for c > 2. Then
√ 2 √ 2x − 4
1 + 4x − 3 < (x − 2) or 1 + 4x < 3 + .
3 3
Now it is clear that (5.4) is valid.
We formulate also a generalization of the Mean Value Theorem.
Theorem 5.10 (Cauchy’s Theorem) Given a, b ∈ R with a < b, suppose the functions f , g are
continuous on [a, b] and differentiable on (a, b). Then there is c ∈ (a, b) such that
Then
h(a) = f (b)g(a) − f (a)g(b) = h(b),
and the function h verifies all the assumptions of Theorem 5.3. Then there is c ∈ (a, b) such that
h′ (c) = 0. It is clear that
Lemma 6.1 Let f be continuous on [a, b] and differentiable on (a, b). If f ′ (x) = 0 for all x ∈ (a, b),
then f is constant on [a, b].
Proof of Lemma 6.1. The lemma will be proved by contradiction. To this end we suppose that f is
not constant on [a, b]. Then there exist a1 and b1 such that a ⩽ a1 < b1 ⩽ b, and f (a1 ) ̸= f (b1 ). By
the Mean Value Theorem, there is c ∈ (a1 , b1 ) such that
f (b1 ) − f (a1 )
f ′ (c) = ̸= 0,
b 1 − a1
which is a contradiction. This proves Lemma 6.1.
The next result gives direct criteria for the monotonicity of a function in terms of its derivative.
23
y
f ′ (c) > 0
y = f (x)
a c b x
Figure 8: Strictly increasing function: for all points c ∈ (a, b) we have f ′ (c) > 0.
1. if f ′ (x) > 0 for all x ∈ (a, b), then f is strictly increasing on (a, b);
2. if f ′ (x) < 0 for all x ∈ (a, b), then f is strictly decreasing on (a, b).
Proof of Lemma 6.2. Let us prove statement (1). Fix any x1 , x2 ∈ (a, b) with x1 < x2 . By the Mean
Value Theorem, there is c ∈ (x1 , x2 ) such that
f (x2 ) − f (x1 )
= f ′ (c) > 0.
x2 − x1
This means that f (x1 ) < f (x2 ). Therefore, f is strictly increasing on (a, b).
The proof of statement (2) can be done in a similar way.
Example 6.1 Let n ∈ N and f : (0, +∞) → R be defined by f (x) = xn . Then f ′ (x) = n · xn−1 ,
and f ′ (x) > 0 for all x > 0. This implies that f is strictly increasing. In particular, this proves that
every positive real number has exactly one n-th root.
7 L’Hôspital’s Rule
In this section we are dealing with the results which enable to find the limits by using the derivatives.
The theorems given below are known as L’Hôpital’s Rule.
24
Theorem 7.1 Suppose the functions f , g are continuous on [a, b] and differentiable on (a, b) and, in
addition, f (e
x) = g(e e ∈ [a, b]. Suppose also that there is δ > 0 such that g ′ (x) ̸= 0 for
x) = 0, where x
all x ∈ B(e e, where B(e
x; δ) ∩ [a, b], x ̸= x x − δ, x
x; δ) = (e e + δ). Then it holds
f ′ (x) f (x)
if lim =ℓ =⇒ lim = ℓ. (7.1)
x g ′ (x)
x→e x g(x)
x→e
Proof of Theorem 7.1. Let {xk } ⊂ [a, b] that converges to x e and such that xk ̸= x
e for every k. By
Cauchy’s Theorem, for each k, there is a sequence {ck }, with ck between xk and xe, such that
f (xk ) f ′ (ck )
= ′ .
g(xk ) g (ck )
Taking into account that g ′ (x) ̸= 0 for x near x x) = 0, we also have that g(xk ) ̸= 0 for k
e and g(e
sufficiently large. By the Squeeze Theorem, the sequence {ck } converges to x
e. Therefore,
Consequently relation (7.1) follows from the Sequential Characterization of Limits Theorem. Theo-
rem 7.1 is proved.
25
Example 7.3 Calculate the limit
3x3 − 2x2 + 4x − 5
lim .
x→1 4x4 − 2x − 2
Solution to Example 7.3. As in the previous example we apply Theorem 7.1. We introduce the
functions
f (x) = 3x3 − 2x2 + 4x − 5 and g(x) = 4x4 − 2x − 2.
It is clear that f (1) = g(1) = 0. Moreover,
Since the derivative g ′ (1) = 16 − 2 = 14 ̸= 0 and it is a continuous function, then we have that
g ′ (x) ̸= 0 for x near the point 1. It is easy to calculate that
9x2 − 4x + 4 9
lim 3
= .
x→1 16x − 2 14
9
The desired limit is also .
14
If the derivatives of f , g satisfy the assumptions of Theorem 7.1 we can use L’Hospital’s rule to
determine first the limit of f ′ (x)/g ′ (x) and then apply this rule once more in order to find the limit of
f (x)/g(x).
Solution to Example 7.4. In this example f (x) = x2 and g(x) = 1 − cos x. It is clear that these
functions and all their derivatives are continuous. In addition f (0) = g(0) = 0, g ′ (x) = sin x,
g ′ (0) = 0, f ′ (0) = 0 = g ′ (0) and g ′′ (x) = cos x ̸= 0 for x in a neighborhood of 0. Moreover,
f ′′ (x) 2
lim = lim = 2.
x→0 g ′′ (x) x→0 cos x
f ′ (x) f ′′ (x) 2
lim = lim ′′ = lim = 2.
x→0 g ′ (x) x→0 g (x) x→0 cos x
x2 f (x) f ′ (x)
lim = lim = lim ′ = 2.
x→0 1 − cos x x→0 g(x) x→0 g (x)
26
Example 7.5 Consider the functions g(x) = x + 3x2 and f : R → R defined by
2 1
x sin , if x ̸= 0,
x
f (x) =
0, if x = 0.
Solution to Example 7.5. Taking into account the definition of derivative at the point x = 0 one can
see that the function f is differential and its derivative is given by
2x sin 1 − cos 1 , if x ̸= 0,
′ x x
f (x) =
0, if x = 0.
One can observe that limx→0 f ′ (x) does not exist. Then we conclude that L’Hospital’s rule cannot be
applied. However
x2 sin x1
lim does exist.
x→0 x + 3x2
Let us suppose that there exists δ > 0 such that g ′ (x) ̸= 0 for all x ∈ B(e
x; δ) ∩ (a, b), x ̸= x
e.
Then, if ℓ ∈ R and
f ′ (x)
lim = ℓ, (7.2)
x g ′ (x)
x→e
we have that
f (x)
lim = ℓ. (7.3)
x g(x)
x→e
27
Proof of Theorem 7.6. It is given that
Then one can choose δ > 0 such that f (x) ̸= 0 and g(x) ̸= 0 for all x ∈ B(e
x; δ) ∩ (a, b). Let us prove
that
f (x)
lim+ = ℓ.
x→ex g(x)
f (x)
−ℓ <ε
g(x)
f ′ (x) f ′ (x) ε
⩽K and −ℓ < , (7.4)
g ′ (x) ′
g (x) 2
lim f (x) = ∞.
x→e
x
Then we can find δ2 > 0, δ2 < min{δ1 , α − x e} and f (x) ̸= f (α) for x ∈ B+ (e x; δ2 ) ∩ (a, b) =
B+ (e ′
x; δ2 ). Moreover, for such x, taking into account that g (z) ̸= 0 if x < z < α, Rolle’s Theorem
implies that g(x) ̸= g(α). Therefore, for all x ∈ B+ (e
x; δ2 ),
g(α)
f (x) f (x) − f (α) 1 − g(x)
= · . (7.5)
g(x) g(x) − g(α) 1 − f (α)
f (x)
Since
lim f (x) = lim g(x) = ∞,
x→e
x x→e
x
28
Now it follows from Cauchy’s Theorem applied on [x, α] that for any x ∈ B+ (e
x; γ),
[f (x) − f (α)]g ′ (c) = [g(x) − g(α)]f ′ (c),
for some c ∈ (x, α) ∩ B(e
x; δ1 ). For this c, from (7.5), we have that
f (x) f ′ (c)
= ′ Hα (x).
g(x) g (c)
Since c ∈ B(e
x; δ1 ), from (7.4) we obtain that, for x ∈ B+ (e
x; γ),
f ′ (c) f ′ (c) f ′ (c)
f (x)
− ℓ = ′ Hα (x) − ℓ = ′ (Hα (x) − 1) + −ℓ
g(x) g (c) g (c) g ′ (c)
f ′ (c) f ′ (c) ε ε
⩽ ′
|Hα (x) − 1| + ′
−ℓ <K + = ε.
g (c) g (c) 2K 2
Now we set δ0 = γ. This completes the proof of Theorem 7.6.
The next theorem can be proved in the same way as Theorem 7.1.
Theorem 7.8 Suppose that f and g are differentiable on (a, ∞), g ′ (x) ̸= 0 for all x ∈ (a, ∞), and
If ℓ ∈ R, then
f ′ (x) f (x)
lim =ℓ =⇒ lim = ℓ. (7.6)
x→∞ g ′ (x) x→∞ g(x)
29
Example 7.9 Find the limit
1
lim π .
x→∞
x − arctan x
2
Solution to Example 7.9. We divide the numerator and the denominator by x and use Theorem 7.8.
The limit of the quotient of the derivatives is given by
1/x −1/x2 x2 + 1
lim = lim −1 = lim = 1.
x→∞ ( π − arctan x) x→∞ x→∞ x2
2 x2 +1
Theorem 7.10 Let f and g be differentiable on (a, ∞), g ′ (x) ̸= 0 for all x ∈ (a, ∞) and
If ℓ ∈ R, then
f ′ (x) f (x)
lim = ℓ, ⇒ lim = ℓ. (7.7)
x→∞ g ′ (x) x→∞ g(x)
ln x
Example 7.11 Find the limit lim .
x→∞ x
Solution to Example 7.11. In this example we set f (x) = ln x and g(x) = x. These functions satisfy
the conditions of Theorem 7.10. Then we have that
f ′ (x) 1/x 1
lim = lim = lim =0
x→∞ g ′ (x) x→∞ 1 x→∞ x
Example 7.12 Show that the following limit cannot be calculated by L’Hospital’s rule and find it :
x + sin x
lim .
x→∞ x
Solution to Example 7.12. Here
f (x) = x + sin x, g(x) = x and f ′ (x) = 1 + cos x, g ′ (x) = 1.
def def
It is clear that
1 + cos x
lim = lim (1 + cos x) does not exist!!!
x→∞ 1 x→∞
However, we can easily find the desired limit. In fact,
x + sin x sin x
lim = lim 1 + = 1.
x→∞ x x→∞ x
30
8 Taylor’s Theorem
In this section, we will discuss the results which enable to approximate differentiable functions by
polynomials.
Theorem 8.1 (Taylor’s Theorem. Remainder in Lagrange’s form) Let n ∈ N. Suppose f : [a, b] →
R is a function such that its n-th derivative f (n) is a continuous function on [a, b] and (n + 1)-th
derivative f (n+1) exist for all x ∈ (a, b). Let x0 ∈ [a, b]. Then for any x ∈ [a, b] with x ̸= x0 , there is
θ ∈ (x0 , x) (or θ ∈ (x, x0 )) such that
f (n+1) (θ)
f (x) = Pn (x) + (x − x0 )n+1 ,
(n + 1)!
where
n
X f (k) (x0 )
Pn (x) = (x − x0 )k .
k=0
k!
Proof of Theorem 8.1. Consider x0 ∈ [a, b] and fix x ̸= x0 . Since x − x0 ̸= 0, then there is λ ∈ R
such that
λ
f (x) = Pn (x) + (x − x0 )n+1 . (8.1)
(n + 1)!
Our goal is to prove that
λ = f (n+1) (θ), for θ ∈ (x0 , x).
Let us introduce the function
n
X f (k) (t) λ
g(t) = f (x) − (x − t)k − (x − t)n+1 . (8.2)
k=0
k! (n + 1)!
λ
= f (x) − Pn (x) − (x − x0 )n+1 = 0,
(n + 1)!
and
n
X f (k) (x) λ
g(x) = f (x) − (x − x)k − (x − x)n+1
k=0
k! (n + 1)!
n
X f (k) (x) λ
= f (x) − f (x) − (x − x)k − (x − x)n+1
k=1
k! (n + 1)!
= f (x) − f (x) = 0.
31
It follows from Rolle’s theorem that there is θ ∈ (x0 , x) such that g ′ (θ) = 0. Now taking into
account that x is fixed, the independent variable is denoted by t, and applying the product rule for the
derivatives we obtain that
n
f (k+1) (θ) f (k) (θ)
X
′ ′ k k−1
g (θ) = −f (θ) + − (x − θ) + (x − θ)
k=1
k! (k − 1)!
λ λ f (n+1) (θ)
+ (x − θ)n = · (x − θ)n − · (x − θ)n = 0.
n! n! n!
From this relation we conclude that λ = f (n+1) (θ). This completes the proof of Theorem 8.1.
Definition 8.2 The polynomial Pn (x) in Theorem 8.1 is called the n-th Taylor polynomial of the
function f at the point x0 .
Example 8.3 Estimate the error of approximation the function f (x) = sin x with its Taylor polyno-
mial at x0 = 0 on the interval [−1, 1].
|f (4) (θ)| 4 1
|sin x − P3 (x)| = |x| ⩽ ≃ 0, 042.
4! 4!
Theorem 9.1 Let n be an even positive natural number. Suppose f (n) exists and continuous on (a, b).
Consider x0 ∈ (a, b) satisfying
Then
32
1. f (n) (x0 ) > 0 if and only if f has a local minimum at x0 ;
f (x) = x2 cos x.
and
f (0) = f ′ (0) = 0, f ′′ (0) = 2 > 0.
33
In what follows we are going to explain how to find the limits with the help of Taylor’s and
Maclaurin’s Theorems. We start by introducing the big O and small o notations, and then we will
formulate Taylor’s and Maclaurin’s theorems with the remainder in Peano’s form.
34
Here the polynomial Pn (x) is known as the n-th Taylor polynomial of f at x0 and Rn (f, x) is the
remainder in Taylor’s formula.
◦
Lemma 11.1 Suppose that there exist f (n) (x0 ) and f ′ in some deleted neighborhood B (x0 ) of the
◦
point x0 . Then for x ∈ B (x0 ),
(Rn (f, x))′ = Rn−1 (f ′ , x). (11.2)
Theorem 11.1 (Taylor’s formula with Peano’s form of reminder) If f is a function such that its
n-th derivative f (n) (x0 ) at x0 exists then
n
X f (k) (x0 )
f (x) = · (x − x0 )k + o ((x − x0 )n ) as x → x0 . (11.3)
k=0
k!
Proof of Theorem 11.1. The proof of the theorem is done by the method of mathematical induction.
For n = 1 the statement of the theorem holds true. In fact, in this case the function f is differen-
tiable at x0 . Then
Suppose that the statement of the theorem is valid for (n − 1), n ⩾ 1. Let us prove that (11.3) holds
true. We assume for the sake of definiteness that x > x0 and make use of Lagrange’s theorem and
Lemma 11.1. Then
Rn (f, x) = o ((x − x0 )n ) as x → x0 .
35
◦
Theorem 11.2 (Uniqueness of Taylor’s expansion) Suppose that for x ∈ B (x0 ), some function f
is represented by two Taylor’s expansions
and
f (x) = b0 + b1 (x − x0 ) + ... + bn (x − x0 )n + o ((x − x0 )n ) as x → x0 . (11.7)
Then
a0 = b0 , a1 = b1 , ... , an = bn .
Proof of Theorem 11.2. We subtract term by term representation (11.7) from (11.6) and obtain
where c0 = (a0 − b0 ) , c1 = (a1 − b1 ) , ..., cn = (an − bn ). Therefore, in order to prove the theorem
we have to show that
c0 = c1 = ... = cn = 0. (11.9)
Let us pass to the limit in (11.8) as x → x0 . Obviously, c0 = 0. We now divide (11.8) by (x − x0 )
and obtain
c1 + ... + cn · (x − x0 )n−1 = o (x − x0 )n−1
as x → x0 . (11.10)
Passing to the limit in (11.10) as x → x0 , we get c1 = 0. In a similar way we prove that
c2 = ... = cn = 0.
Definition 11.3 Taylor’s formula (11.3) for x0 = 0 is called Maclaurin’s formula with Peano’s form
of remainder, and it takes the form
n
X f (k) (0) k
f (x) = · x + o (xn ) as x → 0. (11.1)
k=0
k!
Now let us present examples of Maclaurin formulae for some common functions.
36
Maclaurin’s Formulae for the Elementary Functions
a) exponential function
x2 xn
ex = 1 + x + + ··· + + o(xn )
2! n!
or n
X xk
ex = + o(xn );
k=0
k!
b) hyperbolic functions
x3 x5 x2n+1
sinh x = x + + + ··· + + o(x2n+2 )
3! 5! (2n + 1)!
or n
X x2k+1
sinh x = + o(x2n+2 );
k=0
(2k + 1)!
2
x x4 x2n
cosh x = 1 + + + ··· + + o(x2n+1 )
2! 4! (2n)!
or n
X x2k
cosh x = + o(x2n+1 );
k=0
(2k)!
c) trigonometric functions
x3 x5 (−1)n x2n+1
sin x = x − + + ··· + + o(x2n+2 )
3! 5! (2n + 1)!
or n
X (−1)k x2k+1
sin x = + o(x2n+2 );
k=0
(2k + 1)!
2
x x4 (−1)n x2n
cos x = 1 − + + ··· + + o(x2n+1 )
2! 4! (2n)!
or n
X (−1)k x2k
cos x = + o(x2n+1 );
k=0
(2k)!
d) power function
n
p
X p
(1 + x) = xk + o(xn ),
k
k=0
where
p p(p − 1)(p − 2) . . . (p − k + 1) p
= and = 1;
k k! 0
In particular,
1
= 1 − x + x2 + · · · + (−1)n xn + o(xn )
1+x
37
or n
1 X
= (−1)k xk + o(xn );
1 + x k=0
1
= 1 + x + x2 + · · · + xn + o(xn )
1−x
or n
1 X
= xk + o(xn );
1 − x k=0
e) logarithmic function
x2 x 3 (−1)n−1 xn
ln(1 + x) = x − + + ··· + + o(xn )
2 3 n
or n
X (−1)k−1 xk
ln(1 + x) = + o(xn )
k=1
k
and n
X xk
ln(1 − x) = − + o(xn ).
k=1
k
Example 12.1 Suppose that f is n times differentiable and g is m times differentiable, and f (n) (0) ̸=
0, g (m) (0) ̸= 0. Find a limit
f (x)
lim , where f (0) = g(0) = 0
x→0 g(x)
and
f (i) (0) = 0, g (j) (0) = 0, i = 1, n − 1, j = 1, n − 1.
f (x) = axn + o(xn )(a ̸= 0) and g(x) = bxm + o(xm )(b ̸= 0).
38
If n > m, then
f (x)
lim = 0. (12.2)
x→0 g(x)
If n < m, then
f (x)
lim = ∞. (12.3)
x→0 g(x)
Example 12.2 Taylor’s formula is often used to calculate the limits of the form
g(x)
lim f (x) ,
x→x0
where
f (x) > 0, lim f (x) = 1, lim g(x) = ∞.
x→x0 x→x0
Solution to Example 12.2. Consider first the case x0 = 0. Suppose that the functions f and g can be
represented as
1
f (x) = 1 + axk + o(xk ) and g(x) = , as x → 0,
bxk + o(xk )
where a ̸= 0, b ̸= 0, k ∈ N. Since
1 axk + o(xk ) a
lim 1 + axk + o(xk ) axk +o(xk ) = e and
lim = ,
x→0 x→0 bxk + o(xk ) b
then g(x) 1
= lim 1 + axk + o(xk ) bxk +o(xk ) =
lim f (x)
x→0 x→0
1 axk +o(xk )
k k axk +o(xk ) bxk +o(xk )
= lim 1 + ax + o(x ) =
x→0
1
axk +o(xk )
( )
axk +o(xk ) ln 1+axk +o(xk )
a
= lim e bxk +o(xk ) = eb . (12.4)
x→0
1 + axk + o(xk )
f (x) = = 1 + (a − a1 )xk + o(xk ), as x → 0
1 + a1 xk + o(xk )
and
1
g(x) = , as x → 0,
bxk + o(xk )
39
where a ̸= 0, a1 ̸= 0, b ̸= 0, k ∈ N, then
g(x) (a−a1 )
lim f (x) =e b . (12.5)
x→0
g(x)
Remark 3 Notice that in order to find the limit of f (x) as x → 0, one can find the limit of the
logarithm of this function limx→0 g(x) ln f (x), and then compute
g(x)
= elimx→0 [g(x) ln f (x)] .
lim f (x)
x→0
Remark 4 If we calculate the limits with Taylor’s formula at some point x0 ̸= 0, one can set
t = x − x0
and reduce our problem to the calculation of the limit at t = 0. The case when x → ∞ can be reduced
to the case t = 0 by the change of variable x = 1/t.
40
Solution to Example 12.3. We observe that the numerator and the denominator in (12.7) go to zero
as x → 0, i.e.,
lim N (x) = lim D(x) = 0.
x→0 x→0
x3 x3
sin x = x − + o(x3 ) and arcsin x = x + + o(x3 ) as x → 0,
6 6
and Maclaurin’s formula becomes
x3 x3 x3
D(x) = arcsin x − sin x = x + −x+ + o(x3 ) = + o(x3 ) as x → 0.
6 6 3
This means that we have to represent N (x) by Maclaurin’s formula with o(x3 ). To this end we make
use of the formulae
√ 1 1 1
1 + t = 1 + t − t2 + t3 + o(t3 ) as t → 0, (12.8)
2 8 16
x3
tan x = x + + o(x3 ) as x → 0. (12.9)
3
Then, from (12.8), (12.9) we get
√ 1 1 1
1 + 2 tan x = 1 + (2 tan x) − (2 tan x)2 + (2 tan x)3 + o (tan x)3
2 8 16
3 2 3
x x x
=1+x+ − + + o(x3 )
3 2 2
x2 5 3
=1+x− + x + o(x3 ) as x → 0. (12.10)
2 6
Recall Maclaurin’s formula for the exponential ex
x 2 x3
ex = 1 + x + + + o(x3 ) as x → 0. (12.11)
2 6
Then we can obtain the representation of the function N (x). Expansions (12.10) and (12.11) yield
√ 2
N (x) = 1 + 2 tan x − ex + x2 = x3 + o(x3 ) as x → 0.
3
In this way the fraction becomes
2x3 /3 + o(x3 )
as x → 0.
x3 /3 + o(x3 )
41
13 Exercises with Solutions
Exercise 13.1 Find the derivatives of sin x and cos x using the definition of the derivative.
Exercise 13.2 Find the derivative of f (x) = |x| using the chain rule.
√ √
Solution to Exercise 13.2. By definition, |x| = x2 . Let us consider the function f (x) = x2 as a
composite, i.e.
√
f = g and g = x2 .
Applying now Theorem 1.10, we obtain that
′ √ ′ 1 x x
|x| = x2 = √ · 2x = √ = .
2 x2 x2 |x|
42
Exercise 13.3 Compute the derivative of function y(x) = 2x ln |x|, x ̸= 0.
e2x + e−2x
= = cosh(2x).
2
x
Exercise 13.5 Find the derivative of y(x) = ln ln .
2
Solution to Exercise 13.5. By the chain rule,
1 2 1 1
y′ = x · · = x.
ln x 2 x ln
2 2
Solution to Exercise 13.6. First we notice that x cannot be a multiple of π, that is x ̸= πk, k ∈ Z.
Using the chain rule, we obtain
1 ′ 1 sin x
y′ = · | sin x| = · · cos x = cot x,
| sin x| | sin x| | sin x|
where sin x ̸= 0 or x ̸= πk (k ∈ Z).
43
√
Exercise 13.7 Find the derivative of y(x) = ln x2 + x4 + 1 .
4x3
1 1
= √ · 2x + · √ =
x2 + x4 + 1 2 x4 + 1
√
x2 x2 + x4 + 1
1 2x
= √ · 2x · 1 + √ = √ · √
x2 + x4 + 1 x4 + 1 x2 + x4 + 1 x4 + 1
2x
=√ .
x4 + 1
In the next Exercises (13.8)-(13.11) we illustrate how the Inverse Function Theorem 2.1 can be
used for computing the derivatives.
√
Exercise 13.8 Find the derivative of the function y(x) = n
x.
√
Solution to Exercise 13.8. Let us find the inverse function for y = n x. To this end, we represent x
in terms of y, i.e.,
√ √ n
y = f (x) = n x ⇒ y n = n x ⇒ x = φ (y) = y n .
Since
φ′ (y) = (y n )′ = n y n−1 ,
√
we plug in y = n
x instead of y to get
√ 1 1
( n x)′ = √ n−1
= √
n
, x > 0,
n( x)
n
n xn−1
i.e.,
√ ′ 1
n
x = √
n
(x > 0) .
n xn−1
44
Solution to Exercise 13.9. The functions ln x and ex are mutually inverse. Then
x = ϕ (y) = ey , x > 0, y ∈ R.
Let us recall that eln x = x. Then the derivative of ln x can be found from the derivative of exponential
function, namely,
1 1 1 1
(ln x)′ = ′ = y ′ = ln x = .
ϕ (y(x)) (e ) y=ln x e x
Thus
1
(ln x)′ = .
x
√
3
Exercise 13.10 Find the derivative of y(x) = x + 1.
Solution to Exercise 13.10. Let us find the inverse function for our function y, which is monotonically
increasing. We have √
y = 3 x + 1 ⇒ y 3 = x + 1 ⇒ x = y 3 − 1.
Let us find the derivative y ′ . We have
√ ′ 1 1
3
x + 1 = y ′ (x) = = =
x′ (y) y=y(x) (y 3 − 1)′ y=y(x)
1 1
= = 2 , x ̸= −1.
3y 2
p
y=y(x) 3
3 (x + 1)
1
Exercise 13.11 Find the derivative of y(x) = arctan .
x
Solution to Exercise 13.11. The inverse function x = x(y) to the function y = y(x) can be found in
a following way:
1 1 1
y = arctan ⇒ = tan y ⇒ x = , where x ̸= 0.
x x tan y
1 1 tan2 y
= · =− .
− tan12 y cos2 y y=y(x) 1 + tan2 y y=y(x)
45
The desired derivative is
′ 1 2
tan2 arctan x1
1 x 1
arctan =− 2 1
=− 2 =− .
x 1 + tan arctan x 1+ 1 1 + x2
x
Let us now use the chain rule and Statement 15 from the Table of Derivatives to compute the same
derivative: ′ ′
1 1 1 1 1 1
arctan = 1 · =− 2 · 1 = − .
x 1 + x2 x x 1 + x2 1 + x2
In the Exercises 13.12-13.16 we are dealing with the higher order derivatives.
Exercise 13.12 Find the derivatives of the n-th order, n = 1, ..., 5 of the function
Now, by (1.2), we can find the derivatives of the n-th order (n = 1, ..., 5):
′
y ′′ = (y ′ ) = (32x3 + 12x2 − 12x − 5)′ = 96x2 + 24x − 12,
′
y (3) = (y ′′ ) = (96x2 + 24x − 12)′ = 192x + 24,
′
y (4) = y (3) = (192x + 24)′ = 192,
′
y (5) = y (4) = (192)′ = 0.
46
Solution to Exercise 13.13. In this exercise we make use of the method of mathematical induction.
We start by computing several derivatives of y = y(x) to make a guess. It follows from (1.2) that
1
y ′ = (ln x)′ =
,
x
′
′′ ′ ′ 1 ′ 1
y = (y ) = = x−1 = −x−2 = − 2 ,
x x
′
′′ ′ 1 2
y (3)
= (y ) = − 2 = 2x−3 = 3 ,
x x
′
′′′ ′ 2 6
y (4)
= (y ) = = −6x−4 = − ,
x3 x4
′
(4) ′ 6 24
(5)
= − 4 = 24x−5 = 5 .
y = y
x x
Our hypothesis is that the derivative of the n-th order has the form
(−1)n−1 (n − 1)!
y (n) = . (13.1)
xn
As it was shown above, for n = 1 the formula (13.1) holds true. Now we differentiate the n-th
derivative to find the derivative of the (n + 1)th order. From (13.1) and (1.2) we obtain that
′
y (n+1) = (−1)n−1 (n − 1)! · x−n = (−1)n−1 (n − 1)! · (−n)x−[n+1] =
(−1)n n!
= ,
x[n+1]
and (13.1) is established.
Solution to Exercise 13.14. We make use of the formula for the derivative of the product and formula
(1.2) to have
1
y ′ = (x ln x)′ = 1 · ln x + x · = ln x + 1,
x
1
y ′′ = (ln x + 1)′ = = x−1 ,
x
′ 1
y (3) = x−1 = −1 · x−2 = − ,
x2
′ 2!
y (4) = −1 · x2 = 2! x−3 = 3,
x
′ 3!
y (5) = 2! x−3 = 2! · (−3) · x−4 = −3! x−4 = − .
x4
47
Thus, our guess is that the n-th derivative, for n ⩾ 2, is given by
Exercise 13.15 Find the n-th order derivative of the function y = sin x.
Solution to Exercise 13.15. In this exercise we apply the method of mathematical induction. To
guess a form for the nth derivative, we compute several derivatives of y = sin x:
π
y ′ = (sin x)′ = cos x = sin x + ,
2
π
y ′′ = (cos x)′ = − sin x = sin x + 2 · ,
2
(3) ′
π
y = (− sin x) = − cos x = sin x + 3 · ,
2
π
y (4) = (− cos x)′ = sin x = sin x + 4 · .
2
Our hypothesis is that the derivative of the n-th order has the form
(n) (n)
nπ
y = (sin x) = sin x + .
2
This result can be proved by the method of mathematical induction.
1
Exercise 13.16 Find the n-th order derivative of the function y(x) = .
x
Solution to Exercise 13.16. As in the previous exercises the proof of the final result is based on the
method of mathematical induction. To this end we have to find the first derivatives of the function
48
y = y(x). Here we make use of the formula (1.2). We have
′
′ 1 (−1)
y = = −x−2 = 2 ,
x x x
′
(−1)2 · 2
′′ 1 −2 ′
−3
y = − 2 = −1 · x x = −1 · (−2) · x = ,
x x x3
!′
2
(−1) · 2 2 −3 ′
y ′′′ = = (−1)3 · 2 · 3 · x−4
3
= (−1) · 2 · x x
x
x
3
(−1) · 2 · 3
= ,
x4 !′
(−1)3 · 2 · 3 3 −4 ′
y (4) = (−1)4 · 4! · x−5
= 4
= (−1) · 2 · 3 · x
x
x
(−1)4 4!
= .
x5
Our hypothesis is that the derivative of the n-th order has the form
(n) (−1)n n!
y = .
xn+1
The proof can be done by the method of mathematical induction, as before.
In the Exercises 13.17-13.19 we are dealing with the derivatives of parametric functions.
where a, b are the semi-axes of the ellipse and t is a parameter. Find the first three derivatives of
y = y(x) with respect to x.
Solution to Exercise 13.17. Consider the first derivative of the function y with respect to x. We have
b 1 b 3
=− 2 3 = − 2 csc t,
a sin t a
49
where
1
csc t = . (13.2)
sin t
(3)
Finally, yxxx is given by
′
′′ ′ − ab2 csc3 t t − ab2 · 3csc2 t · (csc t)′t
(3) (3) (yxx )t
y = yxxx = = =
x′t (a cos t)′t (−a sin t)
x = 1 + t2 , y = t − t3 .
Find the first three derivatives of the function y = y(x) with respect to x at t = 1.
Solution to Exercise 13.18. The first derivative of the function y with respect to x is
′
yt′ (t − t3 )t 1 − 3t2 1 3t
y ′ = yx′ = ′
= ′ = = − .
xt (1 + t2 )t 2t 2t 2
′′′
Finally, yxxx is given by
′
′′ ′ − 4t13 − 4t3 − 14 · (−3t−4 ) − 34 · (−t−2 )
′′′ ′′′ (yxx )t t
y = yxxx = = = =
x′t (1 + t2 )′t 2t
3
4t4
+ 4t32 3 3
= = 5 + 3.
2t 8t 8t
50
At the point where t = 1 we have that
yx′ = −1,
t=1
′′
yxx = −1,
t=1
′′′ 3 3 3 3 3
yxxx = 5
+ 3
= + = .
t=1 8·1 8·1 8 8 4
Solution to Exercise 13.19. The first derivative of the function y with respect to x is
(3)
Finally, yxxx is given by
′′ ′ 1+sin t ′
′′′ (yxx )t 3
cos t t
y (3) = yxxx = =
x′t (1 + sin t)′t
51
′′′
and at the point corresponding to t = 0 yxxx equals
The forthcoming Exercises 13.20-13.23 are devoted to the applications of the Mean Value Theo-
rem.
Exercise 13.20 Consider f (x) = x2 + 2x. Find all ξ ∈ [−2, 0] such that f ′ (ξ) = 0.
Solution to Exercise 13.20. Let us show that the function f (x) satisfies three conditions of Rolle’s
theorem.
1. First, we observe that f (x) is a quadratic function and, therefore, it is continuous in the interval
[−2, 0].
2. Since f (x) is quadratic, then it is differentiable over the open interval (−2, 0).
Now in order to calculate ξ, we have to find the solution to the equation f ′ (ξ) = 0, i.e.,
f ′ (ξ) = 2ξ + 2 = 0.
Exercise 13.21 Given f (x) = x2 − 6x + 5, find all ξ ∈ [2, 4] such that f ′ (ξ) = 0.
Solution to Exercise 13.21. Let us show that the function f (x) satisfies three conditions of Rolle’s
theorem.
1. First, we observe that f (x) is a quadratic function and, therefore, it is continuous in the interval
[2, 4].
2. Since f (x) is quadratic, then it is differentiable over the open interval (2, 4).
52
3. Finally, we have to check the third condition of Rolle’s theorem. In fact
Show that all the conditions of Lagrange’s theorem are fulfilled and find ξ given by the theorem.
Solution to Exercise 13.22. Let us show that the function f (x) satisfies the conditions of Lagrange’s
theorem.
1. First, we observe that f (x) is a quadratic function and, therefore, it is continuous in the interval
[1, 4].
2. Since f (x) is quadratic, then it is differentiable over the open interval (1, 4). Moreover, the
derivative of f is given by
′
f ′ (x) = x2 − 3x + 5 = 2x − 3.
√
f (x) = x+4 on the interval [0, 5].
Show that all the conditions of Lagrange’s theorem are fulfilled and find ξ satisfying the conditions of
this theorem.
53
Solution to Exercise 13.23. Let us show that the function f (x) satisfies the conditions of Lagrange’s
theorem.
This enables to apply Lagrange’s theorem and find the desired point ξ. We have
√ √
′ f (b) − f (a) 1 5+4− 0+4
f (ξ) = ⇒ √ =
b−a 2 ξ+4 5−0
1 1
⇒ √ =
2 ξ+4 5
and
p 5 25
ξ+4= or ξ + 4 = .
2 4
Thus the desired point is the point ξ = 2.25 ∈ (0, 5).
The remaining exercises concern finding limits with the help of Maclaurin’s expansion.
x 2 x3 x 2 x3
ln(1 + x) = x − + + o(x3 ), ln(1 − x) = −x − − + o(x3 ).
2 3 2 3
Then we can find the Maclaurin formula for the denominator of the fraction
2x3
1+x
ln − 2x = ln(1 + x) − ln(1 − x) − 2x = + o(x3 ) as x → 0.
1−x 3
Now we have to represent the numerator of our fraction by the Maclaurin formula with o(x3 ). We
know that
x3
arctan x = x − + o(x3 ) as x → 0 (13.3)
3
and, in addition,
t2 t3
et = 1 + t + + + o(t3 ) as t → 0. (13.4)
2 6
54
We plug (13.3) in (13.4) in order to have
x3 x2 x3
arctan x
e =1+ x− + + + o(x3 ) =
3 2 6
x2 x3
=1+x+ − + o(x3 ) as x → 0. (13.5)
2 6
Now we make use of the following Maclaurin formula
1
= 1 + x + x2 + x3 + o(x3 ) as x → 0.
1−x
Then we can obtain the Maclaurin formula for the numerator of the fraction. Namely,
1 x2 7
earctan x − + = − x3 + o(x3 ) as x → 0.
1−x 2 6
In this way, our fraction can be represented in the following form
− 67 x3 + o(x3 )
2x3
as x → 0.
3
+ o(x3 )
7
This means that the desired limit is equal to − .
4
55
Exercise 13.26 Find the limit
h√ x i1/ sin2 (x−2)
lim 3 − x + ln .
x→2 2
Solution to Exercise 13.26. We set x − 2 = t. Then we obtain
h√ 1/ sin2 t
x i1/ sin2 (x−2) √
t
lim 3 − x + ln = lim 1 − t + ln 1 + .
x→2 2 t→0 2
56