Mean-Value Theorem (Several Variables)
Mean-Value Theorem (Several Variables)
Mean-Value Theorem (Several Variables)
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Mean</strong>-<strong>Value</strong> <strong>Theorem</strong> (<strong>Several</strong> <strong>Variables</strong>) 1<br />
<strong>Mean</strong>-<strong>Value</strong> <strong>Theorem</strong> (<strong>Several</strong> <strong>Variables</strong>)<br />
THEOREM THE MEAN-VALUE THEOREM (SEVERAL VARIABLES)<br />
If f is differentiable at each point of the line segment ab, then there exists on that<br />
line segment a point c between a and b such that<br />
f(b) − f(a) = ∇f(c) · (b − a).<br />
Proof. As t ranges from 0 to 1, a + t(b − a) traces out the line segment ab. The<br />
idea of the proof is to apply the one-variable mean-value theorem to the function<br />
g(t) = f(a + t[b − a]), t ∈ [0, 1].<br />
To show that g is differentiable on the open interval (0, 1), we take t ∈ (0, 1) and<br />
form<br />
g(t + h) − g(t) = f(a + (t + h)[b − a]) − f(a + t[b − a])<br />
= f(a + t[b − a] + h[b − a]) − f(a + t[b − a])<br />
= ∇f(a + t[b − a]) · h[b − a] + o(h[b − a]).<br />
Since<br />
∇f(a + t[b − a]) · h(b − a) = [∇f(a + t[b − a]) · (b − a)]h<br />
and the o(h(b − a)) term is obviously o(h), we can write<br />
g(t + h) − g(t) = [∇f(a + t[b − a]) · (b − a)]h + o(h).<br />
Dividing both sides by h, we see that g is differentiable and<br />
g ′ (t) = ∇f(a + t[b − a]) · (b − a).<br />
The function g is clearly continuous at 0 and at 1. Applying the one-variable meanvalue<br />
theorem to g, we can conclude that there exists a number t 0 between 0 and 1<br />
such that<br />
g(1) − g(0) = g ′ (t 0 )(1 − 0).<br />
Since g(1) = f(b), g(0) = f(a), and g ′ (t 0 ) = ∇f(a + t 0 [b − a]) · (b − a), the above gives<br />
f(b) − f(a) = ∇f(a + t 0 [b − a]) · (b − a).<br />
March 17, 2001
<strong>Mean</strong>-<strong>Value</strong> <strong>Theorem</strong> (<strong>Several</strong> <strong>Variables</strong>) 2<br />
Setting c = a + t 0 [b − a], we have<br />
f(b) − f(a) = ∇f(c) · (b − a).<br />
A nonempty open set U (in the plane or in three-space) is said to be connected if<br />
any two points of U can be joined by a polygonal path that lies entirely in U.<br />
THEOREM. Let U be an open connected set. If ∇f(x) = 0 for all x ∈ U, then f<br />
is constant in U.<br />
Proof. Let a and b be any two points in U. Since U is open and connected, we can<br />
join these points by a polygonal path with vertices a = c 0 , c 1 , · · · , c n−1 , c n = b. By the<br />
mean-value theorem there exist points<br />
c ∗ 1 on c 0 c 1 such that f(c 1 ) − f(c 0 ) = ∇f(c1) ∗ · (c 1 − c 0 ),<br />
c ∗ 2 on c 1 c 2 such that f(c 2 ) − f(c 1 ) = ∇f(c2) ∗ · (c 2 − c 1 ),<br />
c ∗ 3 on c 2 c 3 such that f(c 3 ) − f(c 2 ) = ∇f(c3) ∗ · (c 3 − c 2 ),<br />
· · ·<br />
c ∗ n on c n−1 c n such that f(c n ) − f(c n−1 ) = ∇f(cn) ∗ · (c n − c n−1 ).<br />
If ∇f(x) = 0 for all x in U, then<br />
f(c 1 ) − f(c 0 ) = 0, f(c 2 ) − f(c 1 ) = 0, · · · , f(c n ) − f(c n−1 ) = 0<br />
This shows that<br />
f(a) = f(c 0 ) = f(c 1 ) = f(c 2 ) = · · · = f(c n−1 ) = f(c n ) = f(b).<br />
Since a and b are arbitrary points of U, f must be constant on U.<br />
THEOREM. Let U be an open connected set.<br />
If ∇f(x) = ∇g(x) for all x in U, then f and g differ by a constant on U.<br />
Proof. If ∇f(x) = ∇g(x) for all x in U, then<br />
∇[f(x) − g(x)] = ∇f(x) − ∇g(x) = 0<br />
for all x in U. Therefore, f − g must be constant on U.<br />
March 17, 2001
Continuity on a Closed Region 3<br />
We have seen that a function continuous on an interval skips no values. (The<br />
intermediate-value theorem .) There are analogous results for functions of several<br />
variables. Here is one of them.<br />
THEOREM AN INTERMEDIATE-VALUE THEOREM (SEVERAL VARIABLES)<br />
Suppose that f is continuous on an open connected set U and A < C < B. If,<br />
somewhere on U, f takes on the value A and, somewhere on U, f takes on the value<br />
B, then, somewhere on U, f takes on the value C.<br />
Proof. Let a and b be points of U for which<br />
f(a) = A and f(b) = B.<br />
We must show that there exists a point c in U for which f(c) = C.<br />
Since U is polygonally connected, there is a polygonal path in U<br />
r = r(t), t ∈ [a, b]<br />
that joins a to b. Since r is continuous on [a, b], the composition<br />
g(t) = f(r(t))<br />
is also continuous on [a, b]. Since<br />
g(a) = f(r(a)) = f(a) = A and g(b) = f(r(b)) = f(b) = B,<br />
we know from the intermediate-value theorem of one variable that there is a number c<br />
in [a, b] for which g(t) = C. Setting c = r(t) we have f(c) = C.<br />
Continuity on a Closed Region<br />
An open connected set is called an open region. If we start with an open region and<br />
adjoin to it the boundary, then we have what is called a closed region. (A closed<br />
region is therefore a closed set, the interior of which is an open region.)<br />
Continuity on a closed region Ω requires continuity at the boundary points of Ω as<br />
well as the interior points.<br />
If x 0 is an interior point of Ω, then all points x sufficiently close to x 0 are in Ω and,<br />
by definition, f is continuous at x 0 if as x approaches x 0 , f(x) approaches f(x 0 ).<br />
March 17, 2001
Continuity on a Closed Region 4<br />
If x 0 is a boundary point of Ω, then we have to modify the definition and say: f is<br />
continuous at x 0 if as x approaches x 0 within Ω, f(x) approaches f(x 0 ).<br />
In terms of ɛ − δ, f is continuous at a boundary point x 0 if for each ɛ > 0 there<br />
exists δ> 0 such that<br />
if ‖x − x 0 ‖ < δ and x ∈ Ω, then |f(x) − f(x 0 )| < ɛ.<br />
(This is completely analogous to one-sided continuity at an endpoint of a closed<br />
interval [a, b].)<br />
The intermediate-value result that we just proved for open connected sets can be<br />
extended to closed regions.<br />
THEOREM A SECOND INTERMEDIATE-VALUE THEOREM [SEVERAL VARI-<br />
ABLES]<br />
Suppose that f is continuous on a closed region Ω and A < C < B. If, somewhere<br />
on Ω, f takes on the value A and, somewhere on Ω, f takes on the value B, then,<br />
somewhere on Ω, f takes on the value C.<br />
Proof. Let a and b be points of Ω for which<br />
f(a) = A and f(b) = B.<br />
If a and b are both in the interior of Ω, then the result follows from the previous<br />
theorem. But one or both of these points could lie on the boundary of Ω. To take care<br />
of that possibility, we can proceed as follows. Take ɛ > 0 small enough that<br />
A + ɛ < C < B − ɛ.<br />
By continuity there exist points x 1 , x 2 in the interior of Ω for which<br />
f(x 1 ) < A + ɛ and B − ɛ < f(x 2 ).<br />
Then<br />
f(x 1 ) < C < f(x 2 )<br />
and the result follows from the previous theorem.<br />
March 17, 2001
Chain Rules 5<br />
Chain Rules<br />
For functions of a single variable there is basically only one chain rule. For functions<br />
of several variables there are many chain rules.<br />
A vector-valued function is said to be continuous provided that its components are<br />
continuous. If f = f(x, y, z) is a scalar-valued function (a real-valued function), then<br />
its gradient ∇f is a vector-valued function. We say that f is continuously differentiable<br />
on an open set U if f is differentiable on U and ∇f is continuous on U. If a curve r<br />
lies in the domain of f then we can form the composition<br />
(f ◦ r)(t) = f(r(t)).<br />
The composition f ◦r is a real-valued function of a real variable t. The numbers f(r(t))<br />
are the values taken on by f along the curve r.<br />
THEOREM CHAIN RULE (ALONG A CURVE)<br />
If f is continuously differentiable on an open set U and r = r(t) is a differentiable<br />
curve that lies entirely in U, then the composition f ◦ r is differentiable and<br />
Proof. We will show that<br />
d<br />
dt [(f ◦ r)(t)] = ∇f(r(t)) · r′ (t).<br />
f(r(t + h)) − f(r(t))<br />
lim<br />
h→0 h<br />
= ∇f(r(t)) · r ′ (t).<br />
For h ≠ 0 and sufficiently small, the line segment that joins r(t) to r(t + h) lies entirely<br />
in U. This we know because U is open and r is continuous. For such h, the meanvalue<br />
theorem we just proved assures us that there exists a point c(h) between r(t) and<br />
r(t + h) such that<br />
Dividing both sides by h, we have<br />
f(r(t + h)) − f(r(t)) = ∇f(c(h)) · [r(t + h) − r(t)].<br />
f(r(t + h)) − f(r(t))<br />
h<br />
= ∇f(c(h)) · [<br />
r(t + h) − r(t)<br />
].<br />
h<br />
As h tends to zero, c(h) tends to r(t) and by the continuity of ∇f,<br />
∇f(c(h)) → ∇f(r(t)).<br />
March 17, 2001
Chain Rules 6<br />
Since<br />
the result follows.<br />
r(t + h) − r(t)<br />
h<br />
→ r ′ (t),<br />
Problem 1. Use the chain rule to find the rate of change of<br />
f(x, y) = 1 3 (x3 + y 3 )<br />
with respect to t along the curve r(t) = a cos t i + b sin t j.<br />
Solution. The rate of change of f with respect to t along the curve r is the derivative<br />
d<br />
dt [f(r(t)].<br />
By the chain rule<br />
Here<br />
d<br />
dt [f(r(t)] = ∇f(r(t)) · r′ (t).<br />
∇f = x 2 i + y 2 j.<br />
With x(t) = a cos t and y(t) = b sin t, we have<br />
∇f(r(t)) = a 2 cos 2 t i + b 2 sin 2 t j.<br />
Since r ′ (t) = −a sin t i + b cos t j, we see that<br />
d<br />
dt [f(r(t)] = ∇f(r(t)) · r′ (t)<br />
= (a 2 cos 2 t i + b 2 sin 2 t j) · (−a sin t i + b cos t j)<br />
= −a 3 sin t cos 2 t + b 3 sin 2 t cos t<br />
= sin t cos t (b 3 sin t − a 3 cos t).<br />
Remark. Note that we could have obtained the same result without invoking the<br />
chain rule by first forming f(r(t)) and then differentiating:<br />
f(r(t)) = f(x(t), y(t)) = 1 3 ([x(t)]3 + [y(y)] 3 ) = 1 3 (a3 cos 3 t + b 3 sin 3 t)<br />
so that<br />
d<br />
dt [f(r(t))] = 1 3 [3a3 cos 2 t(− sin t) + 3b 3 sin 2 t cos t]<br />
March 17, 2001
Another Formulation of the Chain Rule 7<br />
= sin t cos t (b 3 sin t − a 3 cos t).<br />
Problem 2. Use the chain rule to find the rate of change of<br />
f(x, y, z) = x 2 y + z cos x<br />
with respect to t along the twisted cubic r(t) = ti + t 2 j + t 3 k.<br />
Solution. Once again we use the relation<br />
d<br />
dt [f(r(t)] = ∇f(r(t)) · r′ (t).<br />
This time<br />
∇f = (2xy − z sin x, x 2 , cos x).<br />
With x(t) = t, y(t) = t 2 , z(t) = t 3 , we have<br />
∇f(r(t)) = (2t 3 − t 3 sin t, t 2 , cos t).<br />
Since r ′ (t) = (1, 2t, 3t 2 ), we have<br />
d<br />
dt [f(r(t)] = ∇f(r(t)) · r′ (t).<br />
= (2t 3 − t 3 sin t, t 2 , cos t) · (1, 2t, 3t 2 )<br />
= 2t 3 − t 3 sin t + 2t 3 + 3t 2 cos t<br />
= 4t 3 − t 3 sin t + 3t 2 cos t.<br />
Another Formulation of the Chain Rule<br />
The chain rule for functions of one variable,<br />
d<br />
dt [u(x(t))] = u′ (x(t))x ′ (t),<br />
can be written<br />
In a similar manner, the relation<br />
du<br />
dt = du dx<br />
dx dt .<br />
d<br />
dt [f(r(t)] = ∇f(r(t)) · r′ (t)<br />
March 17, 2001
Another Formulation of the Chain Rule 8<br />
can be written<br />
With<br />
this equation takes the form<br />
∇u = ( ∂u<br />
∂x , ∂u<br />
du<br />
dt<br />
∂y , ∂u<br />
∂z<br />
= ∇u · dr<br />
dt .<br />
) and<br />
dr<br />
dt = (dx dt , dy<br />
dt , dz<br />
dt ),<br />
du<br />
dt = ∂u dx<br />
∂x dt + ∂u dy<br />
∂y dt + ∂u dz<br />
∂z dt .<br />
In the two-variable case, the z-term drops out and we have<br />
Problem 3. Find du/dt if<br />
du<br />
dt = ∂u dx<br />
∂x dt + ∂u dy<br />
∂y dt .<br />
u = x 2 − y 2 and x = t 2 − 1, y = 3 sin πt.<br />
Solution. Here we are in the two-variable case<br />
du<br />
dt = ∂u dx<br />
∂x dt + ∂u dy<br />
∂y dt .<br />
Since<br />
we have<br />
∂u<br />
∂x<br />
= 2x,<br />
∂u<br />
∂y<br />
du<br />
dt<br />
= −2y and<br />
dx<br />
dt<br />
= 2t,<br />
dy<br />
dt<br />
= (2x)(2t) + (−2y)(3π cos πt)<br />
= 4t 3 − 4t − 18π sin πt cos πt.<br />
= 3π cos πt,<br />
This same result may be obtained by first writing u directly as a function of t and<br />
then differentiating:<br />
so that<br />
u = x 2 − y 2 = (t − 1) 2 − (3 sin πt) 2<br />
du<br />
dt = 4t3 − 4t − 18π sin πt cos πt.<br />
Problem 4. The upper radius of the frustum of a cone is 10 inches, the lower<br />
radius is 12 inches, and the height is 18 inches. At what rate is the volume changing if<br />
the upper radius decreases at the rate of 2 inches per minute, the lower radius increases<br />
March 17, 2001
Other Chain Rules 9<br />
at the rate of 3 inches per minute, and the height decreases at the rate of 4 inches per<br />
minute<br />
Solution. Let x be the upper radius, y the lower radius, and z the height. Then<br />
V = 1 3 πz(x2 + xy + y 2 )<br />
so that<br />
∂V<br />
∂x = 1 ∂V<br />
πz(2x + y),<br />
3 ∂y = 1 ∂V<br />
πz(x + 2y),<br />
3 ∂z = 1 3 π(x2 + xy + y 2 ).<br />
Since<br />
we have here<br />
dV<br />
dt = ∂V dx<br />
∂x dt + ∂V dy<br />
∂y dt + ∂V<br />
∂z<br />
dz<br />
dt ,<br />
Set<br />
we find that<br />
dV<br />
dt = 1 πz(2x + y)dx<br />
3 dt + 1 πz(x + 2y)dy<br />
3 dt + 1 3 π(x2 + xy + y 2 ) dz<br />
dt .<br />
x = 10, y = 12, z = 18, dx<br />
dt<br />
= −2,<br />
dy<br />
dt<br />
dV<br />
dt = −772 3 π.<br />
= 4,<br />
dz<br />
dt = −4<br />
The volume decreases at the rate of 772/3 cubic inches per minute.<br />
Other Chain Rules<br />
In the setting of functions of several variables there are numerous chain rules. They<br />
can all be deduced from the theorem proved above and its corollaries. If, for example,<br />
u = u(x, y), where x = x(s, t) and y = y(s, t),<br />
then<br />
∂u<br />
∂s = ∂u ∂x<br />
∂x ∂s + ∂u ∂y<br />
∂y ∂s<br />
∂u<br />
and<br />
∂t = ∂u ∂x<br />
∂x ∂t + ∂u<br />
∂y<br />
To obtain the first equation, keep t fixed and differentiate u with respect to s according<br />
to the formula in the chain rule; to obtain the second equation, keep s fixed and<br />
differentiate u with respect to t.<br />
∂y<br />
∂t .<br />
March 17, 2001
Other Chain Rules 10<br />
In the figure below we have drawn a tree diagram for the formula. We construct<br />
such a tree by branching at each stage from a function to all the variables that directly<br />
determine it. Each path starting at u and ending at a variable determines a product<br />
of (partial) derivatives. The partial derivative of u with respect to each variable is the<br />
sum of the products generated by all the direct paths to that variable.<br />
Suppose, for example, that<br />
u = u(x, y, z), where x = x(s, t), y = y(s, t), z = z(s, t).<br />
The partials of u with respect to s and t can be read from the diagram:<br />
∂u<br />
∂s = ∂u ∂x<br />
∂x ∂s + ∂u ∂y<br />
∂y ∂s + ∂u ∂z<br />
∂z ∂s<br />
∂u<br />
and<br />
∂t = ∂u ∂x<br />
∂x ∂t + ∂u ∂y<br />
∂y ∂t + ∂u<br />
∂z<br />
Problem 5. (Implicit differentiation) Let u = u(x, y, z) be a continuously differentiable<br />
function, and suppose that the equation<br />
u(x, y, z) = 0<br />
defines z implicitly as a differentiable function of x and y. Show that<br />
if ∂u<br />
∂z<br />
∂z<br />
≠ 0, then<br />
∂x = −∂u/∂x ∂u/∂z<br />
Solution. To be able to apply the chain rule, we write<br />
Then<br />
Since u(s, t, z(s, t)) = 0 for all s and t,<br />
and<br />
∂z<br />
∂y = −∂u/∂y ∂u/∂z .<br />
u = u(x, y, z) with x = s, y = t, z = z(s, t).<br />
∂u<br />
∂s = ∂u ∂x<br />
∂x ∂s + ∂u ∂y<br />
∂y ∂s + ∂u ∂z<br />
∂z ∂s .<br />
∂z<br />
∂t .<br />
∂u<br />
∂s = 0. March 17, 2001
A first-order partial differential equation with constant coefficients 11<br />
Since<br />
we have<br />
If ∂u<br />
∂z<br />
≠ 0, then<br />
∂z<br />
The formula for ∂z<br />
∂y<br />
0 = ∂u<br />
∂x · 1 + ∂u<br />
∂x<br />
∂s<br />
∂y · 0 + ∂u<br />
∂z<br />
= 1 and<br />
∂y<br />
∂s = 0,<br />
∂z<br />
∂s = ∂u<br />
∂x + ∂u<br />
∂z<br />
∂x = −∂u/∂x ∂u/∂z<br />
can be obtained in a similar manner.<br />
∂z<br />
∂s = ∂u<br />
∂x + ∂u ∂z<br />
∂z ∂x .<br />
A first-order partial differential equation with constant<br />
coefficients<br />
Consider the first-order partial differential equation<br />
∂f(x, y) ∂f(x, y)<br />
3 + 2<br />
∂x ∂y<br />
All the solutions of this equation can be found by geometric considerations. We express<br />
= 0.<br />
the left member as a dot product, and write the equation in the form<br />
(3, 2) · ∇f(x, y) = 0.<br />
This tells us that the gradient vector ∇f(x, y) is orthogonal to the vector 3i + 2j at<br />
each point (x, y). But we also know that ∇f(x, y) is orthogonal to the level curves of<br />
f. Hence these level curves must be straight lines parallel to 3i + 2j. In other words,<br />
the level curves of f are the lines<br />
2x − 3y = c.<br />
Therefore f(x, y) is constant when 2x − 3y is constant. This suggests that<br />
f(x, y) = g(2x − 3y)<br />
for some function g.<br />
March 17, 2001
A first-order partial differential equation with constant coefficients 12<br />
Now we verify that, for each differentiable function g, the scalar field f defined by<br />
this equation does, indeed, satisfy the given PDE. Using the chain rule to compute the<br />
partial derivatives of f we find<br />
∂f<br />
∂x = 2g′ (2x − 3y),<br />
∂f<br />
∂y = −3g′ (2x − 3y),<br />
3 ∂f<br />
∂x + 2∂f ∂y = 6g′ (2x − 3y) − 6g ′ (2x − 3y) = 0.<br />
Therefore, f satisfies the PDE as required.<br />
Conversely, we can show that every differentiable f which satisfies the PDE must<br />
necessarily have the form<br />
f(x, y) = g(2x − 3y)<br />
for some g. To do this, we introduce a linear change of variables,<br />
x = Au + Bv,<br />
y = Cu + Dv.<br />
This transforms f(x, y) into a function of u and v, say<br />
h(u, v) = f(Au + Bv, Cu + Dv).<br />
We shall choose the constants A, B, C, D so that h satisfies the simpler equation<br />
∂h(u, v)<br />
∂u<br />
= 0.<br />
Then we shall solve this equation and show that f has the required form. Using the<br />
chain rule we find<br />
Since f satisfies<br />
∂h<br />
∂u = ∂f ∂x<br />
∂x ∂u + ∂f ∂y<br />
∂y ∂u = ∂f<br />
∂x A + ∂f<br />
∂y C.<br />
∂f(x, y) ∂f(x, y)<br />
3 + 2<br />
∂x ∂y<br />
= 0.<br />
we have ∂f/∂y = −(3/2)(∂f/∂x), so the equation for ∂h/∂u becomes<br />
Therefore, h will satisfy ∂h(u,v)<br />
∂u<br />
find<br />
∂h<br />
∂u = ∂f<br />
∂x (A − 3 2 C).<br />
= 0 if we choose A = 3 C. Taking A = 3 and C = 2 we<br />
2<br />
x = 3u + Bv,<br />
y = 2u + Dv.<br />
March 17, 2001
A first-order partial differential equation with constant coefficients 13<br />
For this choice of A and C, the function h satisfies ∂h(u,v)<br />
∂u<br />
of v alone, say<br />
h(u, v) = g(v)<br />
= 0, so h(u, v) is a function<br />
for some function g. To express v in terms of x and y we eliminate u from<br />
x = 3u + Bv,<br />
y = 2u + Dv<br />
and obtain 2x − 3y = (2B − 3D)v. Now we choose B and D to make 2B − 3D = 1,<br />
say B = 2, D = 1. For this choice the transformation<br />
x = Ax + Bv,<br />
y = Cu + Dv.<br />
is nonsingular; we have v = 2x − 3y, and hence<br />
f(x, y) = h(u, v) = g(v) = g(2x − 3y).<br />
This shows that every differentiable solution for<br />
has the form<br />
∂f(x, y) ∂f(x, y)<br />
3 + 2<br />
∂x ∂y<br />
f(x, y) = g(2x − 3y).<br />
Exactly the same type of argument proves the following theorem for first-order<br />
equations with constant coefficients.<br />
THEOREM Let g be differentiable on R, and let f be the scalar field defined on<br />
R 2 by the equation<br />
f(x, y) = g(bx − ay),<br />
where a and b are constants, not both zero.<br />
differential equation<br />
∂f(x, y) ∂f(x, y)<br />
a + b<br />
∂x ∂y<br />
= 0<br />
Then f satisfies the first-order partial<br />
everywhere in R 2 . Conversely, every differentiable solution of this PDE necessarily<br />
has the form for some g.<br />
= 0<br />
March 17, 2001
d’Alembert’s Solution of the Wave Equation 14<br />
The one-dimensional wave equation<br />
Imagine a string of infinite length stretched along the x-axis and allowed to vibrate<br />
in the xy-plane. We denote by y = f(x, t) the vertical displacement of the string at<br />
the point x at time t. We assume that, at time t = 0, the string is displaced along a<br />
prescribed curve, y = F (x). We regard the displacement f(x, t) as an unknown function<br />
of x and t to be determined. A mathematical model for this problem (suggested by<br />
physical considerations) is the partial differential equation<br />
∂ 2 f<br />
∂t 2<br />
= c2 ∂2 f<br />
∂x 2 ,<br />
where c is a positive constant depending on the physical characteristics of the string.<br />
This equation is called the one-dimensional wave equation. We will solve this<br />
equation subject to certain auxiliary conditions.<br />
Since the initial displacement is the prescribed curve y = F (x), we seek a solution<br />
satisfying the condition<br />
f(x, 0) = F (x).<br />
We also assume that ∂y/∂t, the velocity of the vertical displacement, is prescribed at<br />
time t = 0, say<br />
D 2 f(x, 0) = G(x),<br />
where G is a given function. It seems reasonable to expect that this information<br />
should suffice to determine the subsequent motion of the string. We will show that,<br />
indeed, this is true by determining the function f in terms of F and G. The solution is<br />
expressed in a form given by Jean d’Alembert (1717˜ 1783), a French mathematician<br />
and philosopher.<br />
d’Alembert’s Solution of the Wave Equation<br />
<strong>Theorem</strong>. Let F and G be given functions such that G is differentiable and F is twice<br />
differentiable on R. Then the function f given by the formula<br />
f(x, t) =<br />
F (x + ct) + F (x − ct)<br />
2<br />
+ 1 2c<br />
∫ x+ct<br />
x−ct<br />
G(s) ds<br />
March 17, 2001
d’Alembert’s Solution of the Wave Equation 15<br />
satisfies the one-dimensional wave equation<br />
∂ 2 f<br />
∂t 2<br />
= c2 ∂2 f<br />
∂x 2<br />
and the initial conditions<br />
f(x, 0) = F (x),<br />
D 2 f(x, 0) = G(x).<br />
Conversely, any function f with equal mixed partials which satisfies these conditions<br />
necessarily has this form.<br />
Proof. It is a straightforward to verify that the function f given this way satisfies<br />
the wave equation and the given initial conditions. We shall prove the converse.<br />
One way to proceed is to assume that f is a solution of the wave equation, introduce<br />
a linear change of variables,<br />
x = Au + Bv, t = Cu + Dv<br />
which transforms f(x, t) into a function of u and v, say<br />
g(u, v) = f(Au + Bv, Cu + Dv),<br />
and choose the constants A, B, C, D so that g satisfies the simpler equation<br />
∂ 2 g<br />
∂u∂v = 0.<br />
Solving this equation for g we find that g(u, v) = ϕ 1 (u) + ϕ 2 (v), where ϕ 1 (u) is a<br />
function of u alone and ϕ 2 (v) is a function of v alone. The constants A, B, C, D can<br />
be chosen so that u = x + ct, v = x − ct, from which we obtain<br />
f(x, t) = ϕ 1 (x + ct) + ϕ 2 (x − ct).<br />
Then we use the initial conditions to determine the functions ϕ 1 (x) and ϕ 2 (x) in terms<br />
the given functions F and G.<br />
We will obtain this form by another method which makes use of the previous theorem<br />
and avoids the change of variables. First we rewrite the wave equation in the<br />
form<br />
L 1 (L 2 f) = 0,<br />
March 17, 2001
d’Alembert’s Solution of the Wave Equation 16<br />
where L 1 and L 2 are the first-order linear differential operators given by<br />
L 1 = ∂ ∂t − c ∂<br />
∂x ,<br />
L 2 = ∂ ∂t + c ∂<br />
∂x .<br />
Let f be a solution of L 1 (L 2 f) = 0 and let<br />
u(x, t) = L 2 f(x, t).<br />
Equation L 1 (L 2 f) = 0 states that u satisfies the first-order equation L 1 (u) = 0. Hence,<br />
by the previous theorem we have<br />
u(x, t) = ϕ(x + ct)<br />
for some function ϕ. Let Φ be any primitive of ϕ, say Φ(y) = ∫ y<br />
0<br />
We will show that L 2 (v) = L 2 (f). We have<br />
v(x, t) = 1 Φ(x + ct).<br />
2c<br />
ϕ(s) ds, and let<br />
∂v<br />
∂x = 1 2c Φ′ (x + ct) and<br />
∂v<br />
∂t = 1 2 Φ′ (x + ct),<br />
so<br />
L 2 v = ∂v<br />
∂t + c∂v ∂x = Φ′ (x + ct) = u(x, t) = L 2 f.<br />
In other words, the difference f − v satisfies the first-order equation<br />
L 2 (f − v) = 0.<br />
By the previous theorem we must have f(x, t) − v(x, t) = ψ(x − ct) for some function<br />
ψ. Therefore<br />
f(x, t) = v(x, t) + ψ(x − ct) = 1 Φ(x + ct) + ψ(x − ct).<br />
2c<br />
This proves that<br />
f(x, t) = ϕ 1 (x + ct) + ϕ 2 (x − ct)<br />
with ϕ 1 = 1 2c Φ and ϕ 2 = ψ.<br />
Now we use the initial conditions<br />
f(x, 0) = F (x),<br />
D 2 f(x, 0) = G(x).<br />
March 17, 2001
d’Alembert’s Solution of the Wave Equation 17<br />
(9.13) to determine the functions ϕ 1 and ϕ 2 in terms of the given functions F and G.<br />
The relation f(x, 0) = F (x) implies<br />
ϕ 1 (x) + ϕ 2 (x) = F (x).<br />
The other initial condition, D 2 f(x, 0) = G(x), implies<br />
cϕ ′ 1(x) − cϕ ′ 2(x) = G(x).<br />
Differentiating we obtain<br />
Solving these we find<br />
ϕ ′ 1(x) + ϕ ′ 2(x) = F ′ (x).<br />
ϕ ′ 1(x) = 1 2 F ′ (x) + 1 2c G(x),<br />
ϕ′ 2(x) = 1 2 F ′ (x) − 1 2c G(x).<br />
Integrating these relations we get<br />
ϕ 1 (x) − ϕ 1 (0) =<br />
F (x) − F (0)<br />
2<br />
+ 1 2c<br />
∫ x<br />
0<br />
G(s) ds,<br />
ϕ 2 (x) − ϕ 2 (0) =<br />
F (x) − F (0)<br />
2<br />
− 1 2c<br />
∫ x<br />
0<br />
G(s) ds.<br />
In the first equation we replace x by x+ct; in the second equation we replace x by x−ct.<br />
Then we add the two resulting equations and use the fact that ϕ 1 (0) + ϕ 2 (0) = F (0)<br />
to obtain<br />
f(x, t) = ϕ 1 (x + ct) + ϕ 2 (x + ct) =<br />
F (x + ct) + F (x − ct)<br />
2<br />
+ 1 2c<br />
∫ x+ct<br />
x−ct<br />
G(s) ds<br />
This completes the proof.<br />
EXAMPLE. Assume the initial displacement is given by the formula<br />
F (x) =<br />
⎧<br />
⎪ ⎨<br />
⎪⎩<br />
1 + cos πx for − 1 ≤ x ≤ 1<br />
0 for |x| > 1.<br />
Suppose that the initial velocity G(x) = 0 for all x. Then the resulting solution of the<br />
wave equation is given by the formula<br />
f(x, t) =<br />
F (x + ct) + F (x − ct)<br />
.<br />
2<br />
March 17, 2001