02.08.2013 Views

Chapter 7. Induction and Recursion Part 1. Mathematical Induction ...

Chapter 7. Induction and Recursion Part 1. Mathematical Induction ...

Chapter 7. Induction and Recursion Part 1. Mathematical Induction ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Chapter</strong> <strong>7.</strong> <strong>Induction</strong> <strong>and</strong> <strong>Recursion</strong><br />

<strong>Part</strong> <strong>1.</strong> <strong>Mathematical</strong> <strong>Induction</strong><br />

The principle of mathematical induction is this: to establish an infinite sequence of<br />

propositions<br />

P1, P2, P3, . . . , Pn, . . .<br />

(or, simply put, Pn (n ≥ 1)), it is enough to verify the following two things<br />

(1) P1, <strong>and</strong><br />

(2) Pk ⇒ Pk+ 1; (that is, assuming Pk vaild, we can rpove the validity of Pk+ 1).<br />

These two things will give a “domino effect” for the validity of all Pn (n ≥ 1). Indeed,<br />

step (1) tells us that P1 is true. Applying (2) to the case n = 1, we know that P2 is<br />

true. Knowing that P2 is true <strong>and</strong> applying (2) to the case n = 2 we know that P3 is<br />

true. Continue in this manner, we see that P4 is true, P5 is true, etc. This argument<br />

can go on <strong>and</strong> on. Now you should be convinced that Pn is true, no matter how big is n.<br />

We give some examples to show how this induction principle works.<br />

Example <strong>1.</strong> Use mathematical induction to show 1 + 3 + 5 + · · · + (2n − 1) = n 2 .<br />

(Remember: in mathematics, “show” means “prove”.)<br />

Answer: For n = 1, the identity becomes 1 = 1 2 , which is obviously true. Now<br />

assume the validity of the identity for n = k:<br />

1 + 3 + 5 + · · · + (2k − 1) = k 2 .<br />

For n = k + 1, the left h<strong>and</strong> side of the identity is<br />

1 + 3 + 5 + · · · + (2k − 1) + (2(k + 1) − 1) = k 2 + (2k + 1) = (k + 1) 2<br />

which is the right h<strong>and</strong> side for n = k + <strong>1.</strong> The proof is complete.<br />

1


Example 2. Use mathematical induction to show the following formula for a geometric<br />

series:<br />

1 + x + x 2 + · · · + x n =<br />

1 1 − xn+<br />

. (x = 0) (<strong>7.</strong>1)<br />

1 − x<br />

Answer: For n = 1, the formula becomes 1 = 1−x<br />

1−x , which holds trivially. Now we<br />

assume the validity of the identity for n = k: 1 + x + x2 + · · · + xk = (1 − xk+ 1 )/(1 − x).<br />

For n = k + 1, the left h<strong>and</strong> side of the formula is<br />

1 + x + x 2 + · · · + x k + x k+ 1 1 1 − xk+<br />

=<br />

1 − x<br />

+ xk+<br />

1<br />

= 1 − xk+ 1<br />

1 − x + xk+ 1 (1 − x)<br />

=<br />

1 − x<br />

1 − xk+ 1<br />

1 − x + xk+ 1 − x<br />

1 − x<br />

k+ 2<br />

= 1 − xk+ 2<br />

1 − x<br />

which is the right h<strong>and</strong> side for n = k + <strong>1.</strong> So the formula is valid for general n.<br />

Exercise <strong>1.</strong> Prove the following identity by induction:<br />

1 + 2 + 3 + · · · + n =<br />

Exercise 2. Prove the following identity by induction:<br />

1 2 + 2 2 + 3 2 + · · · + n 2 =<br />

Exercise 3. Prove the following identity by induction:<br />

n(n + 1)<br />

2<br />

1 3 + 2 3 + 3 3 + · · · + n 3 = (1 + 2 + 3 + · · · + n) 2<br />

(<strong>7.</strong>2)<br />

n(n + 1)(2n + 1)<br />

. (<strong>7.</strong>3)<br />

6<br />

(According to Exercise 1, you only need to prove 1 3 + 2 3 + 3 3 + · · · + n 3 =<br />

2 n(n+ 1)<br />

2 .)<br />

Not every identity depending on n can be h<strong>and</strong>led by induction, as indicated in the<br />

following exercise.<br />

Exercise 4. Try to prove the following identity by induction:<br />

a n+ 1 − b n+ 1 = (a − b)(a n + a n−1 b + a n−2 b 2 + · · · + ab n−1 + b n ). (<strong>7.</strong>4)<br />

After failing to do so, try to prove this by multiplying out the right h<strong>and</strong> side.<br />

2


Sometimes we can use induction to statements depending on a parameter n of natural<br />

numbers, other than identities, as shown in the following example:<br />

Example 3. The celebrated Cauchy-Schwarz inequality says<br />

(a1b1 + a2b2 + · · · + anbn) 2 ≤ (a 2 1 + a 2 2 + · · · + a 2 n)(b 2 1 + b 2 2 + · · · + b 2 n) (<strong>7.</strong>5)<br />

Check this for n = 1, 2. Then use induction to prove this inequality for general n.<br />

Answer. When n = 1, the inequality reads (a1b1) 2 ≤ a2 1b22 . This is clear: in fact,<br />

(a1b1) 2 = a2 1b2 2. Next we verify this inequality for n = 2. Here we rewrite this inequality<br />

in different symbols because we need it in the inductive step:<br />

(A1B1 + A2B2) 2 ≤ (A 2 1 + A22 )(B2 1 + B2 2 ), (∗)<br />

that is, A 2 1B 2 1 + 2A1B1A2B2 + A 2 2B 2 2 ≤ A 2 1B 2 1 + A 2 1B 2 2 + A 2 2B 2 1 + A 2 2B 2 2. So it is enough to<br />

check<br />

2A1B1A2B2 ≤ A 2 1 B2 2 + A2 2 B2 1 ,<br />

which can be recast as A2 1B2 2 − 2A1B1A2B2 + A2 2B2 1 ≥ 0. But<br />

A 2 1 B2 2 − 2A1B1A2B2 + A 2 2 B2 1 = (A1B2 − A2B1) 2<br />

which is clearly ≥ 0. Thus (∗) is valid. Now we assume the validity of<br />

(a1b1 + a2b2 + · · · + anbn) 2 ≤ (a 2 1 + a22 + · · · + a2n )(b2 1 + b22 + · · · + b2n ) (∗∗)<br />

for n = k. Notice that the left h<strong>and</strong> side of (∗∗) increases or remains the same if all<br />

numbers in it are replaced by their absolute values, while the right h<strong>and</strong> side remains the<br />

same. Hence we may (<strong>and</strong> we do) assume that aj ≥ 0 <strong>and</strong> bj ≥ 0 for all j = 1, . . . , n.<br />

By the induction hypothesis, (a1b1 + · · · + akbk) 2 ≤ (a2 1 + · · · + a2 k )(b21 + · · · + b2 k ), or<br />

a1b1 + a2b2 + · · · + akbk ≤ A1B1,<br />

where A1 =<br />

<br />

a 2 1 + a2 2 + · · · + a2 k , <strong>and</strong> B1 =<br />

Now let A2 = ak+ 1 <strong>and</strong> B2 = bk+ 1 <strong>and</strong> use (∗) to obtain<br />

(a1b1 + · · · + akbk + ak+ 1bk+ 1) 2 ≤ (A1B1 + ak+ 1bk+ 1) 2<br />

≡ (A1B1 + A2B2) 2 ≤ (A 2 1 + A 2 2)(B 2 1 + B 2 2)<br />

<br />

b 2 1 + b2 2 + · · · + b2 k .<br />

= (a 2 1 + a2 2 + · · · + a2 k + a2 k+ 1 )(b2 1 + b2 2 + · · · + b2 k + b2 k+ 1 ),<br />

3


<strong>and</strong> hence (∗∗) is also valid for n = k + <strong>1.</strong><br />

Question 5. What is wrong with the following “proof” of “n = n + 1”?<br />

“Assume that n = n + 1 holds for n = k, that is, k = k + <strong>1.</strong> Add both sides by 1,<br />

we get k + 1 = k + 1 + 1, which shows that the identity n = n + 1 also holds for<br />

n = k + <strong>1.</strong> By the principle of induction, n = n + 1 is valid for all n.”<br />

Question 6. What is wrong with the following “proof” of the statement “every man is<br />

bald” by induction?<br />

“All we need to do is to establish the statement ‘a man with n hairs or less is bald’<br />

for all n. When n = 0, 1 or 2, the statement is clearly valid. Now assume that the<br />

statement is true for n = k, that is, assume that “a man with k hairs or less is<br />

bald” is valid. Then the statement n = k + 1 is also valid because an additional hair<br />

cannot change the fact of being bald. So by the principle of induction, the statement<br />

‘a man with n hairs is bald’ is valid for all n. This proves all men are bald.”<br />

An important role played by induction is to give proper definitions for many mathe-<br />

matical expressions involving natural numbers. The usual pattern (or its variant) of this<br />

definition by induction or by recursion is such: to define a sequence {f(n)}n≥1, we<br />

take the following two steps:<br />

(I) Specify what f(1) is,<br />

(II) Specify how f(n + 1) can be decided from a subset of f(1), f(2), . . . , f(n).<br />

Example 4. You know the meanings of the expressions a n , <strong>and</strong> n!. Strictly speaking,<br />

they should be defined by induction. We write down the two steps (I), (II) described above<br />

for defining each of them:<br />

a 1 = a; a n+ 1 = a n a. (With f(n) = a n , f(1) = a <strong>and</strong> f(n + 1) = f(n)f(1).)<br />

1! = 1; (n+1)! = n! (n+1). (With f(n) = n!, f(1) = 1 <strong>and</strong> f(n+1) = f(n)(n+1).)<br />

Certainly, na is also defined by induction: 1a = a <strong>and</strong> (n + 1)a = na + a.<br />

4


Example 5. This is an example of great importance about an expression involving the<br />

summation symbol that can be defined recursively, given as follows<br />

n<br />

ak.<br />

k= 1<br />

We read the expression as “the sum of all terms ak, where k runs from 1 to n. (Earlier<br />

we write such an expression simply as a1 + a2 + · · · + an.) Many identities <strong>and</strong> exercises<br />

above can be recast nicely in the format using the summation symbol, for example,<br />

n<br />

k= 1 (2k − 1) = n2 , n<br />

k= 1 k3 =<br />

n<br />

k= 1 k<br />

2 etc.; (see Example 1 <strong>and</strong> Exercise 3 above). The recursive definition of n<br />

k= 1 ak is<br />

1<br />

k= 1 ak = a1; n+ 1<br />

k= 1 ak =<br />

n<br />

k= 1 ak<br />

(with f(n) = n<br />

k= 1 ak, f(1) = a1 <strong>and</strong> f(n + 1) = f(n) + an+ <strong>1.</strong>)<br />

<br />

,<br />

+ an+ 1<br />

Example 6. The famous Fibonacci numbers Fn (n = 0, 1, 2, 3, . . . ) are defined by<br />

induction (or recursion):<br />

Fn = Fn−1 + Fn−2, n ≥ 2, (<strong>7.</strong>6)<br />

which says, starting from the third term, every term in the sequence of Fibonacci numbers<br />

is the sum of the previous two terms. In order to determine this sequence, we have to<br />

specify the first two terms. Here they are: F0 = 1 <strong>and</strong> F1 = <strong>1.</strong> It is easy to produce a<br />

few terms at the beginning of the sequence<br />

1, 1, 2, 3, 5, 8, 13, 21, 34, . . .<br />

(This sequence was originally described by the inventor as how rabbits multiply: Fn is<br />

the number of pairs of rabbits on the nth day. There is an interesting story of this but it<br />

does not concern us here.) The mathematical question here is, how can we find a closed<br />

expression for Fn for general n? There are many ways to answer this question, such as:<br />

treat it as a difference equation, or rewrite the recursion relation as a matrix identity<br />

<br />

Fn 1 1 Fn−1<br />

=<br />

Fn−1 1 0 Fn−2<br />

<strong>and</strong> use the method of diagonalization in linear algebra, or use generating functions.<br />

5


** Appendix (may be omitted) . We briefly describe the last method here. Form the<br />

power series f(x) = ∞<br />

n= 0 Fnx n . It is called the generating function for the Fibonacci<br />

sequence. It converges when |x| is small enough. Now<br />

f(x) = F0 + F1x + <br />

n≥2 Fnx n = F0 + F1x + <br />

n≥2 (Fn−1 + Fn−2) x n<br />

= 1 + x + <br />

n≥2 Fn−1 x n + <br />

n≥2 Fn−2 x n<br />

= 1 + x + <br />

k≥1 Fk x k+ 1 + <br />

ℓ≥0 Fℓ<br />

ℓ+ 2<br />

x (k = n − 1, ℓ = n − 2)<br />

<br />

= 1 + x + x<br />

k≥1 Fk x k<br />

<br />

2<br />

+ x<br />

ℓ≥0 Fℓ x ℓ<br />

= 1 + x + x(f(x) − 1) + x 2 f(x) = 1 + xf(x) + x 2 f(x).<br />

So we have (1 − x − x 2 )f(x) = <strong>1.</strong> We can factorize the polynomial 1 − x − x 2 as<br />

1 − x − x 2 = (1 − r+ x)(1 − r−x), where r± = (1 ± √ 5)/2 (with r+ + r− = 1 <strong>and</strong><br />

r+ r− = −1).<br />

Now<br />

1<br />

1<br />

f(x) =<br />

=<br />

1 − x − x2 (1 − r+ x)(1 − r−x) =<br />

<br />

r+ r− 1<br />

−<br />

1 − r+ x 1 − r−x<br />

<br />

= r+<br />

n≥0 rn + x n <br />

− r−<br />

n≥0 rn − x n 1<br />

√5 = 1 <br />

1<br />

√ (rn+ + − r<br />

5 n≥0<br />

Comparing this with f(x) = <br />

n≥0 Fn xn , we obtain<br />

Fn = 1 n+ 1 n+ 1<br />

√ (r+ − r− ) ≡<br />

5 1<br />

√<br />

5<br />

<br />

r+ − r−<br />

1 + √ n+ 1<br />

5<br />

−<br />

2<br />

1<br />

<br />

1 −<br />

√<br />

5<br />

√ n+ 1<br />

5<br />

.<br />

2<br />

n+ 1<br />

− ) x n .<br />

The amazing thing here is the appearance of the irrational number √ 5 in this answer for<br />

Fn, which is always an integer. (End of the appendix **)<br />

Exercise <strong>7.</strong> Use the recursion relation for Fibonacci numbers to prove the following<br />

identity<br />

F0 + F1 + F2 + · · · + Fn−1 = Fn+ 1 − 1, (n ≥ 1).<br />

Exercise 8. Prove the amazing identity Fm+ n = FmFn + Fm−1Fn−1 by induction.<br />

<strong>Recursion</strong> is an important notion for constructing so called recursive functions,<br />

which is a key notion in logic <strong>and</strong> theoretical computer science.<br />

6


<strong>Part</strong> 2. Polynomials: Taylor’s formula, binomial formula, etc.<br />

By a polynomial we mean a function of the form<br />

p(x) = anx n + an−1x n−1 + · · · + a2x 2 + a1x + a0 ≡ n<br />

k= 0<br />

akx k<br />

where a0, a1, a2, . . . , an are some constants, called coefficients of p(x). When an = 0,<br />

we say that the degree of p(x) is n. Now let us take an arbitrary constant a. Then<br />

p(x + a) as a function of x is a polynomial of the same degree as that of p(x). However,<br />

the degree of p(x + a) − p(x) is lower by 1, since the highest power terms in p(x + a)<br />

<strong>and</strong> p(x) are canceled out.<br />

Example <strong>7.</strong> Consider the polynomial p(x) = 2x 2 + 3x + 1 of degree 2. Then, for<br />

any constant a,<br />

p(x + a) = 2(x + a) 2 + 3(x + a) + 1 = 2(x 2 + 2ax + a 2 ) + 3(x + a) + 1<br />

= 2x 2 + (4a + 3)x + (2a 2 + 3a + 1)<br />

which is also a polynomial of degree 2. So p(x + a) − p(x) = 4ax + (2a 2 + 3a), which is a<br />

polynomial of degree <strong>1.</strong><br />

The derivative of a polynomial p(x), denoted by p ′ d (x) or p(x), is defined to be<br />

dx<br />

the limit of the expression (p(x + h) − p(x))/h, called a difference quotient” as h tends<br />

to zero. Thus we write<br />

d<br />

dx p(x) ≡ p′ (x) = lim<br />

h→ 0<br />

p(x + h) − p(x)<br />

.<br />

h<br />

Suppose that the degree of p(x) is n. Then the degree of the difference p(x + h) − p(x)<br />

drops by 1 <strong>and</strong> hence we expect that the degree of p ′ (x) is n − <strong>1.</strong> When p(x) is of<br />

degree zero, in other words, when p(x) is a constant function, say p(x) ≡ C, we have<br />

d<br />

C = 0.<br />

dx<br />

Simply put, the derivative of a constant function is zero.<br />

Example 8. Consider the polynomial p(x) = 2x 2 + 3x + <strong>1.</strong> From the previous<br />

example we know that p(x + a) − p(x) = 4ax + (2a 2 + 3a). Replace a by h to get<br />

p(x + h) − p(x) = 4hx + (2h 2 + 3h) = h(4x + 2h + 3). So<br />

p ′ p(x + h) − p(x)<br />

(x) = lim<br />

h→ 0 h<br />

= lim<br />

h→ 0 (4x + 2h + 3) = 4x + 3.<br />

7


We will develop some general rules to obtain this answer within seconds.<br />

The following are two basic rules for computing derivatives. For simplicity, we write<br />

u <strong>and</strong> v for two polynomials (instead of u(x) <strong>and</strong> v(x)) <strong>and</strong> a, b as two constants:<br />

Linearity: (au + bv) ′ = au ′ + bv ′ .<br />

Product rule: (uv) ′ = u ′ v + uv ′ .<br />

They can be checked as follows. For linearity:<br />

(au + bv) ′ (x) = lim<br />

h→ 0<br />

= lim<br />

h→ 0<br />

[au(x + h) + bv(x + h)] − [au(x) + bv(x)]<br />

h<br />

<br />

<br />

u(x + h) − u(x) v(x + h) − v(x)<br />

a + b = au<br />

h<br />

h<br />

′ (x) + bv ′ (x).<br />

For the product rule, we need a clever trick of inserting appropriate terms used first time<br />

by Leibnitz:<br />

(uv) ′ (x) = lim<br />

h→ 0<br />

= lim<br />

h→ 0<br />

= lim<br />

h→ 0<br />

u(x + h)v(x + h) − u(x)v(x)<br />

h<br />

<br />

u(x + h)v(x + h) − u(x)v(x + h)<br />

+<br />

h<br />

u(x)v(x + h) − u(x)v(x)<br />

<br />

h<br />

<br />

u(x + h) − u(x)<br />

v(x + h) − v(x)<br />

v(x + h) + u(x)<br />

h<br />

h<br />

= u ′ (x)v(x) + u(x)v ′ (x).<br />

(Note: In the mathematical literature this rule is commonly known as Leibnitz rule. “Prod-<br />

uct rule” is used in textbooks for the convenience of beginners.)<br />

The following identity is a basic formula for differentiating polynomials<br />

d<br />

dx xn = nx n−1<br />

(n ≥ 0) (<strong>7.</strong>7)<br />

Together with linearity, we can find the derivative of a given polynomial instantly. For<br />

example, the derivative of p(x) = 2x 2 + 3x + 1 (Example 8 above) is p ′ (x) = 2(2x) +<br />

3. 1 + 0 = 4x + 3.<br />

Example 9. Prove (<strong>7.</strong>7) by induction.<br />

Solution. For n = 0, x n is the constant function <strong>1.</strong> In this case the identity becomes<br />

d 1 = 0, which is valid – as we know that the derivative of a constant function vanishes.<br />

dx<br />

For n = 1, we have<br />

dx<br />

dx<br />

(x + h) − x<br />

= lim<br />

h→ 0 h<br />

h<br />

= lim<br />

h→ 0 h<br />

8<br />

= lim<br />

h→ 0 1 = 1 ≡ <strong>1.</strong> x0 .


Hence the identity is valid for n = 1 as well. Now assume the validity of the identity for<br />

n = k, that is, d<br />

dx xk = kx k−1 . By means of the product rule, we have<br />

d<br />

dx xk+ 1 = d<br />

dx xk x =<br />

d<br />

dx xk<br />

<br />

x + x k<br />

d<br />

dx x<br />

<br />

= kx k−1 x + x k = (k + 1)x k ,<br />

showing the validity of the identity for n = k + <strong>1.</strong> The induction principle tells us that<br />

the identity holds for all n ≥ 0.<br />

The derivative of the derivative p ′ (x) of a polynomial p(x) is called the second<br />

derivative of p(x), <strong>and</strong> is denoted by p ′′ (x) or p (2) (x). The derivative of p (2) (x) is<br />

called the third derivative of p(x) <strong>and</strong> is denoted by p (3) (x). In general, we can define<br />

the nth derivative p (n) (x) of a polynomial p(x) recursively as follows:<br />

p (0) (x) = p(x), p (k+ 1) (x) = d<br />

dx p(k) (x).<br />

For example, with p(x) = 2x 2 + 3x + 1 (Example 8 above) we have p (1) (x) = 4x + 3,<br />

p (2) (x) = 4, p (3) (x) = 0.<br />

Exercise 9. In each of the following cases, find all derivatives of a given polynomial p(x):<br />

(a) p(x) = 3 + 4x + 2x 2 , (b) p(x) = 7 + 3x + 2x 2 + x 3 , (c) p(x) = 1<br />

15 x5 + 1<br />

12 x4 .<br />

Let f(x) be an arbitrary polynomial <strong>and</strong> let a be any number. Then f(x + a) is<br />

a polynomial in x of the same degree <strong>and</strong> hence we can put it in the form f(x + a) =<br />

n k= 0 ckxk . Replace x by x − a through out the last identity, we get<br />

n<br />

f(x) = ck(x − a) k ≡ c0 + c1(x − a) + c2(x − a) 2 + · · · + an(x − a) n . (<strong>7.</strong>8)<br />

k= 0<br />

To determine the constant term c0, we let x = a in (<strong>7.</strong>8) to get f(a) = c0. To determine<br />

c1, we take the derivatives of both sides of (8) first:<br />

f ′ (x) = c1 + c2. 2(x − a) + · · · + cn. n(x − a) n−1<br />

<strong>and</strong> then we substitute x = a to get p ′ (a) = c<strong>1.</strong> To get an expression for cj for general j<br />

(j ≤ n), we need to differentiate both sides of (<strong>7.</strong>8) j times. This leads us to the question<br />

of finding the jth derivative of (x − a) k . The answer to this question is<br />

dj dxj (x − a)k ⎧<br />

k!<br />

⎨ (k−j)!<br />

=<br />

⎩<br />

(x − a)k−j if j < k ;<br />

k! if j = k ;<br />

0 if j > k .<br />

9<br />

(<strong>7.</strong>9)


You have to convince yourself that (<strong>7.</strong>9) is correct.<br />

Exercise 10. Verify (<strong>7.</strong>9) for k = 4 <strong>and</strong> j = 1, 2, 3, 4.<br />

When j = k, the right h<strong>and</strong> side of (<strong>7.</strong>9) either has a factor of x − a or is equal to zero,<br />

<strong>and</strong> therefore its value is zero at x = a. When j = k, it becomes k! which is the same<br />

as j!. Thus we have<br />

dj (x − a)k<br />

dxj <br />

<br />

<br />

x= a<br />

=<br />

j! if j = k ;<br />

0 if j = k .<br />

(<strong>7.</strong>10)<br />

Now take the jth derivatives of both sides of (<strong>7.</strong>8) <strong>and</strong> then evaluate at x = a. The left<br />

h<strong>and</strong> side becomes f (j) (a). It follows from (<strong>7.</strong>10) that the right h<strong>and</strong> side becomes j!cj.<br />

Thus we have f (j) (a) = j!cj, or cj = f (j) (a)/j!. Replace j by k <strong>and</strong> substitute the<br />

resulting identity ck = f (k) (a)/k! back to (<strong>7.</strong>8). We finally get<br />

f(x) =<br />

n<br />

k= 0<br />

f (k) (a)<br />

k!<br />

(x − a) k . (<strong>7.</strong>11)<br />

The last identity is called the Taylor expansion at x = a for the polynomial f.<br />

Example 10. Use the Taylor expansion to find the partial fractions of x2 + x + 2<br />

.<br />

(x − 1) 3<br />

Solution. Let p(x) = x 2 + x + 2. Then p (1) (x) = 2x + 1, p (2) (x) = 2, p (3) (x) = 0.<br />

So p(1) = 4, p (1) (1) = 3, p (2) (1) = 2. The Taylor expansion of p(x) at x = 1 is<br />

p(x) = 4 + 3(x − 1) + (x − 1) 2 . Hence<br />

x2 + x + 2 4 + 3(x − 1) + (x − 1)2<br />

=<br />

(x − 1) 3 (x − 1) 3<br />

which is the requires partial fraction decomposition.<br />

= 1<br />

x − 1 +<br />

3<br />

+<br />

(x − 1) 2<br />

4<br />

.<br />

(x − 1) 3<br />

Now we apply (<strong>7.</strong>11) to the monomial f(x) = x n . We have to compute the kth<br />

derivative of f(x) = x n for k ≤ n. The answer to this is supplied by (<strong>7.</strong>9). In using<br />

(<strong>7.</strong>9), we switch k to n <strong>and</strong> j to k, <strong>and</strong> set a = 0. We obtain<br />

f (k) (x) ≡ dk<br />

dx k xn =<br />

10<br />

n!<br />

(n − k)! xn−k


from which we get f (k) (a) = n!<br />

(n−k)! an−k . So (<strong>7.</strong>11) gives<br />

x n = n<br />

k= 0<br />

n!<br />

k!(n − k)! an−k (x − a) k .<br />

Substituting x = a + b to the last identity <strong>and</strong> noticing that<br />

n!<br />

k!(n − k)! =<br />

<br />

n<br />

, (<strong>7.</strong>12)<br />

k<br />

we obtain<br />

(a + b) n =<br />

n<br />

k= 0<br />

which is the celebrated binomial theorem.<br />

<br />

n<br />

a<br />

k<br />

n−k b k<br />

Exercise 1<strong>1.</strong> Use the binomial theorem to verify the identities<br />

<br />

n n n n<br />

n<br />

≡ + + · · · + = 2<br />

k= 0 k 0 1<br />

n<br />

n<br />

<strong>and</strong><br />

n<br />

k= 0 (−1)k<br />

<br />

n<br />

≡<br />

k<br />

<br />

n<br />

−<br />

0<br />

<br />

n<br />

+ · · · + (−1)<br />

1<br />

n<br />

<br />

n<br />

= 0.<br />

n<br />

The number n<br />

k , read as “n choose k”, is interpreted as the number of ways to choose<br />

k objects from n objects. For example, 4 4! = 2 2!2! = 6 tells us that there are 6 ways to<br />

take two objects from a set of 4. Indeed, suppose that the four objects are {♥ ♣ ♦ ♠},<br />

then the six ways of picking two are {♥ ♣}, {♥ ♦}, {♥ ♠}, {♣ ♦}, {♣ ♠}, {♦ ♠}.<br />

The following identities concerning “n choose k” is basic:<br />

<br />

n n n + 1 n<br />

= ,<br />

= +<br />

k n − k k k − 1<br />

<br />

n<br />

. (<strong>7.</strong>13)<br />

k<br />

Both can be understood as follows. The first identity says, in dividing n objects between<br />

you <strong>and</strong> me, k for me <strong>and</strong> n − k for you, the two methods below give the same number of<br />

ways. One method is that I pick k objects, leaving the rest to you, <strong>and</strong> there are n<br />

k ways<br />

to do this. The other method is that you pick n − k objects, leaving the rest to me <strong>and</strong><br />

there are n<br />

n−k ways to do this. The second identity concerns the number of ways to pick<br />

k objects from n + 1 objects. Let us label on of these n + 1 objects <strong>and</strong> call it ♠. There<br />

are n<br />

n<br />

k ways of taking k objects not including ♠ <strong>and</strong> there are k−1 ways of taking k<br />

objects including ♠. Together they give n n<br />

+ ways<br />

k k−1<br />

11


Exercise 12. Use (<strong>7.</strong>12) to verify (<strong>7.</strong>13).<br />

*The rest material of this chapter is optional.<br />

We use the binomial theorem to prove the following assertion inductively:<br />

Proposition If p(x) is a polynomial of degree n, then there is a polynomial f(x)<br />

of degree n + 1 such that p(x) = f(x + 1) − f(x).<br />

This proposition suggests how to find a recipe for the sum<br />

Sn = p(0) + p(1) + p(2) + p(3) + · · · + p(n),<br />

where p(x) is a given polynomial. Indeed, if we know f(x) with p(x) = f(x+1)−f(x) ≡<br />

−f(x) + f(x + 1), Sn becomes a “telescoping sum”:<br />

Sn = (−f(0) + f(1)) + (−f(1) + f(2)) + (−f(2) + f(3)) + · · · + (−f(n) + f(n + 1))<br />

= −f(0) + f(n + 1) = f(n + 1) − f(0).<br />

Since the relation f(x + 1) − f(x) = p(x) is unchanged when we add an constant to f(x),<br />

we may arrange f(x) in such a way that f(0) = 0 so that the above identity becomes<br />

Sn = f(n + 1), which is the required recipe for the sum. This proposition tells us that<br />

such f(x) exists. All we need is to look for it – that may involve some hard work!<br />

Now we use this method to find the recipe for the sum 1 2 +2 2 +3 2 +· · ·+n 2 . Certainly<br />

the answer is given by (<strong>7.</strong>3) in Exercise 2, which asks to verify this recipe by induction.<br />

But how one can conjure up such a recipe is a complete myth! Our task here is to use a<br />

systematic method to demystify this recipe. Here p(x) = x 2 . The above proposition tells<br />

us that there is a polynomial f(x) of degree 3 such that f(x + 1) − f(x) = p(x). Let<br />

us write f(x) = ax 3 + bx 2 + cx + d. Let us set f(0) = 0 so that d = 0. Now<br />

So we have<br />

f(1) = f(1) − f(0) = p(0) = 0,<br />

f(2) = f(2) − f(1) = p(1) = 1 2 = 1,<br />

f(3) = f(2) + p(2) = 1 + 2 2 = 5.<br />

a + b + c = 0, 8a + 4b + 2c = 1, 27a + 9b + 3c = 4.<br />

The solution to this system of linear equations is a = 1/3, b = −1/2, c = 1/6. So<br />

f(x) = 1<br />

3 x3 − 1<br />

2 x2 + 1<br />

6 x = (2x2 − 3x + 1)x<br />

6<br />

12<br />

= (x − 1)(2x − 1)x<br />

6


<strong>and</strong> hence Sn = f(n + 1) = n(2n + 1)(n + 1)/6.<br />

Now we return to the proof of the proposition by induction on the degree n of p(x).<br />

When n = 0, p(x) is a constant, say p(x) = c. In this case we just let f(x) = cx, which<br />

is a polynomial of degree 1 ≡ 0 + <strong>1.</strong> Indeed,<br />

f(x + 1) − f(x) = c(x + 1) − cx = c = p(x).<br />

Now we assume that the proposition is valid for all n ≤ k − 1 <strong>and</strong> let p(x) be a<br />

polynomial of degree k. We can write p(x) = cx k + s(x), where s(x) is of degree<br />

≤ k − <strong>1.</strong> By the induction hypothesis, there is a polynomial g(x) of degree ≤ k such<br />

that s(x) = g(x + 1) − g(x). On the other h<strong>and</strong>, the binomial theorem tells us that<br />

(x + 1) k+ 1 = x k+ 1 + (k + 1)x k + r(x), where r(x) is a polynomial of degree k − 1 <strong>and</strong><br />

hence r(x) = h(x + 1) − h(x) for some polynomial h(x) of degree k. Thus we have<br />

x k = 1 k+ 1 k+ 1<br />

(x + 1) − x − (h(x + 1) − h(x)) = q(x + 1) − q(x)<br />

k + 1<br />

where q(x) = (x k+ 1 − h(x))/(k + 1) is a polynomial of degree k + <strong>1.</strong> Hence<br />

p(x) = c(q(x + 1) − q(x)) + (g(x + 1) − g(x)) = f(x + 1) − f(x),<br />

where f(x) = cq(x) + g(x) is a polynomial of degree k + <strong>1.</strong> The proof is complete.<br />

13

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!