08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

n+1 ∫<br />

x=m<br />

f (x)dx ≤<br />

n ∑<br />

i=m<br />

f (i) ≤<br />

n∫<br />

x=m−1<br />

f (x)dx<br />

m − 1 m n n + 1<br />

Figure 12.1: Approximating sums by integrals<br />

More generally for any convex function f,<br />

( n∑<br />

)<br />

f α i x i ≤<br />

i=1<br />

n∑<br />

α i f (x i ),<br />

∑<br />

where 0 ≤ α i ≤ 1 and n α i = 1. From this, it follows that for any convex function f and<br />

random variable x,<br />

i=1<br />

i=1<br />

E (f (x)) ≥ f (E (x)) .<br />

We prove this for a discrete random variable x taking on values a 1 , a 2 , . . . with Prob(x =<br />

a i ) = α i :<br />

E(f(x)) = ∑ i<br />

α i f(a i ) ≥ f<br />

( ∑<br />

i<br />

α i a i<br />

)<br />

= f(E(x)).<br />

f(x 1 )<br />

f(x 2 )<br />

x 1 x 2<br />

Figure 12.2: For a convex function f, f ( )<br />

x 1 +x 2<br />

2 ≤<br />

1<br />

(f (x 2 1) + f (x 2 )) .<br />

386

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!