06.06.2013 Views

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

Theory of Statistics - George Mason University

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

840 Appendix B. Useful Inequalities in Probability<br />

Corollary B.3.1.1 (Chebyshev’s inequality)<br />

For ɛ > 0,<br />

Pr(|X − E(X)| ≥ ɛ) ≤ 1<br />

ɛ 2V(X)<br />

Pro<strong>of</strong>. In Markov’s inequality, let k = 2, and replace X by X − E(X).<br />

Corollary B.3.1.2 (Chebyshev’s inequality (another form))<br />

For f ∋ f(x) ≥ 0 and ɛ > 0,<br />

(B.6)<br />

Pr(f(X) ≥ ɛ) ≤ 1<br />

E(f(X)) (B.7)<br />

ɛ<br />

Pro<strong>of</strong>. Same as Markov’s inequality; start with E(f(X)).<br />

Chebyshev’s inequality is <strong>of</strong>ten useful for ɛ = V(X). There are also<br />

versions <strong>of</strong> Chebyshev’s inequality for specific families <strong>of</strong> distributions.<br />

• 3σ rule for a unimodal random variable<br />

If X is a random variable with a unimodal absolutely continuous distribution,<br />

and σ = V(X), then<br />

See Dharmadhikari and Joag-Dev (1988).<br />

• Normal tail probability<br />

If X ∼ N(µ, σ 2 ), then<br />

See DasGupta (2000).<br />

Pr(|X − E(X)| ≥ 3σ) ≤ 4<br />

. (B.8)<br />

81<br />

Pr(|X − µ| ≥ kσ) ≤ 1<br />

3k 2.<br />

(B.9)<br />

There are a number <strong>of</strong> inequalities that generalize Chebyshev’s inequality<br />

to finite partial sums <strong>of</strong> a sequence <strong>of</strong> independent random variables<br />

X1, X2, . . . over a common probability space such that for each, E(Xi) = 0<br />

and E(X 2 i<br />

) < ∞. (The common probability space means that E(·) has the<br />

same meaning for each i, but the Xi do not necessarily have the same distribution.)<br />

These inequalities are <strong>of</strong>ten called the The Bernstein inequalities,<br />

but some <strong>of</strong> them have other names.<br />

Theorem B.3.2 (the Hoeffding inequality)<br />

Let X1, . . ., Xn be independent, E(Xi) = 0, and Pr(|Xi| ≤ c) = 1. Then for<br />

any t > 0,<br />

<br />

n<br />

<br />

t<br />

Pr Xi > t ≤ exp −<br />

2 /2<br />

n i=1 E(X2 <br />

. (B.10)<br />

i ) + ct/3<br />

i=1<br />

The Hoeffding inequality is a special case <strong>of</strong> the Azuma inequality for martingales<br />

(see Section 1.6).<br />

<strong>Theory</strong> <strong>of</strong> <strong>Statistics</strong> c○2000–2013 James E. Gentle

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!