A Course on Large Deviations with an Introduction to Gibbs Measures.
1. Introduction
Example 1.1. Let us consider coin tosses. Let {X_n} be an i.i.d. sequence of Bernoulli random variables with success probability p (i.e. each X_n = 1 with probability p and 0 otherwise). Denote the partial sum by S_n = X_1 + · · · + X_n. The law of large numbers ([15] or page 73 of [26]) says that S_n/n converges to p, almost surely. But at any given n there is a chance of p^n that we get all heads (S_n = n) and also a chance of (1 − p)^n that we get all tails (S_n = 0). In fact, for any s ∈ (0, 1) there is always a chance that one gets a fraction of heads close to s. Let us compute this probability.
Let us write [x] for the integral part of x ∈ R, i.e. the largest integer less than or equal to x. Write
\[
P\{S_n = [ns]\} = \frac{n!}{[ns]!\,(n-[ns])!}\, p^{[ns]} (1-p)^{n-[ns]}
\sim \frac{n^n\, p^{[ns]} (1-p)^{n-[ns]}}{[ns]^{[ns]}\,(n-[ns])^{n-[ns]}}
\sqrt{\frac{n}{2\pi [ns](n-[ns])}}\,,
\]
where we have used Stirling's formula n! ∼ e^{−n} n^n √(2πn); see, for example, page 21 of Khoshnevisan's textbook [26] or page 52 of Feller's Vol. I [17]. (We say that a_n ∼ b_n, or a_n is equivalent to b_n, when a_n/b_n → 1.) Let us abbreviate
\[
\beta_n = \sqrt{\frac{n}{2\pi [ns](n-[ns])}}\,,
\qquad
\gamma_n = \frac{(ns)^{ns}\,(n-ns)^{n-ns}}{[ns]^{[ns]}\,(n-[ns])^{n-[ns]}}
\cdot \frac{p^{[ns]}\,(1-p)^{n-[ns]}}{p^{ns}\,(1-p)^{n-ns}}\,.
\]
Then, P{S_n = [ns]} is equivalent to
\[
\beta_n \gamma_n \exp\{\,n\log n - ns\log(ns) - n(1-s)\log(n(1-s)) + ns\log p + n(1-s)\log(1-p)\,\}.
\]
∗ Exercise 1.2. Show that there exists a constant C such that
\[
\frac{1}{C\sqrt{n}} \le \beta_n \le C
\quad\text{and}\quad
\frac{1}{Cn} \le \gamma_n \le Cn
\]
for large enough n.
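A quick numerical sanity check of these bounds can be sketched in Python; the parameters p = 0.3 and s = 2/3 are illustrative choices, not from the text, and γ_n is computed in logarithms to avoid overflow:

```python
import math

# Illustrative parameters (not from the text): p = 0.3, s = 2/3.
p, s = 0.3, 2 / 3

for n in (10**2, 10**4, 10**6):
    k = math.floor(n * s)  # [ns], the integral part of ns
    beta = math.sqrt(n / (2 * math.pi * k * (n - k)))
    # log(gamma_n): numerator terms minus denominator terms, all in logs
    log_gamma = (n * s * math.log(n * s) + (n - n * s) * math.log(n - n * s)
                 - k * math.log(k) - (n - k) * math.log(n - k)
                 + (k - n * s) * math.log(p) + (n * s - k) * math.log(1 - p))
    print(f"n={n}: beta_n={beta:.3e}, gamma_n={math.exp(log_gamma):.4f}")
```

In this sketch β_n stays of order 1/√n and γ_n stays bounded, comfortably within the bounds of the exercise.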
One then has
\[
\lim_{n\to\infty} \frac{1}{n}\log P\{S_n = [ns]\} = -I_p(s),
\quad\text{with}\quad
I_p(s) = s\log\frac{s}{p} + (1-s)\log\frac{1-s}{1-p}.
\tag{1.1}
\]
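The convergence in (1.1) can be checked numerically by evaluating log P{S_n = [ns]} exactly through log-factorials; a minimal Python sketch, with illustrative parameters p = 0.3 and s = 0.6:

```python
import math

def log_binom_pmf(n, k, p):
    """Exact log of P{S_n = k} for S_n ~ Binomial(n, p), via log-gamma."""
    return (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
            + k * math.log(p) + (n - k) * math.log(1 - p))

def I(s, p):
    """Rate function I_p(s) = s log(s/p) + (1-s) log((1-s)/(1-p))."""
    return s * math.log(s / p) + (1 - s) * math.log((1 - s) / (1 - p))

p, s = 0.3, 0.6  # illustrative values, not from the text
for n in (10**2, 10**4, 10**6):
    k = math.floor(n * s)
    # -(1/n) log P{S_n = [ns]} should approach I_p(s)
    print(f"n={n}: {-log_binom_pmf(n, k, p) / n:.6f}  vs  I_p(s)={I(s, p):.6f}")
```

The discrepancy is of order (log n)/n, coming from the polynomial prefactor β_n γ_n.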
This function I_p is continuous on (0, 1) and its limits at 0 and 1 are exactly what we predicted earlier: I_p(1) = log(1/p) and I_p(0) = log(1/(1 − p)). For s outside [0, 1] it is natural to set I_p(s) = ∞. Figure 1.1 shows what this function looks like.
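The endpoint limits can be confirmed numerically; a small Python sketch (p = 0.3 is an illustrative choice):

```python
import math

def I(s, p):
    # I_p(s) = s log(s/p) + (1-s) log((1-s)/(1-p)), valid for s in (0, 1)
    return s * math.log(s / p) + (1 - s) * math.log((1 - s) / (1 - p))

p = 0.3
# As s -> 0, I_p(s) -> log(1/(1-p)); as s -> 1, I_p(s) -> log(1/p).
for s in (1e-6, 1 - 1e-6):
    print(f"I_p({s}) = {I(s, p):.6f}")
print(f"log(1/(1-p)) = {math.log(1 / (1 - p)):.6f},  log(1/p) = {math.log(1 / p):.6f}")
```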
The function I_p in (1.1) is called a large deviation rate function. I_p(s) is also called the entropy of the coin yielding heads with probability s relative