Monte Carlo Methods in Statistical Mechanics: Foundations and ...
is very far from equilibrium. By throwing away the data from the initial transient, we lose nothing, and avoid a potentially large systematic error.
Autocorrelation in equilibrium. As explained in the preceding lecture, the variance of the sample mean f̄ in a dynamic Monte Carlo method is a factor 2τ_int,f higher than it would be in independent sampling. Otherwise put, a run of length n contains only n/(2τ_int,f) "effectively independent data points".
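As a concrete illustration of these quantities, here is a minimal sketch (not from the lecture; the function name and the windowing constant c are my own choices) of estimating the integrated autocorrelation time τ_int,f from a time series, using the standard self-consistent windowing rule of truncating the sum of the autocorrelation function at the smallest lag M with M ≥ c·τ_int(M):

```python
import numpy as np

def integrated_autocorr_time(x, c=5.0):
    """Estimate tau_int = 1/2 + sum_{t>=1} rho(t) for a time series x,
    truncating the sum with the self-consistent window M >= c * tau_int(M)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    dx = x - x.mean()
    # Autocovariance via FFT (zero-padded to avoid circular wrap-around);
    # rho(t) = C(t) / C(0).
    f = np.fft.rfft(dx, 2 * n)
    acov = np.fft.irfft(f * np.conj(f))[:n] / n
    rho = acov / acov[0]
    tau = 0.5                      # running estimate of tau_int
    for m in range(1, n):
        tau += rho[m]
        if m >= c * tau:           # window has become self-consistent
            return max(tau, 0.5)
    return max(tau, 0.5)

# Toy chain with a known answer: an AR(1) process x_t = a*x_{t-1} + noise
# has rho(t) = a^t and hence tau_int = 1/2 + a/(1-a) = 9.5 for a = 0.9.
rng = np.random.default_rng(0)
a = 0.9
x = np.empty(200_000)
x[0] = 0.0
for t in range(1, len(x)):
    x[t] = a * x[t - 1] + rng.standard_normal()

tau = integrated_autocorr_time(x)
n_eff = len(x) / (2 * tau)   # "effectively independent" data points
```

The run of 200000 correlated samples is thus worth only about n/(2·9.5) ≈ 10500 independent ones, which is exactly the penalty factor described in the text.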
This has several implications for Monte Carlo work. On the one hand, it means that the computational efficiency of the algorithm is determined principally by its autocorrelation time. More precisely, if one wishes to compare two alternative Monte Carlo algorithms for the same problem, then the better algorithm is the one that has the smaller autocorrelation time, when time is measured in units of computer (CPU) time. [In general there may arise tradeoffs between "physical" autocorrelation time (i.e. measured in iterations) and computational complexity per iteration.] So accurate measurements of the autocorrelation time are essential for evaluating the computational efficiency of competing algorithms.
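The tradeoff just mentioned can be made concrete: the figure of merit is the autocorrelation time in CPU units, i.e. τ (in iterations) times the CPU cost per iteration. A tiny sketch with hypothetical numbers (both algorithms and their costs are invented for illustration):

```python
def tau_cpu(tau_iters, seconds_per_iter):
    """Autocorrelation time in CPU units: the quantity that actually
    determines which of two algorithms is more efficient."""
    return tau_iters * seconds_per_iter

# Hypothetical comparison: algorithm A decorrelates faster per sweep,
# but each of its sweeps is much more expensive.
a = tau_cpu(5.0, 2.0e-3)    # 5 sweeps * 2 ms/sweep  = 0.010 s
b = tau_cpu(40.0, 1.0e-4)   # 40 sweeps * 0.1 ms/sweep = 0.004 s
# B wins despite its larger "physical" autocorrelation time.
```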
On the other hand, even for a fixed algorithm, knowledge of τ_int,f is essential for determining run lengths (is a run of 100000 sweeps long enough?) and for setting error bars on estimates of ⟨f⟩. Roughly speaking, error bars will be of order (τ/n)^{1/2}; so if we want 1% accuracy, then we need a run of length ≈ 10000τ, and so on. Above all, there is a basic self-consistency requirement: the run length n must be much larger than the estimates of τ produced by that same run; otherwise none of the results from that run should be believed. Of course, while self-consistency is a necessary condition for the trustworthiness of Monte Carlo data, it is not a sufficient condition: there is always the danger of metastability.
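The error-bar formula and the self-consistency requirement can be packaged in a few lines. This is a sketch under the assumptions in the text (the function name and the particular threshold n ≥ 1000·τ are my own choices, not prescriptions from the lecture):

```python
import numpy as np

def mc_mean_and_error(x, tau_int, min_taus=1000):
    """Sample mean with an error bar inflated by the autocorrelation
    factor: sigma_mean = sqrt(2 * tau_int * var(x) / n), i.e. the naive
    i.i.d. error bar times sqrt(2 * tau_int)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    err = np.sqrt(2.0 * tau_int * x.var(ddof=1) / n)
    # Self-consistency: believe the run only if it is much longer than
    # the autocorrelation time it itself reports.
    trustworthy = n >= min_taus * tau_int
    return x.mean(), err, trustworthy

# For independent data tau_int = 1/2, and the error bar reduces to the
# familiar sqrt(var/n).
rng = np.random.default_rng(1)
x = rng.standard_normal(10_000)
mean, err, ok = mc_mean_and_error(x, tau_int=0.5)
```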
Already we can draw a conclusion about the relative importance of initialization bias and autocorrelation as difficulties in dynamic Monte Carlo work. Let us assume that the time for initial convergence to equilibrium is comparable to (or at least not too much larger than) the equilibrium autocorrelation time τ_int,f (for the observables f of interest); this is often but not always the case. Then initialization bias is a relatively trivial problem compared to autocorrelation in equilibrium. To eliminate initialization bias, it suffices to discard ≈ 20τ of the data at the beginning of the run; but to achieve a reasonably small statistical error, it is necessary to make a run of length ≈ 1000τ or more. So the data that must be discarded at the beginning, n_disc, is a negligible fraction of the total run length n. This estimate also shows that the exact value of n_disc is not particularly delicate: anything between ≈ 20τ and n/5 will eliminate essentially all initialization bias while paying less than a 10% price in the final error bars.
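The rule of thumb above (discard at least ≈ 20τ, but never more than n/5) can be stated as a one-liner; the function name is mine, and the multiple 20 is the text's rule of thumb rather than anything delicate:

```python
def choose_discard(n, tau_int):
    """Burn-in length per the text's rule of thumb: discard about
    20 * tau_int iterations, capped at n/5 so the statistical price
    in the final error bars stays small."""
    return min(int(20 * tau_int), n // 5)

# A run of 100000 sweeps with tau = 10 discards only 200 sweeps,
# a negligible fraction of the run, as the text argues.
n_disc = choose_discard(100_000, 10)
```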
In the remainder of this lecture I would like to discuss in more detail the statistical analysis of dynamic Monte Carlo data (assumed to be already "in equilibrium"), with emphasis on how to estimate the autocorrelation time τ_int,f and how to compute valid error bars. What is involved here is a branch of mathematical statistics called time-series analysis. An excellent exposition can be found in the books of Priestley [14] and Anderson [15].