30.09.2014 Views

final report - probability.ca

final report - probability.ca

final report - probability.ca

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

when the lo<strong>ca</strong>l correlations are sufficiently small so that no phase transitions occur. The Gibbs measure used as<br />

the target distribution for this model is such that the field’s correlations decrease exponentially fast as a function<br />

of distance. In particular, they consider a Gibbs measure on R Zr which has a density with respect to ∏ k∈Z µ (dx k)<br />

r<br />

given by<br />

{<br />

}<br />

exp<br />

− ∑ k∈Z r U k (x)<br />

where U k is a finite-range potential, depending only on a finite number of neighboring terms of k. Markov random<br />

fields with signifi<strong>ca</strong>nt phase-transition behavior will require algorithms that use proposals with large jumps in order<br />

to provide sufficient mixing. This will often yield degenerate algorithms, as the acceptance rate will converge to 0,<br />

making the algorithm practi<strong>ca</strong>lly useless.<br />

2.2 Roberts and Rosenthal (2001): & Inhomogeneous Targets<br />

Another validation of the asymptotic optimal acceptance rate equaling 0.234 for the RWM was proven by Roberts<br />

and Rosenthal (2001) [RR01] in the <strong>ca</strong>se of inhomogeneous target distributions. Specifi<strong>ca</strong>lly, they considered<br />

inhomogeneous target distributions of the form<br />

d∏<br />

π (x) = C i f (C i x i ) , (2.1)<br />

i=1<br />

where the {C i } are themselves i.i.d. for some fixed distribution, and f is again a fixed one-dimensional density<br />

satisfying the regularity conditions discussed in (1.10). The components in this target are uncorrelated, but heterogeneously<br />

s<strong>ca</strong>led. The target (1.8) used in [RGG97] corresponds to the <strong>ca</strong>se where the {C i } are all constant in (2.1).<br />

In order to obtain general results for the limiting theory of the RWM algorithm applied to (2.1), it is assumed that<br />

C i have mean 1 and variance b < ∞.<br />

Theorem 2.1. (Theorem 5 in [RR01]) Consider RWM with target density (2.1), and with proposal distribution<br />

N ( (<br />

0, I d l 2 /d )) , for some l > 0. Consider the time-res<strong>ca</strong>led process consisting of the first component of the induced<br />

Markov chain, i.e., W (d)<br />

t = C 1 X (1)<br />

[td] . As d → ∞, W t<br />

d ⇒ W t , where W t satisfies the Langevin SDE<br />

dW t = 1 2 g′ (W t ) (C 1 s) 2 dt + (C 1 s) dB t ,<br />

with B t being a standard Brownian motion and where<br />

(<br />

)<br />

s 2 = 2l 2 Φ −lb 1/2 I 1/2 /2<br />

= 1 b · 2 ( l 2 b ) (<br />

Φ − ( l 2 b ) )<br />

1/2<br />

I 1/2 /2 ,<br />

with I = E f<br />

[(g ′ (X)) 2] .<br />

Remark 2.2. When considering a functional of the first coordinate of the algorithm, the efficiency of the algorithm<br />

as a function of acceptance rate is the same as that found in [RGG97], but multiplied with a global “inhomogeneous”<br />

factor C1/b. 2 For a fixed function of f, the optimal asymptotic efficiency is proportional to C1/bd. 2 In particular,<br />

the efficiency of the algorithm is now slowed down by a factor of b = E ( )<br />

Ci<br />

2 /E (Ci ) 2 , which by Jensen’s inequality<br />

is greater than or equal to 1, with equality coming only when the factors C i are all constant, which is equivalent to<br />

the form of (1.8).<br />

Remark 2.3. When the values of C i are known, then instead of using the proposal distribution N ( )<br />

0, σ 2 (d) I d ,<br />

one should use an inhomogeneous proposal which is s<strong>ca</strong>led in proportion to C i for each component, i..e, we should<br />

consider proposals of the form N ( 0, ¯σ 2 (d) diag (C 1 , . . . , C d ) ) . This would then yield a RWM applied to the form<br />

of the target (1.8), since the s<strong>ca</strong>ling in the proposal and in the target would <strong>ca</strong>ncel each other out. This would<br />

alleviate the inefficiency factor of b as discussed above. See Theorem 6 and the subsequent discussion in [RR01] for<br />

a more detailed assessment of inhomogeneous proposals for such target distributions.<br />

The results above provide some theoreti<strong>ca</strong>l justifi<strong>ca</strong>tion for a frequently used strategy as suggested by Tierney<br />

in [Tie94], which is to use a proposal distribution that is a s<strong>ca</strong>lar multiple of the correlation matrix obtained<br />

by estimating the correlation structure of the target distribution, perhaps through an empiri<strong>ca</strong>l estimate based<br />

on a sample MCMC run, or numeri<strong>ca</strong>lly through curvature <strong>ca</strong>lculations on the target distribution itself. Using<br />

inhomogeneous proposals by first obtaining a preliminary estimate of the C i <strong>ca</strong>n thus lead to more efficient Metropolis<br />

algorithms.<br />

,<br />

16

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!