01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

An important idea, although not one that is universally accepted, is to run<br />

more than one cha<strong>in</strong>. The advantages of many cha<strong>in</strong>s over one much longer<br />

cha<strong>in</strong> have been discussed widely (see Gelman & Rub<strong>in</strong>, 1992, and associated<br />

follow<strong>in</strong>g discussion). In any given application the issue may be resolved by<br />

practical considerations of the computational equipment available. However,<br />

multiple cha<strong>in</strong>s clearly allow the possibility of start<strong>in</strong>g cha<strong>in</strong>s from a variety of<br />

values and assess<strong>in</strong>g how long the <strong>in</strong>fluence of the start<strong>in</strong>g value appears to<br />

persist. Gelman (1996) argues that methods which try to assess convergence on<br />

the basis of a s<strong>in</strong>gle sequence from an MCMC method often fail to detect nonconvergence.<br />

Inference<br />

16.4 Markov cha<strong>in</strong> Monte Carlo methods 559<br />

Once the length of the `burn-<strong>in</strong>' period has been determ<strong>in</strong>ed, it rema<strong>in</strong>s to decide<br />

how many further samples should be used to make <strong>in</strong>ferences about the posterior<br />

distribution. Essentially this is just a problem <strong>in</strong> estimation for which standard<br />

sample-size calculations might be contemplated. If the aim is to estimate the<br />

posterior mean of some scalar function, f …u†, of the parameter, then generat<strong>in</strong>g<br />

enough values from the posterior to ensure that the standard error of this<br />

quantity is below a specified value might be a sensible approach. If an estimate<br />

of the standard deviation, s, off…u† is available, then, assum<strong>in</strong>g that the values<br />

obta<strong>in</strong>ed from the cha<strong>in</strong> are <strong>in</strong>dependent, the usual formula for the standard<br />

error, s= n<br />

p can be used to estimate the number, n, of simulations required.<br />

However, it will often be <strong>in</strong>appropriate to assume that the values obta<strong>in</strong>ed<br />

from the cha<strong>in</strong> are <strong>in</strong>dependent and time series methods can be applied. For<br />

example, a first-order autoregression (see §12.7) can be fitted to the successive<br />

values of f …u† obta<strong>in</strong>ed from the cha<strong>in</strong>. If this approach is followed then the<br />

appropriate formula for the standard error becomes<br />

r<br />

s<br />

p<br />

n<br />

1 ‡ r<br />

1 r<br />

where r is the autocorrelation coefficient. It can be appreciated from this<br />

formula that, if there is substantial autocorrelation <strong>in</strong> the series, so that r is<br />

close to 1, a very large value of n may be needed to ensure that the standard error<br />

is below the required value. A valuable discussion of related issues can be found<br />

<strong>in</strong> Geyer (1992). See also Raftery and Lewis (1992).<br />

If it is necessary to use a very long cha<strong>in</strong> to obta<strong>in</strong> adequate estimates of<br />

relevant quantities, then problems can arise with the amount of computer<br />

storage required. If the successive values are highly correlated (and, as just<br />

seen, very long runs are likely to be required <strong>in</strong> just this circumstance), perhaps<br />

little will be lost by keep<strong>in</strong>g only every Lth value of u obta<strong>in</strong>ed from the cha<strong>in</strong>.<br />

Apply<strong>in</strong>g this device, which is known as th<strong>in</strong>n<strong>in</strong>g, may well result <strong>in</strong> a set of<br />

,

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!