11.07.2015 Views

statisticalrethinkin..

statisticalrethinkin..

statisticalrethinkin..

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3.2. SAMPLING TO SUMMARIZE 67In contrast, the right-hand plot in FIGURE 3.3 displays the 50% HIGHEST POSTERIORDENSITY INTERVAL (HPDI). 43 e HPDI is the narrowest interval containing the specifiedprobability mass. If you think about it, there must be an infinite number of posterior intervalswith the same mass. But if you want an interval that best represents the parameter valuesmost consistent with the data, then you want the densest of these intervals. at’s what theHPDI is. Compute it from the samples with HPDI (also part of rethinking):HPDI( samples , prob=0.5 )R code3.13lower 0.5 upper 0.50.8418418 1.0000000is interval captures the parameters with highest posterior probability, as well as being noticeablynarrower: 0.16 in width rather than 0.23 for the PI.So the HPDI has some advantages over the PI. But in most cases, these two types ofinterval are very similar. 44 ey only look so different in this case because the posterior distributionis so highly skewed. If we used samples from the posterior distribution for 6 watersin 9 tosses (as in most of this chapter), instead these intervals would be nearly identical. Tryit for yourself, using different probability masses, such as prob=0.8 and prob=0.95. Whenthe posterior is bell shaped, it hardly matters which type of interval you use. Remember,we’re not launching rockets or calibrating atom smashers, so fetishizing precision to the 5thdecimal place will not improve your science.e HPDI also has some disadvantages. HPDI is more computationally intensive than PIand suffers from greater simulation variance, which is a fancy way of saying that it is sensitiveto how many samples you draw from the posterior. It is also harder to understand and manyscientific audiences will not appreciate its features, while they will immediately understand apercentile interval, as ordinary non-Bayesian intervals are nearly always percentile intervals(although of sampling distributions, not posterior distributions).Overall, if the choice of interval type makes a big difference, then you shouldn’t be usingintervals to summarize the posterior. Remember, the entire posterior distribution isthe Bayesian estimate. It summarizes the relative plausibilities of each possible value of theparameter. Intervals of the distribution are just helpful for summarizing it. If choice of intervalleads to different inferences, then you’d be better off just plotting the entire posteriordistribution.Rethinking: Why 95%? e most common interval mass in the natural and social sciences is the95% interval. is interval leaves 5% of the probability outside, correspond to a 5% chance of theparameter not lying within the interval (although see below). is customary interval also reflectsthe customary threshold for statistical significance, which is 5% or P < 0.05. It is not easy to defendthe choice of 95% (5%), outside of pleas to convention. Ronald Fisher is sometimes blamed for thischoice, but his widely-cited 1925 invocation of it was not enthusiastic:“e [number of standard deviations] for which P = .05, or 1 in 20, is 1.96 ornearly 2; it is convenient to take this point as a limit in judging whether a deviationis to be considered significant or not.” 45Most people don’t think of convenience as a serious criterion. Later in his career, Fisher activelyadvised against always using the same threshold for significance or width of confidence interval. 46So what are you supposed to do then? ere is no consensus, but thinking is always a good idea.If you are trying to say that an interval doesn’t include some value, then you might use the widest

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!