01.04.2014 Views

Chapter 14 - Bootstrap Methods and Permutation Tests - WH Freeman

Chapter 14 - Bootstrap Methods and Permutation Tests - WH Freeman

Chapter 14 - Bootstrap Methods and Permutation Tests - WH Freeman

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>14</strong>-46 CHAPTER <strong>14</strong> <strong>Bootstrap</strong> <strong>Methods</strong> <strong>and</strong> <strong>Permutation</strong> <strong>Tests</strong><br />

(b) Examine the bootstrap distribution of the slope b 1 of the least-squares regression<br />

line. The distribution shows some departures from normality. In<br />

what way is the bootstrap distribution nonnormal? What is the bootstrap<br />

estimate of bias? Based on what you see, would you consider use of bootstrap<br />

t or bootstrap percentile intervals?<br />

(c) Give the BCa 95% confidence interval for the slope β 1 of the population<br />

regression line. Compare this with the st<strong>and</strong>ard 95% confidence interval<br />

based on normality, the bootstrap t interval, <strong>and</strong> the bootstrap percentile<br />

interval. Using the BCa interval as a st<strong>and</strong>ard, which of the other intervals<br />

are adequately accurate for practical use?<br />

<strong>14</strong>.43 Table <strong>14</strong>.2 gives data on a sample of 50 baseball players.<br />

(a) Find the least-squares regression line for predicting salary from batting<br />

average.<br />

(b) <strong>Bootstrap</strong> the regression line <strong>and</strong> give a 95% confidence interval for the<br />

slope of the population regression line.<br />

(c) In the discussion of Example <strong>14</strong>.9 we found bootstrap confidence intervals<br />

for the correlation between salary <strong>and</strong> batting average. Does your interval<br />

for the slope of the population line agree with the conclusion of that<br />

example that there may be no relation between salary <strong>and</strong> batting average?<br />

Explain.<br />

<strong>14</strong>.44 We know that outliers can strongly influence statistics such as the mean <strong>and</strong><br />

the least-squares line. Example 7.7 (page 459) describes a matched pairs study<br />

of disruptive behavior by dementia patients. The differences in Table 7.2 show<br />

several low values that may be considered outliers.<br />

(a) <strong>Bootstrap</strong> the mean of the differences with <strong>and</strong> without the three low values.<br />

How do these values influence the shape <strong>and</strong> bias of the bootstrap<br />

distribution?<br />

(b) Give the BCa or tilting confidence interval from both bootstrap distributions.<br />

Discuss the differences.<br />

<strong>14</strong>.5 Significance Testing Using<br />

<strong>Permutation</strong> <strong>Tests</strong><br />

Significance tests tell us whether an observed effect, such as a difference between<br />

two means or a correlation between two variables, could reasonably<br />

occur “just by chance” in selecting a r<strong>and</strong>om sample. If not, we have evidence<br />

that the effect observed in the sample reflects an effect that is present in the<br />

population. The reasoning of tests goes like this:<br />

1. Choose a statistic that measures the effect you are looking for.<br />

2. Construct the sampling distribution that this statistic would have if the effect<br />

were not present in the population.<br />

3. Locate the observed statistic on this distribution. A value in the main body<br />

of the distribution could easily occur just by chance. A value in the tail would

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!