Chapter 14 - Bootstrap Methods and Permutation Tests - WH Freeman

More documents

Recommendations

Info

$Design and analysis of a fractional 413125 split-plot ... - ResearchGate$

14-44 CHAPTER 14 Bootstrap Methods and Permutation Tests 14.33 The distribution of the 72 guinea pig lifetimes in Table 1.8 (page 38) is strongly skewed. In Exercise 14.9 (page 14-23) you found a bootstrap t confidence interval for the population mean µ, even though some skewness remains in the bootstrap distribution. Bootstrap the mean lifetime and give all four bootstrap 95% confidence intervals: t, percentile, BCa, and tilting. Make a graphical comparison by drawing a vertical line at the original sample mean x and displaying the four intervals horizontally, one above the other. Discuss what you see: Do bootstrap t and percentile agree? Do the more accurate intervals agree with the two simpler methods? 14.34 We would like a 95% confidence interval for the standard deviation σ of Seattle real estate prices. Your work in Exercise 14.11 probably suggests that it is risky to bootstrap the sample standard deviation s from the sample in Table 14.1 and use the bootstrap t interval. Now we have more accurate methods. Bootstrap s and report all four bootstrap 95% confidence intervals: t, percentile, BCa, and tilting. Make a graphical comparison by drawing a vertical line at the original s and displaying the four intervals horizontally, one above the other. Discuss what you see: Do bootstrap t and percentile agree? Do the more accurate intervals agree with the two simpler methods? What interval would you use in a report on real estate prices? CHALLENGE CHALLENGE 14.35 Exercise 14.7 (page 14-13) gives an SRS of 20 of the 72 guinea pig survival times in Table 1.8. The bootstrap distribution of x from this sample is clearly right-skewed. Give a 95% confidence interval for the population mean µ based on these data and a method of your choice. Describe carefully how your result differs from the intervals in Exercise 14.33, which use the full sample of 72 lifetimes. 14.36 The CLEC data for Example 14.6 are strongly skewed to the right. The 23 CLEC repair times appear in Exercise 14.22 (page 14-26). (a) Bootstrap the mean of the data. Based on the bootstrap distribution, which bootstrap confidence intervals would you consider for use? Explain your answer. (b) Find all four bootstrap confidence intervals. How do the intervals compare? Briefly explain the reasons for any differences. In particular, what kind of errors would you make in estimating the mean repair time for all CLEC customers by using a t interval or percentile interval instead of a tilting or BCa interval? 14.37 Example 14.6 (page 14-19) considers the mean difference between repair times for Verizon (ILEC) customers and customers of competing carriers (CLECs). The bootstrap distribution is nonnormal with strong left skewness, so that any t confidence interval is inappropriate. Give the BCa 95% confidence interval for the mean difference in service times for all customers. In practical terms, what kind of error would you make by using a t interval or percentile interval instead of a BCa interval? 14.38 Figure 2.3 (page 108) is a scatterplot of field versus laboratory measurements of the depths of 100 defects in the Trans-Alaska Oil Pipeline. The correlation is r = 0.944. Bootstrap the correlation for these data. (The data are in the file ex14 038.dat.)
Section 14.4 Exercises 14-45 (a) Describe the shape and bias of the bootstrap distribution. Do the simpler bootstrap confidence intervals (t and percentile) appear to be justified? (b) Find all four bootstrap 95% confidence intervals: t, percentile, BCa, and tilting. Make a graphical comparison by drawing a vertical line at the original correlation r and displaying the four intervals horizontally, one above the other. Discuss what you see. Does it still appear that the simpler intervals are justified? What confidence interval would you include in a report comparing field and laboratory measurements? 14.39 Figure 2.7 (page 114) shows a very weak relationship between returns on Treasury bills and returns on common stocks. The correlation is r =−0.113. We wonder if this is significantly different from 0. To find out, bootstrap the correlation. (The data are in the file ex14 039.dat.) (a) Describe the shape and bias of the bootstrap distribution. It appears that even simple bootstrap inference (t and percentile confidence intervals) is justified. Explain why. (b) Give the BCa and bootstrap percentile 95% confidence intervals for the population correlation. Do they (as expected) agree closely? Do these intervals provide significant evidence at the 5% level that the population correlation is not 0? CHALLENGE CHALLENGE 14.40 Describe carefully how to resample from data on an explanatory variable x and a response variable y to create a bootstrap distribution for the slope b 1 of the least-squares regression line. (Software such as S-PLUS automates resampling methods for regression inference.) 14.41 Continue your study of historical returns on Treasury bills and common stocks, begun in Exercise 14.39, by regressing stock returns on T-bill returns. (a) Request a plot of the residuals against the explanatory variable and a normal quantile plot of the residuals. The residuals are somewhat nonnormal. In what way? It is hard to predict the accuracy of the usual t confidence interval for the slope β 1 of the population regression line. (b) Examine the shape and bias of the bootstrap distribution of the slope b 1 of the least-squares line. The distribution suggests that even the bootstrap t interval will be accurate. Why? (c) Give the standard t confidence interval for β 1 and also the BCa, bootstrap t, and bootstrap percentile 95% confidence intervals. What do you conclude about the accuracy of the two t intervals? Do the data provide evidence at the 5% level that the population slope β 1 is not 0? CHALLENGE 14.42 Continue your study of field measurements versus laboratory measurements of defects in the Trans-Alaska Oil Pipeline, begun in Exercise 14.38, by regressing field measurement result on laboratory measurement result. (a) Request a plot of the residuals against the explanatory variable and a normal quantile plot of the residuals. These plots suggest that inference based on the usual simple linear regression model (Chapter 10, page 638) may be inaccurate. Why?
Page 1 and 2: CHAPTER 14 Bootstrap Methods and Pe
Page 3 and 4: 14.1 The Bootstrap Idea 14-3 are su
Page 5 and 6: 14.1 The Bootstrap Idea 14-5 The bi
Page 7 and 8: 14.1 The Bootstrap Idea 14-7 Observ
Page 9 and 10: SRS of size n x - x - x - SRS of si
Page 11 and 12: Section 14.1 Summary 14-11 Number o
Page 13 and 14: 14.2 First Steps in Using the Boots
Page 23 and 24: Section 14.2 Exercises 14-23 800 Or
Page 25 and 26: Section 14.2 Exercises 14-25 (b) In
Page 27 and 28: 14.3 How Accurate Is a Bootstrap Di
Page 33 and 34: Section 14.3 Exercises 14-33 CAUTIO
Page 35 and 36: 14.4 Bootstrap Confidence Intervals
Page 43: Section 14.4 Exercises 14-43 (a) Ex
Page 47 and 48: 14.5 Significance Testing Using Per
Page 57 and 58: Section 14.5 Exercises 14-57 SECTIO
Page 59 and 60: Section 14.5 Exercises 14-59 that p
Page 61 and 62: Section 14.5 Exercises 14-61 14.56
Page 63 and 64: Chapter 14 Exercises 14-63 TABLE 14
Page 65 and 66: Chapter 14 Exercises 14-65 areas kn
Page 67 and 68: Chapter 14 Exercises 14-67 (b) Do a
Page 69 and 70: Chapter 14 Notes 14-69 CHAPTER 14 (

Chapter 14 - Bootstrap Methods and Permutation Tests - WH Freeman

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?