ST 412/512 – Midterm Exam w: Key (00) – Methods of Data Analysis II


St 412/512, D. Schafer, Spring 2000

EXAM

Your name: _____SOLUTIONS_____

Your lab time (Circle one): Tues. 8:00   Tues 11:00   Tues 2:00

This is a 50-minute open-book, open-notes test. Show work where appropriate.

Part A. Write the letter from the list below that best matches the description. (4 points each)

1. __T__ The quantity that least squares estimators minimize.
2. __R__ Another name for the estimate of σ² in regression analysis.
3. __S__ A residual divided by its estimated standard deviation.
4. __J__ An explanatory variable that takes on the value 1 or 0.
5. __F__ An inference about the regression in a region outside of the range of observed x's.
6. __W__ The distance, in standard errors, between an estimate and a hypothesized value.
7. __O__ An interval of plausible values of the response at specified explanatory variable values.
8. __M__ A test for checking model adequacy when there are replicate responses.
9. __Z__ A statistic used to identify problems of multicollinearity.
10. __D__ A statistic for comparing models with different sets of possible x's.

A. Calibration interval    J. Indicator             S. Studentized residual
B. Confidence interval     K. Interaction           T. Sum of squared residuals
C. Cook's Distance         L. Interpolation         U. Sum of squares due to regression
D. Cp statistic            M. Lack-of-fit F-test    V. t-ratio
E. Extra sum of squares    N. Leverage              W. t-statistic
F. Extrapolation           O. Prediction interval   X. Total sum of squares
G. Fitted value            P. Quadratic term        Y. Variance
H. F-statistic             Q. Residual              Z. Variance inflation factor
I. F-word                  R. Residual mean square
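Three of the quantities matched in Part A (items 1, 2 and 6) can be illustrated with a minimal sketch for the one-predictor case. The function name `simple_regression_summary` and the toy data are hypothetical, made up only for illustration; they are not from the exam.

```python
import math

def simple_regression_summary(x, y, hypothesized_slope=0.0):
    """Least squares fit of y on a single x, with a few Part A quantities."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

    # Least squares estimates: the values that minimize the sum of squared residuals.
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar

    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    sum_sq_resid = sum(r ** 2 for r in residuals)

    # Residual mean square: the estimate of sigma^2, using n - 2 degrees of freedom.
    resid_mean_sq = sum_sq_resid / (n - 2)

    # t-statistic: distance, in standard errors, between estimate and hypothesized value.
    se_slope = math.sqrt(resid_mean_sq / sxx)
    t_stat = (slope - hypothesized_slope) / se_slope

    return {
        "slope": slope,
        "intercept": intercept,
        "sum_sq_resid": sum_sq_resid,
        "resid_mean_sq": resid_mean_sq,
        "t_stat": t_stat,
    }

# Hypothetical toy data, not from the exam.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
summary = simple_regression_summary(x, y)
print(summary)
```

With more than one explanatory variable the degrees of freedom become n minus the number of regression coefficients, so the `n - 2` above applies only to this single-predictor sketch.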


Part B. Acorn data (questions worth 6 points each). Below is a plot of
    Y = log of the species range area (100×km²), versus
    X₁ = log of the acorn (seed) volume (cm³)
for species of oak trees from Atlantic and California regions. The species with the smallest Y (species #7) is only found on an island so its range area cannot possibly exceed the area of the island.

[Figure: scatterplot of log of Range Area (vertical axis, 4 to 10) versus Log of Acorn Size (horizontal axis, -1 to 3). Legend entries: Region (Atlantic, California); with #7, without #7. Species #7 is labeled on the plot.]

11. Based on the scatterplot, circle the letter for the one sentence that describes the probable influence of species #7.
a) The species is probably not influential.
b) The interactive effect of log acorn size and region (Atlantic or California) is substantially more significant when the species is included than when it is not included.
c) The interactive effect of log acorn size and region is substantially less significant when the species is included than when it is not included.


19. The following is the fit to model D (recall: Y = log area, X₁ = log acorn volume):
    µ(Y|X₁,X₂) = 5.7 + 2.8X₁ + .63X₂.
Which of the following is the best interpretation of the effect of acorn volume? SHOW WORK then circle the correct answer.

    2^2.8 = 7.0 (see p. 208)

a) Associated with each one cm³ increase in acorn volume is a 2.8 × 100 km² increase in the mean range area.
b) Associated with each one cm³ increase in acorn volume is a .63 × 100 km² increase in the mean range area.
c) Associated with each doubling in acorn volume is a 2.8 × 100 km² increase in the mean range area.
d) Associated with each doubling in acorn volume is a .63 × 100 km² increase in the mean range area.
e) Associated with each doubling in acorn volume is a 1.9 × 100 km² increase in the mean range area.
f) Associated with each doubling in acorn volume is a 7-fold change in the median range area.
g) Associated with each doubling in acorn volume is a 16-fold change in the median range area.

20. Suppose there are 3 regions. Let X₂ = 1 if a species is from the Atlantic region, and 0 otherwise; let X₃ = 1 if a species is from the midwest region and 0 otherwise; and let X₄ = 1 if a species is from the California region, and 0 otherwise. Write down a model that specifies 3 parallel lines for the regression of Y on X₁, using the midwest region as the "reference level."

    µ(Y|X₁,X₂,X₃,X₄) = β₀ + β₁X₁ + β₂X₂ + β₃X₄
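The work shown for question 19 (2 raised to the power 2.8) can be checked with a short sketch. It assumes, as the variable definitions state, that both the response and acorn volume are on the log scale, so doubling acorn volume adds log(2) to X₁ and multiplies the median range area by 2 to the power of the slope. The function name `doubling_multiplier` is hypothetical, introduced only for illustration.

```python
import math

def doubling_multiplier(slope: float) -> float:
    """Multiplicative change in the median response when X doubles (log-log model)."""
    # Doubling X adds log(2) to log(X), so the mean of log(Y) rises by slope * log(2).
    # Back-transforming gives a multiplier of exp(slope * log(2)) = 2 ** slope.
    return math.exp(slope * math.log(2.0))

# Slope on log acorn volume taken from the fitted model D in question 19.
multiplier = doubling_multiplier(2.8)
print(round(multiplier, 1))
```

Because the back-transformed mean of log(Y) estimates the median of Y, the multiplier describes a fold change in the median range area, not an additive change in the mean.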


POINT DISTRIBUTION SUMMARY
Lower quartile = 69.5
Median = 73
Upper quartile = 84

LETTER GRADE SCORING
A   84-100
A-  77-83
B+  70-76
B   60-69
B-  50-59
C+  40-49
C   35-39
