08.03.2014 Views

Variance Estimation for the General Regression Estimator

Variance Estimation for the General Regression Estimator

Variance Estimation for the General Regression Estimator

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

1<br />

ABSTRACT<br />

A variety of estimators of <strong>the</strong> variance of <strong>the</strong> general regression (GREG) estimator of a<br />

mean have been proposed in <strong>the</strong> sampling literature, mainly with <strong>the</strong> goal of estimating <strong>the</strong><br />

design-based variance. <strong>Estimator</strong>s can be easily constructed that, under certain conditions, are<br />

approximately unbiased <strong>for</strong> both <strong>the</strong> design-variance and <strong>the</strong> model-variance. Several dualpurpose<br />

estimators are studied here in single-stage sampling. These choices are robust<br />

estimators of a model-variance even if <strong>the</strong> model that motivates <strong>the</strong> GREG has an incorrect<br />

variance parameter.<br />

A key feature of <strong>the</strong> robust estimators is <strong>the</strong> adjustment of squared residuals by factors<br />

analogous to <strong>the</strong> leverages used in standard regression analysis. We also show that <strong>the</strong> deleteone<br />

jackknife implicitly includes <strong>the</strong> leverage adjustments and is a good choice from ei<strong>the</strong>r <strong>the</strong><br />

design-based or model-based perspective. In a set of simulations, <strong>the</strong>se variance estimators have<br />

small bias and produce confidence intervals with near-nominal coverage rates <strong>for</strong> several<br />

sampling methods, sample sizes, and populations in single-stage sampling.<br />

We also present simulation results <strong>for</strong> a skewed population where all variance estimators<br />

per<strong>for</strong>m poorly. Samples that do not adequately represent <strong>the</strong> units with large values lead to<br />

estimated means that are too small, variance estimates that are too small, and confidence<br />

intervals that cover at far less than <strong>the</strong> nominal rate. These defects need to be avoided at <strong>the</strong><br />

design stage by selecting samples that cover <strong>the</strong> extreme units well. However, in populations<br />

with inadequate design in<strong>for</strong>mation this will not be feasible.<br />

KEY WORDS: Confidence interval coverage; Hat matrix; Jackknife; Leverage;<br />

Model unbiased; Skewness

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!