Variance Estimation for the General Regression Estimator

More documents

Recommendations

Info

1 ABSTRACT A variety of estimators of the variance of the general regression (GREG) estimator of a mean have been proposed in the sampling literature, mainly with the goal of estimating the design-based variance. Estimators can be easily constructed that, under certain conditions, are approximately unbiased for both the design-variance and the model-variance. Several dualpurpose estimators are studied here in single-stage sampling. These choices are robust estimators of a model-variance even if the model that motivates the GREG has an incorrect variance parameter. A key feature of the robust estimators is the adjustment of squared residuals by factors analogous to the leverages used in standard regression analysis. We also show that the deleteone jackknife implicitly includes the leverage adjustments and is a good choice from either the design-based or model-based perspective. In a set of simulations, these variance estimators have small bias and produce confidence intervals with near-nominal coverage rates for several sampling methods, sample sizes, and populations in single-stage sampling. We also present simulation results for a skewed population where all variance estimators perform poorly. Samples that do not adequately represent the units with large values lead to estimated means that are too small, variance estimates that are too small, and confidence intervals that cover at far less than the nominal rate. These defects need to be avoided at the design stage by selecting samples that cover the extreme units well. However, in populations with inadequate design information this will not be feasible. KEY WORDS: Confidence interval coverage; Hat matrix; Jackknife; Leverage; Model unbiased; Skewness
1 1. Introduction Robust variance estimation is a key consideration in the prediction approach to finite population sampling. Valliant, Dorfman, and Royall (2000) synthesize much of the model-based literature. In that approach, a working model is formulated that is used to construct a point estimator of a mean or total. Variance estimators are created that are robust in the sense of being approximately model-unbiased and consistent for the model-variance even when the variance specification in the working model is incorrect. In this paper, that approach is extended to the general regression estimator (GREG) to construct variance estimators that are approximately model-unbiased but are also approximately design-unbiased in single-stage sampling. A number of alternatives are compared including the jackknife and some variants of the jackknife. We will use a particular class of linear models along with Bernoulli or Poisson sampling as motivation for the variance estimators. However, some of these estimators can often be successfully applied in practice to single-stage designs where selections are not independent. Associated with each unit in the population is a target variable Y i and a p-vector of auxiliary variables i = ( i1, , ip) x x K x ′ where i = 1, K , N . The population vector of totals of the auxiliaries is x = ( x1, , xp) T T K T ′ where T xk =∑ x , k = 1, K , p. The general regression N i= 1 ki estimator, defined below, is motivated by a linear model in which the Y’s are independent random variables with E var ( Y ) ( Y ) M i i = xβ ′ . (1.1) = v M i i In most situations (1.1) is a “working” model that is likely to be incorrect to some degree.
Page 1: Variance Estimation for the General
Page 5 and 6: 3 based and model-based interpretat
Page 7 and 8: 5 3.5, for a more detailed descript
Page 9 and 10: 7 Lemma 1. Assume that (i) and (ii)
Page 11 and 12: 9 When the selection probability of
Page 13 and 14: 11 −1 −1 −1 −1 si si X′ s
Page 15 and 16: 13 than the other variance estimato
Page 17 and 18: 15 hours worked. A constant model-v
Page 19 and 20: 17 use the leverage adjustments but
Page 21 and 22: 19 variance estimates, conditional
Page 23 and 24: 21 ACKNOWLEDGMENT The author is ind
Page 25 and 26: 23 REFERENCES BELSLEY, D.A., KUH, E
Page 27 and 28: 25 STUKEL, D., HIDIROGLOU, M.A., AN
Page 29 and 30: 27 Table 1. Relative biases and roo
Page 31 and 32: 29 Table 3. 95% confidence interval
Page 33 and 34: 31 Table 5. 95% confidence interval
Page 36: srs n = 50 Figure 3 pps n = 50 -10

Variance Estimation for the General Regression Estimator

Create successful ePaper yourself

Delete template?

Save as template?