27.12.2012 Views

The 1995/1996 Household Income, Expenditure - (PDF, 101 mb ...

The 1995/1996 Household Income, Expenditure - (PDF, 101 mb ...

The 1995/1996 Household Income, Expenditure - (PDF, 101 mb ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Appendix LA Comparing Estimates from Interpenetrating Samples<br />

To illustrate the use of half sample comparison, consider the nu<strong>mb</strong>er of<br />

persons per household in the first half versus the nu<strong>mb</strong>er of persons per<br />

household in the second half. <strong>The</strong> null hypothesis is a significant difference<br />

between these means, and the t values for a paired difference experiment<br />

allows us to draw confidence bounds around the alternative.<br />

Average <strong>Household</strong> Size Estimates:<br />

1. First half-sample: 5.04954, N = 7347, STD = 2.53449<br />

2. Second half-sample: 4.93966, N = 7458, STO = 2.45973<br />

t =<br />

(5.04954 - 4.93966)<br />

(2.53449)2 + (2.45973)2<br />

7347 7458<br />

<strong>The</strong> calculated t value is 2.68 and we may reject the null hypothesis of<br />

equality of the two means at a 95% level of confidence. <strong>The</strong> inference is that<br />

average household size in the second half sample is statistically smaller, but<br />

not by much, because the difference between the two means is very small.<br />

A different point to note, not merely illustrative of the use of<br />

interpenetrating samples to test hypotheses, is in the use of the standard<br />

deviation for the means, as created by SAS. SAS assumes that the estimate<br />

comes from a simple random sample design. For the full sample of 14805<br />

households from the recent survey, SAS produces an estimate of 4.9942 as the<br />

mean household size, with standard deviation 2.4976. <strong>The</strong> implied 95%<br />

confidence interval is therefore ± .0402. However, as mentioned, the HIECS<br />

households are sampled under a stratified, two stage sampling scheme with<br />

probability proportional to size without replacement in the first stage and<br />

simple random sampling without replacement in the second stage. Technically,<br />

this means that the between- and within-PSU components of the variance of<br />

any estimate must be accounted for. In most cases, then, estimates which<br />

account for this component of the total variance will be higher than those<br />

which assume a simple random sample design. CENVAR software (see below),<br />

which does take into account both variance components, produces an<br />

(unweighted) estimate of 4.963 for the mean household size ± .055. Cochran<br />

(1977) is a good source for this issue. In cases of critical household<br />

estimates. where possible, the asymptotic measures reported here will be the<br />

more conservative ones produced by CENVAR.<br />

Appendix LB Functions of IMPS Components<br />

1) CENTRACK handled the movement and transmittal of survey forms<br />

between the various processing phases - receipt, entry, verification (or<br />

L34

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!