02.04.2013 Views

The Relationship Between The 1990 Census and ... - Census Bureau

The Relationship Between The 1990 Census and ... - Census Bureau

The Relationship Between The 1990 Census and ... - Census Bureau

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

“majority” choice in most cases when coders assigned different codes to the same records. So the first of<br />

the two codes assigned was used for initial analysis purposes.<br />

<strong>The</strong> selected <strong>1990</strong> <strong>and</strong> 2000 codes were then cross-tabulated against each other, creating a matrix of<br />

<strong>1990</strong>-by-2000-code combinations similar in concept to the cross-tabulation of codes assigned to the index<br />

lines described in the section above. <strong>The</strong> distributions of responses to these <strong>1990</strong>-by-2000 combinations<br />

were then compared to their <strong>1990</strong>-by-2000 counterparts created from the 2000 classified index.<br />

<strong>The</strong>se comparisons exposed the fact that coding errors can contaminate the analysis of changes in the<br />

labor force from the <strong>1990</strong> classifications to the 2000 systems. Remember that the index crosswalks<br />

provided a set of expected changes from <strong>1990</strong> to 2000. <strong>The</strong> distributions of actual responses, however,<br />

often did not match the expected distributions as measured by the index. Many combinations of <strong>1990</strong> <strong>and</strong><br />

2000 codes appearing in the <strong>1990</strong> <strong>Census</strong> Sample did not appear at all in the index crosswalk, <strong>and</strong> made<br />

no logical sense together.<br />

<strong>Census</strong> <strong>Bureau</strong> staff identified <strong>and</strong> examined many of the worst discrepancies between the two sets of<br />

distributions. Once again, this examination revealed errors in the index that caused coding errors, but were<br />

not caught in the original review of the index described above. Fortunately, this first review of the <strong>Census</strong><br />

Sample occurred before the <strong>Census</strong> 2000 processing, so the indexes could be repaired in time for the<br />

actual census. At the same time, these coding errors plus others that were not caused by the index were<br />

corrected in the <strong>1990</strong> <strong>Census</strong> Sample.<br />

After the census was completed, staff revisited the <strong>1990</strong> <strong>Census</strong> Sample <strong>and</strong> once again compared its<br />

<strong>1990</strong>-2000 distributions (after the revisions resulting from the first review) to those found in the index<br />

crosswalks. This comparison instigated a second review of the sample, particularly for those <strong>1990</strong> codes<br />

that showed the largest discrepancies between the sample <strong>and</strong> the index. Over 18,000 of the 146,000+<br />

records in sample were reviewed. This examination led to the correction of more coding errors, <strong>and</strong> to the<br />

deletion of records containing responses that were judged to be too vague to code accurately. Most of<br />

these deletions had no clear “majority” among the multiple codes assigned over the years since <strong>1990</strong>.<br />

Altogether, over 5300 records were corrected or deleted.<br />

Calculation of the Conversion Factors<br />

Because the index crosswalk represented the expected distribution of the <strong>1990</strong> categories into the 2000<br />

classifications, this source became the starting point for determining which <strong>1990</strong> to 2000 code<br />

combinations were “legitimate.” In other words, the starting assumption was that the combination of<br />

<strong>1990</strong>-2000 codes among the index titles ideally should drive the assignment of those codes to actual<br />

census responses.<br />

This assumption, therefore, initially eliminated records in the <strong>1990</strong> <strong>Census</strong> Sample that contained <strong>1990</strong>-<br />

2000 code combinations that did not appear in the index crosswalk. In other words, the corollary to this<br />

assumption was that any records in the <strong>1990</strong> <strong>Census</strong> Sample that did not contain a “legitimate” <strong>1990</strong>-2000<br />

code combination must contain coding errors that make the records ineligible for inclusion in the<br />

computation of conversion factors. <strong>The</strong> factors used to adjust or redistribute the <strong>1990</strong> tabulated data into<br />

the 2000 classifications should not make non-logical conversions based on errors in the sample that<br />

created those factors.<br />

12

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!