29.07.2014 Views

Practical Considerations in Raking Survey Data

Practical Considerations in Raking Survey Data

Practical Considerations in Raking Survey Data

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

trimm<strong>in</strong>g high weight values one generally lowers sampl<strong>in</strong>g variability but may <strong>in</strong>cur some bias.<br />

The MSE will be lower if the reduction <strong>in</strong> variance is large relative to the <strong>in</strong>crease <strong>in</strong> bias aris<strong>in</strong>g<br />

from weight trimm<strong>in</strong>g. There are no established rules for weight trimm<strong>in</strong>g; rather most people use<br />

a general set of guidel<strong>in</strong>es. Some common truncation po<strong>in</strong>ts are: 1) the median weight plus five or<br />

six times the <strong>in</strong>terquartile range (IQR) of the weights, 2) five times the mean weight, 3) the 95 th<br />

percentile of the weights.<br />

How can weight trimm<strong>in</strong>g be <strong>in</strong>corporated <strong>in</strong> rak<strong>in</strong>g? The IHB SAS macro can be used for weight<br />

trimm<strong>in</strong>g <strong>in</strong> the follow<strong>in</strong>g steps (us<strong>in</strong>g as an example the median weight plus six times the IQR as<br />

the truncation po<strong>in</strong>t) 1 :<br />

1. Prior to rak<strong>in</strong>g i, where i references the number of times the rak<strong>in</strong>g is run, exam<strong>in</strong>e the<br />

distribution of the rak<strong>in</strong>g “<strong>in</strong>put” weight and calculate the median weight plus six times the<br />

<strong>in</strong>terquartile (IQR) range of the weights.<br />

2. Truncate values of the <strong>in</strong>put weight that are above the median weight plus six times the<br />

IQR plus one to the median weight plus six times the IQR (values at or below the median<br />

weight plus six times the IQR plus one are not altered).<br />

3. Us<strong>in</strong>g the truncated <strong>in</strong>put weight, run the rak<strong>in</strong>g to obta<strong>in</strong> rak<strong>in</strong>g weight i.<br />

4. Repeat Steps 1 to 3 (i.e., run the rak<strong>in</strong>g a second time, third time, etc.) until there are no<br />

weights that are above the median weight plus six times the IQR plus one.<br />

Although the cutoff value equals the median weight plus six times the IQR, weights that exceed the<br />

median weight plus six times the IQR plus one are truncated to the median weight plus six times<br />

1 A somewhat more sophisticated, but computer <strong>in</strong>tensive, procedure is to apply bounds to the weights as the<br />

rak<strong>in</strong>g is tak<strong>in</strong>g place.<br />

23

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!