24.01.2013 Views

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 11<br />

Question 3:<br />

What is the feasibility and validity of truncated population estimation models <strong>in</strong><br />

<strong>in</strong>fectious disease surveillance?<br />

For priority sett<strong>in</strong>g, service plann<strong>in</strong>g and resource allocation it is necessary to know the<br />

number of persons <strong>in</strong> a targeted group. This number can also be used to assess the<br />

coverage of an <strong>in</strong>tervention. Often direct (enumeration) techniques are not feasible to<br />

estimate the size of hidden populations. Instead, <strong>in</strong>direct techniques such as capture<strong>recapture</strong><br />

analysis have to be used. Paradoxically, for hidden populations often the<br />

preferred three l<strong>in</strong>ked registers, allow<strong>in</strong>g for log-l<strong>in</strong>ear capture-<strong>recapture</strong> analysis <strong>in</strong> order<br />

to reduce bias <strong>in</strong> the estimates, are not available. As an alternative, truncated models,<br />

related to capture-<strong>recapture</strong> analysis, and applicable to frequency counts of observations<br />

of <strong>in</strong>dividuals <strong>in</strong> a s<strong>in</strong>gle source of <strong>in</strong>formation, are described <strong>in</strong> the literature. Two such<br />

truncated models are Zelterman’s Poisson mixture model and Chao’s heterogeneity<br />

model. 24-26 These models aim to estimate the number of unobserved persons <strong>in</strong> the<br />

(truncated) zero-frequency class based upon <strong>in</strong>formation from the lower observed<br />

frequency classes, assum<strong>in</strong>g a specific truncated distribution of the observed data, e.g.<br />

Poisson <strong>in</strong> Chao’s model. Observed frequency distributions may not be strictly Poisson<br />

and to relax this assumption Zelterman based his model on a Poisson mixture<br />

distribution, allegedly allow<strong>in</strong>g greater flexibility and applicability on real life data. The<br />

validity of truncated model estimates depends on the possible violation of the underly<strong>in</strong>g<br />

assumptions, similar to capture-<strong>recapture</strong> analysis as described earlier, although the<br />

<strong>in</strong>dependent registers assumption is replaced by the constant (re)observation probability<br />

assumption when us<strong>in</strong>g a s<strong>in</strong>gle data source. In addition, equiprobability (i.e. equal<br />

ascerta<strong>in</strong>ment probabilities of all registers) should be assumed when us<strong>in</strong>g multiple<br />

sources. This violation could be as much, possibly even more, as for capture-<strong>recapture</strong><br />

analysis. 11<br />

We estimated the coverage of a tuberculosis control <strong>in</strong>tervention, a targeted<br />

mobile tuberculosis screen<strong>in</strong>g programme among illicit drug users and homeless persons<br />

<strong>in</strong> Rotterdam. Application of truncated models was feasible because this screen<strong>in</strong>g<br />

programme uses a s<strong>in</strong>gle register. Although capture-<strong>recapture</strong> techniques for estimat<strong>in</strong>g<br />

the size of a population from a s<strong>in</strong>gle register have been described occasionally, 27-29 for<br />

feasibility one prefers the simplest technique with almost similar assumptions. We could<br />

extract, check and prepare the required data from the exist<strong>in</strong>g rout<strong>in</strong>e dataset <strong>in</strong> two days<br />

and calculate the po<strong>in</strong>t estimates on a pocket calculator. Violation of the perfect recordl<strong>in</strong>kage<br />

assumption is considered m<strong>in</strong>imal because of good computerised and visual<br />

identification of the clients <strong>in</strong> the screen<strong>in</strong>g programme. However, the closed population<br />

assumption is violated because every year a substantial number of people not previously<br />

screened enter the programme, result<strong>in</strong>g <strong>in</strong> under-estimation of the coverage. We cannot<br />

exclude heterogeneity among <strong>in</strong>dividuals belong<strong>in</strong>g to the target group of the screen<strong>in</strong>g<br />

programme but this could cause limited bias <strong>in</strong> the model estimates. Truncated models<br />

are arguably more robust to violation of the homogeneity assumption s<strong>in</strong>ce they are partly<br />

based upon the lower frequency classes, assumed to have more resemblance to the zero<br />

frequency class. The constant (re)observation probability assumption will be violated as<br />

well to some extent but this effect could be limited due to the nature and organisation of<br />

168

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!