24.01.2013 Views

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 10<br />

such studies should be expected. Extract<strong>in</strong>g 2 x 2 tables ignores possible conditional<br />

dependence confound<strong>in</strong>g the results thus obta<strong>in</strong>ed. For studies 4 and 11 the log-l<strong>in</strong>ear<br />

models <strong>in</strong>cluded one respectively two <strong>in</strong>teraction terms for pair-wise dependencies, which<br />

may expla<strong>in</strong> the underestimation <strong>in</strong> the Petersen and truncated estimators. We therefore<br />

also validated the two studies with <strong>in</strong>dependent log-l<strong>in</strong>ear models (studies 1 and 2). We<br />

took register 2 as the complete set for study 1 and register 3 as the complete set for study<br />

2. For study 1 there were 73 “unlisted” cases. The Petersen estimator, 43, is a little low,<br />

but the truncated b<strong>in</strong>omial estimator, at 201, is too high (Zelterman and Chao models<br />

estimates are 397 and 401, respectively). The discrepant (over)estimate by the truncated<br />

models can be expla<strong>in</strong>ed by the different coverages of registers 1 and 3, i.e. violation of<br />

the equiprobability assumption. In study 2 the coefficient of variation was low and the<br />

coverage of registers 1 and 2 similar. For study 2 there were 22 “unlisted” cases. The<br />

Petersen estimator and the truncated b<strong>in</strong>omial estimator are both 25 and similar to the<br />

known “unlisted” number, expla<strong>in</strong>ed by almost absent violation of both the <strong>in</strong>dependent<br />

sources and equiprobability assumptions. The Zelterman and Chao models estimates are<br />

43 and 51 respectively and the discrepancy with the truncated b<strong>in</strong>omial model estimate<br />

can be expla<strong>in</strong>ed by violation of the “<strong>in</strong>f<strong>in</strong>ite number of sources” assumption.<br />

Alternative models<br />

As an alternative to log-l<strong>in</strong>ear capture-<strong>recapture</strong> models a structural source model has<br />

been proposed. 36 Whereas log-l<strong>in</strong>ear models only partly identify and <strong>in</strong>corporate<br />

dependencies between registers, the structural source model models potential<br />

<strong>in</strong>terdependencies of the registers and heterogeneity of the population, partly based on<br />

prior knowledge, and estimates the probabilities of conditions that produce these<br />

<strong>in</strong>teractions between the registers. However, the published data of the capture-<strong>recapture</strong><br />

studies were <strong>in</strong>sufficient to re-exam<strong>in</strong>e these studies with a structural source model.<br />

Conclusion<br />

We have <strong>in</strong>dicated conditions where estimates of <strong>in</strong>fectious disease <strong>in</strong>cidence from logl<strong>in</strong>ear<br />

models are similar or dissimilar to alternative truncated models for <strong>in</strong>complete<br />

count data. Our results suggest that for estimat<strong>in</strong>g <strong>in</strong>fectious disease <strong>in</strong>cidence and<br />

completeness of notification <strong>in</strong>dependent and parsimonious three-source log-l<strong>in</strong>ear<br />

capture–<strong>recapture</strong> models are preferable. When saturated models are selected as best-<br />

fitt<strong>in</strong>g model and the estimates are unexpectedly high and seem implausible, first, the data<br />

should be re-exam<strong>in</strong>ed with truncated models as a heuristic tool, <strong>in</strong> the absence of a gold<br />

standard, to identify possible failure <strong>in</strong> the saturated log-l<strong>in</strong>ear model when the truncated<br />

models produce a lower estimated number of <strong>in</strong>fectious disease patients. Second, <strong>in</strong> case<br />

of such discrepancy between the log-l<strong>in</strong>ear and the truncated model estimates, the data<br />

should be re-exam<strong>in</strong>ed for possible violation of the underly<strong>in</strong>g capture-<strong>recapture</strong><br />

assumptions, such as imperfect record-l<strong>in</strong>kage, false-positive records or heterogeneity,<br />

corrected and the capture-<strong>recapture</strong> analysis repeated on the corrected data. When after<br />

repeated capture-<strong>recapture</strong> analysis the discrepancy between the log-l<strong>in</strong>ear and the<br />

truncated model estimates rema<strong>in</strong>s or no violation of the underly<strong>in</strong>g assumptions can be<br />

identified, the <strong>in</strong>vestigator should be cautious about us<strong>in</strong>g the associated outcome. 10<br />

Us<strong>in</strong>g truncated model estimates as an early alert could prevent flawed capture-<strong>recapture</strong><br />

152

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!