Rob van Hest Capture-recapture Methods in Surveillance - RePub ...
Rob van Hest Capture-recapture Methods in Surveillance - RePub ...
Rob van Hest Capture-recapture Methods in Surveillance - RePub ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Chapter 1<br />
1.1 Assess<strong>in</strong>g completeness of ascerta<strong>in</strong>ment <strong>in</strong> epidemiology<br />
Epidemiology is the study of how often diseases occur <strong>in</strong> different groups of people and<br />
why. 1 This <strong>in</strong>cludes knowledge about classification errors, i.e. the absence of true cases<br />
and the presence of false-positive cases <strong>in</strong> registrations. Observ<strong>in</strong>g and monitor<strong>in</strong>g health<br />
and behaviour trends requires a surveillance system that captures useful data on those<br />
persons correctly identified with the characteristic under study. This <strong>in</strong>formation can be<br />
used to identify priorities and evaluate <strong>in</strong>terventions. To determ<strong>in</strong>e the usefulness of a<br />
surveillance system there is the need to assess the quality of the data and completeness of<br />
ascerta<strong>in</strong>ment. 2 The number of <strong>in</strong>dividuals with a certa<strong>in</strong> condition (i.e. cases) or events <strong>in</strong><br />
a population can be ascerta<strong>in</strong>ed directly, by count<strong>in</strong>g every s<strong>in</strong>gle person or event as<br />
attempted <strong>in</strong> a census, or <strong>in</strong>directly, by obta<strong>in</strong><strong>in</strong>g sufficient <strong>in</strong>formation to estimate<br />
prevalence (i.e. the number of cases at a specific po<strong>in</strong>t <strong>in</strong> time) or <strong>in</strong>cidence (i.e. the<br />
number of new cases dur<strong>in</strong>g a specific period of time), as attempted <strong>in</strong> a survey (active<br />
case-f<strong>in</strong>d<strong>in</strong>g) or by notification (passive case-f<strong>in</strong>d<strong>in</strong>g). Other examples of <strong>in</strong>direct<br />
ascerta<strong>in</strong>ment of the number of cases <strong>in</strong> a population are pharmaco-epidemiolocal studies<br />
and record-l<strong>in</strong>kage, i.e. compar<strong>in</strong>g patient data across multiple registers. It is difficult to<br />
establish whether these counts are complete or biased to under-ascerta<strong>in</strong>ment and only<br />
on a few occasions it is attempted to estimate or adjust for miss<strong>in</strong>g cases. 3,4 An <strong>in</strong>direct<br />
technique that estimates completeness of ascerta<strong>in</strong>ment of surveys and registers used <strong>in</strong><br />
epidemiological studies is capture-<strong>recapture</strong> analysis. 5,6<br />
1.2 Brief <strong>in</strong>troduction to capture-<strong>recapture</strong> analysis<br />
A more extensive overview of the historical development of capture-<strong>recapture</strong> analysis is<br />
given elsewhere. 5 Briefly, the first use of capture-<strong>recapture</strong> analysis can be traced back to<br />
Graunt who used a similar method for estimat<strong>in</strong>g the population of England as early as<br />
1662 7 and Laplace, who attempted to estimate the population size <strong>in</strong> France <strong>in</strong> 1782, 8 but<br />
usually it is mentioned that capture-<strong>recapture</strong> analysis was first applied by Petersen <strong>in</strong><br />
1894 for the study of fish populations. He used the so-called two-sample method, the<br />
simplest capture-<strong>recapture</strong> model, to estimate the unknown size of a population of plaice<br />
<strong>in</strong> the Limfjord <strong>in</strong> Denmark. 9 The first sample provides the animals for mark<strong>in</strong>g or<br />
tagg<strong>in</strong>g and is returned to the population, while the second sample provides the<br />
<strong>recapture</strong>s, i.e. the numbers of animals caught <strong>in</strong> both samples. Us<strong>in</strong>g the number of<br />
<strong>recapture</strong>s and the number of animals caught <strong>in</strong> the first and the second sample, it is<br />
possible, under certa<strong>in</strong> assumptions, to estimate the number not caught <strong>in</strong> either sample,<br />
thus provid<strong>in</strong>g an estimate of the total population size. Two-sample capture-<strong>recapture</strong><br />
analysis was extended to multiple-sample capture-<strong>recapture</strong> analysis by Schnabel <strong>in</strong><br />
1938. 10 Unmarked animals <strong>in</strong> each sample are given <strong>in</strong>dividual (i.e. numbered) marks<br />
before be<strong>in</strong>g returned to the population, result<strong>in</strong>g <strong>in</strong> a known capture history of each<br />
marked animal. The theory of capture-<strong>recapture</strong> models was developed more fully <strong>in</strong> the<br />
1950's, for example by Chapman, 11 who suggested an adjustment of the capture-<strong>recapture</strong><br />
estimate to reduce small sample bias, known as the “Nearly Unbiased Estimator”, and<br />
Darroch, 12 who founded the mathematical framework. To tackle the problem of violation<br />
of the underly<strong>in</strong>g assumptions, with<strong>in</strong> animal population biology a range of different<br />
10