24.01.2013 Views

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

Rob van Hest Capture-recapture Methods in Surveillance - RePub ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 1<br />

1.1 Assess<strong>in</strong>g completeness of ascerta<strong>in</strong>ment <strong>in</strong> epidemiology<br />

Epidemiology is the study of how often diseases occur <strong>in</strong> different groups of people and<br />

why. 1 This <strong>in</strong>cludes knowledge about classification errors, i.e. the absence of true cases<br />

and the presence of false-positive cases <strong>in</strong> registrations. Observ<strong>in</strong>g and monitor<strong>in</strong>g health<br />

and behaviour trends requires a surveillance system that captures useful data on those<br />

persons correctly identified with the characteristic under study. This <strong>in</strong>formation can be<br />

used to identify priorities and evaluate <strong>in</strong>terventions. To determ<strong>in</strong>e the usefulness of a<br />

surveillance system there is the need to assess the quality of the data and completeness of<br />

ascerta<strong>in</strong>ment. 2 The number of <strong>in</strong>dividuals with a certa<strong>in</strong> condition (i.e. cases) or events <strong>in</strong><br />

a population can be ascerta<strong>in</strong>ed directly, by count<strong>in</strong>g every s<strong>in</strong>gle person or event as<br />

attempted <strong>in</strong> a census, or <strong>in</strong>directly, by obta<strong>in</strong><strong>in</strong>g sufficient <strong>in</strong>formation to estimate<br />

prevalence (i.e. the number of cases at a specific po<strong>in</strong>t <strong>in</strong> time) or <strong>in</strong>cidence (i.e. the<br />

number of new cases dur<strong>in</strong>g a specific period of time), as attempted <strong>in</strong> a survey (active<br />

case-f<strong>in</strong>d<strong>in</strong>g) or by notification (passive case-f<strong>in</strong>d<strong>in</strong>g). Other examples of <strong>in</strong>direct<br />

ascerta<strong>in</strong>ment of the number of cases <strong>in</strong> a population are pharmaco-epidemiolocal studies<br />

and record-l<strong>in</strong>kage, i.e. compar<strong>in</strong>g patient data across multiple registers. It is difficult to<br />

establish whether these counts are complete or biased to under-ascerta<strong>in</strong>ment and only<br />

on a few occasions it is attempted to estimate or adjust for miss<strong>in</strong>g cases. 3,4 An <strong>in</strong>direct<br />

technique that estimates completeness of ascerta<strong>in</strong>ment of surveys and registers used <strong>in</strong><br />

epidemiological studies is capture-<strong>recapture</strong> analysis. 5,6<br />

1.2 Brief <strong>in</strong>troduction to capture-<strong>recapture</strong> analysis<br />

A more extensive overview of the historical development of capture-<strong>recapture</strong> analysis is<br />

given elsewhere. 5 Briefly, the first use of capture-<strong>recapture</strong> analysis can be traced back to<br />

Graunt who used a similar method for estimat<strong>in</strong>g the population of England as early as<br />

1662 7 and Laplace, who attempted to estimate the population size <strong>in</strong> France <strong>in</strong> 1782, 8 but<br />

usually it is mentioned that capture-<strong>recapture</strong> analysis was first applied by Petersen <strong>in</strong><br />

1894 for the study of fish populations. He used the so-called two-sample method, the<br />

simplest capture-<strong>recapture</strong> model, to estimate the unknown size of a population of plaice<br />

<strong>in</strong> the Limfjord <strong>in</strong> Denmark. 9 The first sample provides the animals for mark<strong>in</strong>g or<br />

tagg<strong>in</strong>g and is returned to the population, while the second sample provides the<br />

<strong>recapture</strong>s, i.e. the numbers of animals caught <strong>in</strong> both samples. Us<strong>in</strong>g the number of<br />

<strong>recapture</strong>s and the number of animals caught <strong>in</strong> the first and the second sample, it is<br />

possible, under certa<strong>in</strong> assumptions, to estimate the number not caught <strong>in</strong> either sample,<br />

thus provid<strong>in</strong>g an estimate of the total population size. Two-sample capture-<strong>recapture</strong><br />

analysis was extended to multiple-sample capture-<strong>recapture</strong> analysis by Schnabel <strong>in</strong><br />

1938. 10 Unmarked animals <strong>in</strong> each sample are given <strong>in</strong>dividual (i.e. numbered) marks<br />

before be<strong>in</strong>g returned to the population, result<strong>in</strong>g <strong>in</strong> a known capture history of each<br />

marked animal. The theory of capture-<strong>recapture</strong> models was developed more fully <strong>in</strong> the<br />

1950's, for example by Chapman, 11 who suggested an adjustment of the capture-<strong>recapture</strong><br />

estimate to reduce small sample bias, known as the “Nearly Unbiased Estimator”, and<br />

Darroch, 12 who founded the mathematical framework. To tackle the problem of violation<br />

of the underly<strong>in</strong>g assumptions, with<strong>in</strong> animal population biology a range of different<br />

10

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!