08.12.2012 Views

Scientific Concept of the National Cohort (status ... - Nationale Kohorte

Scientific Concept of the National Cohort (status ... - Nationale Kohorte

Scientific Concept of the National Cohort (status ... - Nationale Kohorte

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

A.6 Planned statistical analyses and statistical power considerations<br />

A.6 planned statistical analyses and statistical power<br />

considerations<br />

A.6.1 General issues and structure <strong>of</strong> <strong>the</strong> section<br />

As stated in Sect. A.1, <strong>the</strong> major specific aims for <strong>the</strong> <strong>National</strong> <strong>Cohort</strong> include identifying<br />

lifestyle, nutritional, occupational, environmental, (pre)clinical, and (epi)genetic risk factors<br />

for <strong>the</strong> development <strong>of</strong> chronic diseases and aging-related declines in health and determinants<br />

<strong>of</strong> <strong>the</strong> progression <strong>of</strong> early-stage (preclinical) morbidity to overt clinical-stage disease<br />

and/or related losses <strong>of</strong> autonomy. A related aim is to develop risk prediction models and<br />

algorithms that may help identify individuals at increased risk <strong>of</strong> developing major chronic<br />

diseases. These latter models may include questionnaire assessments <strong>of</strong> risk factors, as<br />

well as genetic and o<strong>the</strong>r biological markers, and clinical examination data. For each <strong>of</strong><br />

<strong>the</strong>se objectives, quantitative risk associations – in terms <strong>of</strong> relative and attributable risk<br />

– will be estimated for a range <strong>of</strong> chronic disease outcomes. Finally, it is also anticipated<br />

that <strong>the</strong> cohort will be used to evaluate candidate blood or urine-based markers or clinical<br />

examinations (e.g., imaging) for early detection <strong>of</strong> disease.<br />

The design <strong>of</strong> <strong>the</strong> <strong>National</strong> <strong>Cohort</strong> as a large-scale, population-based prospective cohort<br />

study guides <strong>the</strong> general strategy for planned statistical analyses. This has a number <strong>of</strong><br />

direct implications:<br />

� Whenever indicated and feasible, and contingent upon <strong>the</strong> exposure or risk factor<br />

information collected, statistical analyses will be based on data from <strong>the</strong> full cohort. If<br />

a study question <strong>of</strong> interest is only relevant for a certain subpopulation, relevant subcohorts<br />

will be considered.<br />

� This study does not have one single hypo<strong>the</strong>sis or even a limited number <strong>of</strong> hypo<strong>the</strong>ses<br />

that can be taken as a single basis for confirmatory statistical testing and for<br />

sample size considerations and power calculations. Ra<strong>the</strong>r, research questions as<br />

outlined in Sects. A.1 and A.2 have been defined from diverse areas in epidemiology.<br />

A wide range <strong>of</strong> scientific questions and related statistical analyses follow from <strong>the</strong>se two<br />

general statements, which are summarized in Sect. A.6.2.<br />

To structure <strong>the</strong> statistical models used for <strong>the</strong> <strong>National</strong> <strong>Cohort</strong>, we need to account for<br />

<strong>the</strong> different types <strong>of</strong> outcome that will be observed during study follow-up. Thus, we have<br />

outlined a possible series <strong>of</strong> general types <strong>of</strong> analysis and a list <strong>of</strong> selected approaches,<br />

without any claim <strong>of</strong> completeness, in Sect. A.6.2.1.<br />

Within this general model (by type <strong>of</strong> outcome), <strong>the</strong> risk factors or intermediate factors<br />

presenting along <strong>the</strong> pathway from exposure to a health event will be related using a wide<br />

range <strong>of</strong> specialized models. Therefore, <strong>the</strong> directional and undirectional association and<br />

correlation structures between <strong>the</strong> different exposure variables must be incorporated in<br />

statistical analyses and <strong>the</strong> general models must be adapted accordingly. Models for exposures<br />

with a clustered structure such as environmental exposures, which require (variance)<br />

component models, for repeated measurements, and for missing values (e.g., due to losses<br />

by follow-up) must also be adapted. On <strong>the</strong> basis <strong>of</strong> <strong>the</strong> general models, some <strong>of</strong> <strong>the</strong>se<br />

model adaptations are <strong>the</strong>refore described in Sect. A.6.2.3.<br />

Calculations for <strong>the</strong> expected numbers <strong>of</strong> incident cases <strong>of</strong> several relevant diseases are<br />

outlined in Sect. A.6.3. These calculations are based on <strong>the</strong> source population presented in<br />

Sect. A.3.1. The numbers, combined with <strong>the</strong> aforementioned statistical methods, serve as<br />

examples for <strong>the</strong> general power calculations <strong>of</strong> <strong>the</strong> cohort <strong>of</strong> 200,000.<br />

169<br />

A.6

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!