08.12.2012 Views

Scientific Concept of the National Cohort (status ... - Nationale Kohorte

Scientific Concept of the National Cohort (status ... - Nationale Kohorte

Scientific Concept of the National Cohort (status ... - Nationale Kohorte

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

A.6 Planned statistical analyses and statistical power considerations<br />

Longer-term replicate risk factor assessments and measurements errors<br />

For many specific exposure types, short-term variations (e.g., a 1-year interval) may reasonably<br />

be considered “random” variations around average exposure level at that time point<br />

(or within a comparatively short time window around that time point) that are nondifferential<br />

with respect to disease outcome. Over longer time intervals, however, this assumption may<br />

progressively lose validity and risk <strong>of</strong> disease may show stronger associations with risk factors<br />

assessed at a more recent time before diagnosis. For <strong>the</strong> latter type <strong>of</strong> situation, Frost<br />

and white (2005) 742 also demonstrated that a regression calibration approach to correct<br />

for (presumed “random”) errors in baseline measurements, using replicate measurements<br />

taken over long time intervals, may overcorrect and lead to biased RR estimates.<br />

Frost and White (2005) 742 proposed a basic longitudinal (“life course”) modeling approach<br />

�1 �<br />

that provides 193 a useful, basic framework � � �for � <strong>the</strong> statistical modeling <strong>of</strong> disease risks in rela-<br />

b � � w � �b<br />

�<br />

tion to long-term repeat measurements in prospective cohort studies. Following that approach,<br />

a simple statistical model for <strong>the</strong> (first) two risk factor/exposure measurements<br />

planned within <strong>the</strong> <strong>National</strong> <strong>Cohort</strong> is:<br />

194 log( risk) � � � �1<br />

xi1<br />

� � 2 xi2<br />

,<br />

�1 �<br />

where, for individual 193 i, x is <strong>the</strong> centered error-free (true) risk factor level in <strong>the</strong> latest index<br />

i2 � � �� b � � w � �b<br />

�<br />

period <strong>of</strong> risk factor assessment, and x is <strong>the</strong> true �1risk factor �<br />

193 � �<br />

level in first (initial recruitment)<br />

i1 �� b � � w � �b<br />

�<br />

period. In this basic model, coefficient β can be interpreted as <strong>the</strong> “history-adjusted” (log)<br />

2 194 log( odds) � � � ( �<br />

RR (or log-odds ratio) associated with an 1 � �<br />

error-free 2 ) xi1<br />

� �<br />

risk 2 ( xifactor<br />

2 � xi1)<br />

(which may be expressed for<br />

a 1-unit increase), 194 adjusted for all log( past risk error-free ) � � � levels. � The history-adjusted current as-<br />

1 xi1<br />

� � 2 xi2<br />

sociation describes <strong>the</strong> short-term, potentially modifiable risk, measuring <strong>the</strong> difference in<br />

risk between<br />

194<br />

two subjects who log( risk have ) � different � � �1<br />

xrisk<br />

i1<br />

�factor<br />

� 2 xi2<br />

levels now but had identical levels<br />

in <strong>the</strong> 194 past. With just two exposure log( riskperiods, ) � � � <strong>the</strong> � L �wmodel 1xi1<br />

� wcan<br />

2x<br />

i2be<br />

� rewritten as<br />

194 log( odds) � � � ( �1<br />

� �2<br />

) xi1<br />

� �2<br />

( xi<br />

2 � xi1)<br />

,<br />

which 194 indicates that log( <strong>the</strong> odds coefficient ) � � � in ( �1<strong>the</strong><br />

� above �2<br />

) xi1<br />

�models<br />

�2<br />

( xi<br />

2 �actually<br />

xi1)<br />

estimates <strong>the</strong> change in<br />

(log)disease risk associated with <strong>the</strong> P(<br />

change D)<br />

� P(<br />

in D over | E ) time (x -x ) true exposure level.<br />

195<br />

PAR �<br />

i2 i1<br />

A second way <strong>of</strong> rewriting <strong>the</strong> above, basic<br />

P(<br />

D<br />

model<br />

)<br />

194 log( risk) � � � is: � L �w1 xi1<br />

� w2x<br />

i2<br />

�<br />

risk) � � � � w x � w x ,<br />

log( L 1 i1<br />

2 i2<br />

194 � �<br />

where β = Σ β and wi = β / β , that is, exposures assessed in visits (time periods) 1 and<br />

L i2 i i L P(<br />

D)<br />

� P(<br />

D*)<br />

2 are 195 weighted by <strong>the</strong> strength GIF � <strong>of</strong> <strong>the</strong>ir log-linear P(<br />

D)<br />

� Prelationship<br />

( D | E ) with risk (i.e., weighted by<br />

<strong>the</strong>ir respective 195 β-coefficients). In this PARP<br />

second � ( D)<br />

model, reflects <strong>the</strong> long-term association<br />

P(<br />

D)<br />

� P(<br />

D | PE<br />

( ) D)<br />

for subjects 195 whose risk factor PARlevel<br />

� has differed by 1 unit throughout <strong>the</strong> full cohort study<br />

period, and for modifiable exposures this P(<br />

can D)<br />

be interpreted as measuring <strong>the</strong> long-term<br />

potentially modifiable risk, with respect to a long-term exposure history. In <strong>the</strong> special case<br />

where β = β , <strong>the</strong> latter model will � provide 1 �estimates<br />

for a single regression coefficient, β 1 2 L<br />

203<br />

VIF �<br />

(= β = β ), relating (log) disease risk �<br />

� P<br />

to <strong>the</strong> �<br />

� ( D)<br />

� P(<br />

D*)<br />

195<br />

GIF 2 (equally weighted) average <strong>of</strong> <strong>the</strong> exposures, x 1 2 �1<br />

� ��<br />

CE<br />

i1<br />

and x , in <strong>the</strong> two index periods. Averaging P(<br />

D)<br />

� P�<br />

( <strong>the</strong> D*)<br />

Pexposures<br />

( D)<br />

195<br />

from two index periods will give<br />

i2 GIF �<br />

a corresponding increase in statistical power P(<br />

D)<br />

and estimated accuracy (confidence interval)<br />

<strong>of</strong> RR estimates, as intraindividual (time-related) variations in exposure will be reduced by<br />

taking an integrated measure <strong>of</strong> <strong>the</strong> 2exposures<br />

at <strong>the</strong> two time points. However, if β > β<br />

203<br />

�<br />

1 2<br />

CE<br />

(or vice versa, β < β ), <strong>the</strong>n unequal weights � 1will<br />

be � given to <strong>the</strong> exposures in <strong>the</strong> different<br />

203 1 2 VIF �<br />

index periods and, depending on <strong>the</strong> degree �<br />

�<br />

to which �<br />

�<br />

2<br />

� 1 ��1<br />

� � �<br />

<strong>the</strong> relative weighting predominantly<br />

CE<br />

favors 203 <strong>the</strong> exposure at just VIF one �<strong>of</strong> �<br />

�<strong>the</strong><br />

time �<br />

�<br />

2 points, x or x , <strong>the</strong> potential gains in statistical<br />

1 2<br />

power by averaging exposures over �1<br />

� �CE<br />

time �(testing<br />

<strong>the</strong> null hypo<strong>the</strong>sis β = 0 for long-term<br />

L<br />

association) will be smaller.<br />

2<br />

203<br />

� CE<br />

2<br />

203<br />

�<br />

CE<br />

173<br />

A.6

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!