03.01.2015 Views

Chance, bias and confounding - The INCLEN Trust

Chance, bias and confounding - The INCLEN Trust

Chance, bias and confounding - The INCLEN Trust

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Chance</strong>, <strong>bias</strong> <strong>and</strong> <strong>confounding</strong><br />

• <strong>The</strong> observed statistical association between a<br />

certain outcome <strong>and</strong> <strong>and</strong> the hypothesized<br />

exposure could be a matter of chance<br />

• Or it could be the result of systematic errors in<br />

collection of data (sampling, disease <strong>and</strong> exposure<br />

ascertainment) or its interpretation: the role of <strong>bias</strong><br />

• Or it could be due to the effect of additional<br />

variables that might be responsible for the<br />

observed association: the role of <strong>confounding</strong><br />

• Or it could be a real association


Confounder<br />

• Is a factor that distorts the true relationship<br />

between an exposure <strong>and</strong> the disease outcome on<br />

account of its being associated with both the<br />

exposure as well as the disease<br />

• This distortion (over/underestimation) of the true<br />

relation between exposure <strong>and</strong> disease can occur<br />

only if this factor is unequally distributed between<br />

the exposed <strong>and</strong> unexposed groups


Confounding<br />

• A confounder is a third factor that is associated<br />

with the exposure <strong>and</strong> independently affects the<br />

risk of developing the disease<br />

• It distorts the estimate of true relationship between<br />

the exposure <strong>and</strong> disease: it may result in<br />

association being observed when none in fact<br />

exists; or no association being observed when a<br />

true relationship does exist


Confounder<br />

• A potential confounder must be predictive<br />

of disease independently of its association<br />

with the exposure under study<br />

• This means that there must be an<br />

association between the confounder <strong>and</strong><br />

disease even amongst the group unexposed<br />

to the exposure under investigation


Confounder<br />

• This third factor should not be merely an<br />

intermediate step in the cause <strong>and</strong> effect<br />

relationship between the exposure <strong>and</strong> the<br />

disease outcome<br />

• <strong>The</strong> association between the confounder <strong>and</strong><br />

the disease need not be causal. It may a<br />

marker for for a risk factor other than the<br />

one under investigation in a study.


Confounding<br />

• Confounding can lead to the observation of<br />

apparent differences between the study<br />

groups when they do not truly exist, or<br />

conversely, the observation of of no<br />

difference when they do exist.


An example of <strong>confounding</strong><br />

• A number of observational epidemiological<br />

studies have shown an inverse association between<br />

the consumption of vegetables rich in β carotene<br />

with the risk of cancer<br />

• It is however possible that this association is<br />

confounded by other differences between the<br />

consumers <strong>and</strong> non-consumers of vegetables such<br />

as fiber, which is known to reduce the risk of<br />

cancer


Confounding: another example<br />

• An observed association between the consumption<br />

of coffee <strong>and</strong> the risk of MI could be due, at least<br />

in part, to the effect of cigarette smoking, since<br />

coffee drinking is associated with smoking , <strong>and</strong><br />

independent of coffee drinking, smoking is a risk<br />

factor for MI<br />

• <strong>The</strong> potential or true confounders are not always<br />

as obvious as they are in the examples cited above


How to avoid <strong>confounding</strong><br />

• If a <strong>confounding</strong> factor does not vary between the<br />

exposed <strong>and</strong> non-exposed, or those diseases <strong>and</strong><br />

non-diseased, then by definition, there can be no<br />

<strong>confounding</strong> by that variable<br />

• Thus if by design or analysis, the association<br />

between disease <strong>and</strong> exposure is evaluated only<br />

amongst those who are similar with respect to the<br />

<strong>confounding</strong> factor, there can be no <strong>confounding</strong>


Controlling confounders<br />

• Restriction of the study population<br />

• Matching<br />

• R<strong>and</strong>omization of exposure<br />

• Stratification<br />

• Multivariable analysis


Common confounders<br />

• Age <strong>and</strong> sex are almost universal<br />

confounders for all exposure – disease<br />

associations<br />

• This is because they are markers for a<br />

whole lot of cumulative exposures. <strong>The</strong>y<br />

may not be causally related to disease, but<br />

are markers for many other exposures<br />

which might be truly related to disease.


Confounding: the intermediate<br />

link<br />

• Moderate consumption of alcohol is associated<br />

with reduced risk of CAD<br />

• HDL cholesterol also is protective for CAD<br />

• Moderate alcohol consumption increases HDL<br />

• If one controls for HDL, the association between<br />

alcohol intake <strong>and</strong> the risk of CAD becomes weak<br />

or statistically insignificant.<br />

• Being an intermediate link between alcohol <strong>and</strong><br />

the risk of CAD, should HDL be considered a<br />

confounder at all Should it be controlled


Positive <strong>and</strong> negative<br />

<strong>confounding</strong><br />

• Tobacco smoking would be a positive<br />

confounder in association between coffee<br />

drinking <strong>and</strong> CAD<br />

• <strong>The</strong> association between physical activity<br />

<strong>and</strong> CAD would be negatively confounded<br />

by gender, since women have lower risk of<br />

CAD <strong>and</strong> they also exercise less than men.


R<strong>and</strong>omization<br />

• Applicable only to interventional studies<br />

• Most powerful method to control for<br />

known, potential or unknown confounders<br />

if the sample size is sufficiently large


Restriction<br />

• Reduces the number of eligible subjects for<br />

enrollment<br />

• It limits generalizability of observations to<br />

only the restricted population use for<br />

drawing the r<strong>and</strong>om sample


Matching<br />

• It includes elements of both design <strong>and</strong> analysis<br />

• Mostly applicable to case-control study design<br />

• It is expensive, difficult <strong>and</strong> time consuming<br />

• By design, the effect of risk factor which has been<br />

matched can not be studied<br />

• Confounding is avoided not just by matching but<br />

by special method of matched table analysis


Analysis<br />

• Stratified analyses: Stratum specific estimates of<br />

association are calculated, <strong>and</strong> the differences<br />

amongst the strata are assessed by eyeballing, or<br />

performing appropriate tests of statistical<br />

significance<br />

• Summary statistic for the pooled data is calculated<br />

as per the method of Mantel <strong>and</strong> Haenszel<br />

• <strong>The</strong> magnitude of <strong>confounding</strong> is assessed by<br />

looking at the discrepancy between the crude <strong>and</strong><br />

adjusted estimates (without applying any tests of<br />

statistical significance)


Confounding <strong>and</strong> effect<br />

modification<br />

• Confounding distorts the true relationship between<br />

the exposure <strong>and</strong> disease <strong>and</strong> should be controlled<br />

• Effect modification tells us that the association<br />

between exposure <strong>and</strong> disease is modified by a<br />

third factor. It should not be controlled for, the<br />

magnitude of effect modification should be<br />

reported <strong>and</strong> biological explanation for its<br />

presence sought.


Bias<br />

• <strong>The</strong> study must be designed <strong>and</strong> conducted in such<br />

a manner that that every possibility of introducing<br />

a <strong>bias</strong> is anticipated <strong>and</strong> steps are taken to<br />

minimize its occurrence<br />

• In spite of these precautions, the observed<br />

association should be carefully examined to see if<br />

it could be explained by <strong>bias</strong>.<br />

• If indeed the study has elements of <strong>bias</strong>, it can not<br />

be rectified at the stage of analysis (unlike<br />

<strong>confounding</strong>)


Types of <strong>bias</strong><br />

• Selection <strong>bias</strong>: A particular problem in case<br />

control <strong>and</strong> retrospective cohort studies<br />

where both exposure <strong>and</strong> disease have<br />

occurred at the time of selection of<br />

individuals for the study<br />

• Information <strong>bias</strong>


Selection <strong>bias</strong><br />

• Differential surveillance, diagnosis or<br />

referral of individuals in the study: e.g.,<br />

women using estrogen have uterine<br />

bleeding more often, <strong>and</strong> seek medical<br />

attention for this symptom. Hence they are<br />

more likely to seek diagnostic evaluation<br />

than those who are not on estrogens<br />

resulting in more frequent diagnosis of<br />

uterine cancer in women on estrogens


Multivariate regression analysis<br />

• Several potential confounders can be<br />

controlled; this is not easy in stratified<br />

analysis<br />

• It is an efficient method of data analysis<br />

• Several models for regression exist. Choice<br />

depends on the type of data to be analysed.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!