Evaluating Patient-Based Outcome Measures - NIHR Health ...
Evaluating Patient-Based Outcome Measures - NIHR Health ...
Evaluating Patient-Based Outcome Measures - NIHR Health ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
38<br />
Criteria for selecting a patient-based outcome measure<br />
biases is greater, the further removed a trial is<br />
from a drug trial, so that differences in the<br />
processes of care between arms of a trial have<br />
greater chances of influencing patients’<br />
judgements of outcomes.<br />
The social psychological literature on such forms<br />
of bias is quite extensive, particularly with regard<br />
to phenomena such as halo effects, social desirability<br />
effects and so on. There is evidence from<br />
qualitative research based on recordings of subjects<br />
talking about completing the NHP, that a considerable<br />
amount of cognitive work by patients precedes<br />
them selecting response categories (Donovan et al.,<br />
1993). Respondents, for example, attempt to work<br />
out what investigators’ intentions are in asking<br />
particular questions. However, there is no research<br />
evidence that we have found that considers whether<br />
different patient-based outcome measures might be<br />
more or less prone to cognitive effects that could<br />
bias results.<br />
Other sources of potential bias have been examined<br />
in relation to health status questionnaires<br />
(Anonymous, 1995). Item bias occurs when<br />
background variables such as the respondent’s<br />
gender or age, affect the response to items in the<br />
questionnaire. In comparing groups, item bias<br />
analysis tests the influence of variables, such as age,<br />
sex or race on patterns of responses and examines<br />
whether the possible differences between groups<br />
is correctly shown in the score (Groenvold et al.,<br />
1995). A QoL questionnaire used with breast<br />
cancer patients was analysed and item bias was<br />
found in three out of nine dimensions due to age<br />
and other factors (Groenvold et al., 1995).<br />
A simple form of bias was identified in a study<br />
of outcomes of care for rheumatoid arthritis<br />
(Jenkinson et al., 1993). <strong>Patient</strong>s with rheumatoid<br />
arthritis were asked either as inpatients or outpatients<br />
to complete several health status measures.<br />
All were followed up and reassessed with the same<br />
measures in outpatient clinics 3 months later.<br />
Those who were inpatients at baseline showed very<br />
substantial improvements on the mobility scale of<br />
the FLP but not on the equivalent scale of the NHP.<br />
Outpatients showed little improvement by either<br />
measure. Much of the improvement on<br />
the FLP scale amongst those who were initially<br />
inpatients was attributed to the fact that the FLP,<br />
but not the NHP, produced more severe scores for<br />
anyone confined to a bed per se regardless of health<br />
status. Moving out of hospital confinement alone<br />
produced substantial changes in one, but not the<br />
other instrument. In a trial where randomisation<br />
was between, say hospital and outpatient or<br />
ambulatory management, it seems likely that one<br />
instrument would have much more potential for<br />
systematic bias, by ‘exaggerating’ the degree of<br />
improvement of individuals leaving hospital care.<br />
Another potential source of bias is the influence of<br />
psychological mood upon patient-based outcome<br />
measures. There is a range of evidence that factors<br />
such as depressed mood have a disproportionate<br />
influence upon patterns of answers to health status<br />
questionnaires (Spiegel et al., 1988; Sensky and<br />
Catalan, 1992). Indeed in the MOS study, individuals<br />
with confirmed depression had amongst<br />
the poorest general health status scores of any<br />
chronically ill group studied (Wells et al., 1989).<br />
Disentangling the reasons for such patterns is not<br />
easy. Depression may be associated with poor<br />
physical health for a variety of reasons (Brooks<br />
et al., 1990). However, it is also possible that such<br />
patterns reflect cognitive distortion. In a randomised<br />
trial design, such effects may not necessarily<br />
have important consequences. However, unrecognised<br />
depression may distort evidence of overall<br />
effectiveness of treatments across dimensions of<br />
health status. Some instruments have been shown<br />
to be relatively immune to such effects; the HAQ,<br />
for example, seems relatively unaltered by<br />
depressed mood (Peck et al., 1989).<br />
Again, informal inspection of the content of<br />
instruments is as likely to identify the kinds of gross<br />
systematic bias just illustrated. More generally, the<br />
field has tended to address this issue more by<br />
attempting to reduce random error in patientbased<br />
outcome measures.<br />
<strong>Patient</strong>-based outcome measures may therefore<br />
vary in how clearly and precisely the numerical<br />
values generated by measures relate to underlying<br />
distributions of patients’ experiences of health<br />
status. Investigators need to consider (i) precision<br />
of response categories, (ii) precision of numerical<br />
values, (iii) distribution of items over true range,<br />
(iv) ceiling and floor effects, (v) precision of scales<br />
and (vi) sources of potential bias in the scoring of<br />
instruments. The degree of precision required of a<br />
patient-based outcome measure will depend on<br />
other aspects of trial design such as sample size,<br />
and also on the differences expected to be found<br />
between arms of the trial. However, investigators<br />
need to have some sense of the meaning of the<br />
scores that will be generated by instruments that<br />
they intend to use in a trial and precision is a<br />
component of meaning.<br />
Summary<br />
Overall, we are here concerned with how precise<br />
are the distinctions made by an instrument, with at