29.06.2013 Views

Evaluating Patient-Based Outcome Measures - NIHR Health ...

Evaluating Patient-Based Outcome Measures - NIHR Health ...

Evaluating Patient-Based Outcome Measures - NIHR Health ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

38<br />

Criteria for selecting a patient-based outcome measure<br />

biases is greater, the further removed a trial is<br />

from a drug trial, so that differences in the<br />

processes of care between arms of a trial have<br />

greater chances of influencing patients’<br />

judgements of outcomes.<br />

The social psychological literature on such forms<br />

of bias is quite extensive, particularly with regard<br />

to phenomena such as halo effects, social desirability<br />

effects and so on. There is evidence from<br />

qualitative research based on recordings of subjects<br />

talking about completing the NHP, that a considerable<br />

amount of cognitive work by patients precedes<br />

them selecting response categories (Donovan et al.,<br />

1993). Respondents, for example, attempt to work<br />

out what investigators’ intentions are in asking<br />

particular questions. However, there is no research<br />

evidence that we have found that considers whether<br />

different patient-based outcome measures might be<br />

more or less prone to cognitive effects that could<br />

bias results.<br />

Other sources of potential bias have been examined<br />

in relation to health status questionnaires<br />

(Anonymous, 1995). Item bias occurs when<br />

background variables such as the respondent’s<br />

gender or age, affect the response to items in the<br />

questionnaire. In comparing groups, item bias<br />

analysis tests the influence of variables, such as age,<br />

sex or race on patterns of responses and examines<br />

whether the possible differences between groups<br />

is correctly shown in the score (Groenvold et al.,<br />

1995). A QoL questionnaire used with breast<br />

cancer patients was analysed and item bias was<br />

found in three out of nine dimensions due to age<br />

and other factors (Groenvold et al., 1995).<br />

A simple form of bias was identified in a study<br />

of outcomes of care for rheumatoid arthritis<br />

(Jenkinson et al., 1993). <strong>Patient</strong>s with rheumatoid<br />

arthritis were asked either as inpatients or outpatients<br />

to complete several health status measures.<br />

All were followed up and reassessed with the same<br />

measures in outpatient clinics 3 months later.<br />

Those who were inpatients at baseline showed very<br />

substantial improvements on the mobility scale of<br />

the FLP but not on the equivalent scale of the NHP.<br />

Outpatients showed little improvement by either<br />

measure. Much of the improvement on<br />

the FLP scale amongst those who were initially<br />

inpatients was attributed to the fact that the FLP,<br />

but not the NHP, produced more severe scores for<br />

anyone confined to a bed per se regardless of health<br />

status. Moving out of hospital confinement alone<br />

produced substantial changes in one, but not the<br />

other instrument. In a trial where randomisation<br />

was between, say hospital and outpatient or<br />

ambulatory management, it seems likely that one<br />

instrument would have much more potential for<br />

systematic bias, by ‘exaggerating’ the degree of<br />

improvement of individuals leaving hospital care.<br />

Another potential source of bias is the influence of<br />

psychological mood upon patient-based outcome<br />

measures. There is a range of evidence that factors<br />

such as depressed mood have a disproportionate<br />

influence upon patterns of answers to health status<br />

questionnaires (Spiegel et al., 1988; Sensky and<br />

Catalan, 1992). Indeed in the MOS study, individuals<br />

with confirmed depression had amongst<br />

the poorest general health status scores of any<br />

chronically ill group studied (Wells et al., 1989).<br />

Disentangling the reasons for such patterns is not<br />

easy. Depression may be associated with poor<br />

physical health for a variety of reasons (Brooks<br />

et al., 1990). However, it is also possible that such<br />

patterns reflect cognitive distortion. In a randomised<br />

trial design, such effects may not necessarily<br />

have important consequences. However, unrecognised<br />

depression may distort evidence of overall<br />

effectiveness of treatments across dimensions of<br />

health status. Some instruments have been shown<br />

to be relatively immune to such effects; the HAQ,<br />

for example, seems relatively unaltered by<br />

depressed mood (Peck et al., 1989).<br />

Again, informal inspection of the content of<br />

instruments is as likely to identify the kinds of gross<br />

systematic bias just illustrated. More generally, the<br />

field has tended to address this issue more by<br />

attempting to reduce random error in patientbased<br />

outcome measures.<br />

<strong>Patient</strong>-based outcome measures may therefore<br />

vary in how clearly and precisely the numerical<br />

values generated by measures relate to underlying<br />

distributions of patients’ experiences of health<br />

status. Investigators need to consider (i) precision<br />

of response categories, (ii) precision of numerical<br />

values, (iii) distribution of items over true range,<br />

(iv) ceiling and floor effects, (v) precision of scales<br />

and (vi) sources of potential bias in the scoring of<br />

instruments. The degree of precision required of a<br />

patient-based outcome measure will depend on<br />

other aspects of trial design such as sample size,<br />

and also on the differences expected to be found<br />

between arms of the trial. However, investigators<br />

need to have some sense of the meaning of the<br />

scores that will be generated by instruments that<br />

they intend to use in a trial and precision is a<br />

component of meaning.<br />

Summary<br />

Overall, we are here concerned with how precise<br />

are the distinctions made by an instrument, with at

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!