Evaluating Patient-Based Outcome Measures - NIHR Health ...
Evaluating Patient-Based Outcome Measures - NIHR Health ...
Evaluating Patient-Based Outcome Measures - NIHR Health ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
28<br />
Criteria for selecting a patient-based outcome measure<br />
wide-ranging purposes, they require more<br />
extensive validation.<br />
There are potentially three ways in which such<br />
measures need to be assessed for validity: as<br />
measures of (i) health status, (ii) of personal<br />
preferences and utilities, and (iii) of the social<br />
value. It is important to recognise that these are<br />
distinct constructs. Thus, Nord (1992) suggests that<br />
if measures are intended to provide assessments of<br />
the social value of interventions, as opposed to a<br />
measure of the individual utilities of patients, then<br />
one important component of validation would<br />
need to be reflective equilibrium whereby respondents<br />
are directly invited to consider and accept<br />
the implications in terms of resource allocation<br />
of judgements made about weightings. To the<br />
extent that public opinion accepted decisions<br />
about resource allocation of health services based<br />
on evidence of costs and benefits of treatments<br />
and where benefits are in part measured by<br />
utilities, according to Nord, this would be an<br />
indirect form of validation of utility measures<br />
used. An Australian study found very little public<br />
support for health policies that aimed to maximise<br />
health benefit without egalitarian considerations<br />
(Nord et al., 1995).<br />
Nord and colleagues has examined the validity<br />
of generic measures by more direct methods.<br />
In a series of surveys of Norwegian and Australian<br />
samples, Nord and colleagues (1993) examined<br />
respondents’ ratings of specific health states for an<br />
instrument in comparison with the same health<br />
states judged by ‘Person Trade Off’. The latter<br />
method involves respondents being asked to state<br />
what numbers of patients receiving one treatment<br />
are equivalent to a specific number receiving a<br />
second different treatment and is intended more<br />
directly to assess the social value of different<br />
health states. By comparing respondents’ ratings<br />
from the two methods, it emerged that the<br />
instrument weighted by conventional rating<br />
scales produced much lower values for health<br />
states than did the Person Trade Off method, so<br />
that it appeared to produce lower social value for<br />
interventions. As Nord and colleagues argue,<br />
measures need to be examined separately to<br />
assess their validity as measures of individuals’<br />
utilities and as measures of social value. The two<br />
are distinct constructs and a measure may be<br />
valid as a measure of one but not the other and<br />
this is an area of continued methodological<br />
debate as to the validity of instruments for these<br />
different purposes (Carr-Hill, 1992; The EuroQol<br />
Group, 1992; Gafni and Birch, 1993; Dolan and<br />
Kind, 1996).<br />
Summary<br />
The apparently simple question as to whether an<br />
instrument measures what it purports to measure<br />
has to be considered by means of a range of different<br />
kinds of evidence including how content was<br />
determined, inspection of the content, and of<br />
patterns of relationships to other variables. Because<br />
no single set of observations is likely to determine<br />
validity and different kinds of evidence are needed,<br />
judgement of this property of an instrument in<br />
relation to a specific trial is not straightforward.<br />
Responsiveness<br />
Does the instrument detect changes<br />
over time that matter to patients?<br />
For use in trials, it is essential that a health<br />
status questionnaire can detect important changes<br />
over time within individuals, that might reflect<br />
therapeutic effects (Kirshner and Guyatt, 1985;<br />
Kirshner, 1991). This section addresses sensitivity<br />
to change, or responsiveness. The latter term is<br />
preferable because sensitivity has a number of<br />
more general uses in epidemiology. As it is conceivable<br />
for an instrument to be both reliable<br />
and valid but not responsive, this dimension<br />
of a health status measure is increasingly essential<br />
to evaluate. Potential confusion is caused by the<br />
fact that in order to emphasise its importance,<br />
some authors treat it as an aspect of validity<br />
(Hays and Hadorn, 1992). Guyatt and colleagues<br />
(1989a) define responsiveness as the ability of<br />
an instrument to detect clinically important<br />
change. They provide illustrative evidence of<br />
the importance of this aspect of instruments with<br />
data from a controlled trial of chemotherapy for<br />
breast cancer. Four health status instruments<br />
considered to be validated were completed by<br />
women. However only one of the four instruments<br />
showed expected differences over time as well<br />
as providing valid evidence of women’s’ health<br />
status. Guyatt and colleagues (1987, 1992a,b)<br />
have emphasised an important distinction<br />
between discriminative instruments intended to<br />
be particularly valid in distinguishing between<br />
respondents at a point in time and evaluative<br />
instruments that need to be particularly sensitive<br />
to changes within individuals over time in the<br />
context of clinical trials. The question at the<br />
beginning of this section emphasises that<br />
patient-based outcome measures changes of<br />
importance to patients. Whilst responsiveness<br />
as such is not defined in terms of importance<br />
to patients, this would seem an important<br />
specification in relation to patient-based<br />
outcome measures.