Evaluating Patient-Based Outcome Measures - NIHR Health ...
Evaluating Patient-Based Outcome Measures - NIHR Health ...
Evaluating Patient-Based Outcome Measures - NIHR Health ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
10<br />
What are patient-based outcome measures?<br />
considering either omit this dimension or include<br />
only superficial assessments.<br />
A potential problem is that assessments of psychological<br />
well-being in particular were often developed<br />
more to measure inter-patient differences for<br />
purposes of diagnosis or needs assessment than<br />
as outcome measures. Evidence for their appropriateness<br />
as an outcome measure requiring<br />
sensitivity to changes over time therefore needs to<br />
be examined carefully. Clearly in the context of a<br />
trial, obtaining a more detailed assessment of one<br />
dimension such as depression or pain must involve<br />
some reduction of depth on other dimensions if<br />
the overall burden of data collection on patients<br />
is not to be too great, so careful thought is<br />
required about the significance of the<br />
proposed specific dimension.<br />
Generic instruments<br />
Generic instruments are intended to capture a very<br />
broad range of aspects of health status and the consequences<br />
of illness and therefore to be relevant to<br />
a wide range of patient groups. The content of such<br />
questionnaires has been deliberately designed to be<br />
widely appropriate. They may provide assessments<br />
of the health status of samples of individuals not<br />
recruited because they have a specific disease, for<br />
example, from primary care or the community as a<br />
whole. One of the most widely used of such instruments<br />
is the SF-36 (Ware and Sherbourne, 1992).<br />
It is a 36-item questionnaire which measures health<br />
status in eight dimensions: physical functioning,<br />
role limitations due to physical problems, role<br />
limitations due to emotional problems, social<br />
functioning, mental health, energy/vitality, pain<br />
and general perceptions of health. An additional<br />
single item asks the respondent about health<br />
change over 1 year which is not scaled. Item<br />
responses are summed to give a score for each<br />
dimension. Evidence has also been presented that<br />
SF-36 can be used in several other forms. The items<br />
have been summed into two summary measures:<br />
the physical component summary and mental<br />
component summary (Ware et al., 1995). The<br />
SF-36 has also been further reduced into a<br />
12-item version (Ware et al., 1996b).<br />
A more lengthy generic instrument is the Functional<br />
Limitations Profile (FLP) which is the English<br />
version of Sickness Impact Profile (SIP) developed<br />
in the United States (Patrick and Peach, 1989). The<br />
FLP measures sickness related behavioural<br />
dysfunction by assessing individual perceptions of<br />
the effect of illness upon usual daily activities. It<br />
consists of 136 items grouped into 12 dimensions<br />
covering ambulation, bodycare and movement,<br />
mobility, household management, recreation and<br />
pastimes, social interaction, emotion, alertness, sleep<br />
and rest, eating, communication and work. Unlike<br />
the SF-36, the FLP uses weights expressing the<br />
severity of individual items that have been derived<br />
from prior research. The FLP has a summary score<br />
for physical and psychosocial dimensions and a total<br />
score can also be calculated.<br />
Advantages and disadvantages<br />
The main advantage of generic instruments is that<br />
they can in principle be used for a broad range of<br />
health problems. This means that they may be of<br />
use if no disease-specific instrument exists in a<br />
particular area (Visser et al., 1994), although this<br />
is increasingly unlikely to be the case. Because it<br />
can be widely used, it enables comparisons across<br />
treatments for groups of patients with different<br />
groups, to assess comparative effectiveness. Because<br />
of their broad range of content and more general<br />
applicability, such instruments have been used<br />
more frequently than disease-specific instruments<br />
to assess the health of non-hospital samples in the<br />
general population. This has led to the use of such<br />
data to generate ‘normative values’, that is scores in<br />
the well with which patients with health problems<br />
can be compared. Because generic instruments<br />
are intended to be broad in scope, they may have<br />
value in detecting unexpected positive or negative<br />
effects of an intervention, whereas disease-specific<br />
instruments focus on known and anticipated<br />
consequences (Cox et al., 1992; Fletcher et al.,<br />
1992). Another potential advantage is that by<br />
covering a wide range of dimensions in a relatively<br />
economic format, they reduce the patient burden<br />
entailed by using a number of questionnaires.<br />
A less tangible advantage to any individual user<br />
is that, if trials generally converged on the use<br />
of a small number of generic instruments, a more<br />
general body of experience and comparative<br />
evidence could emerge to enhance the value<br />
and interpretability of patient-based outcome<br />
measures (Hadorn et al., 1995; Ware, 1995).<br />
Against these postulated advantages have to be<br />
weighed some potential disadvantages. In particular,<br />
it may be argued that by including items across a<br />
broad range of aspects of health status, generic<br />
instruments must sacrifice some level of detail in<br />
terms of relevance to any one illness. The risk is<br />
therefore of some loss of relevance of questionnaire<br />
items when applied in any specific context. A particularly<br />
important potential consequence for clinical<br />
trials is that generic instruments would have fewer<br />
relevant items to the particular disease and intervention<br />
and therefore be less sensitive to changes<br />
that might occur as a result of an intervention.