Evaluating non-randomised intervention studies - NIHR Health ...


Evaluation of checklists and scales for assessing quality of non-randomised studies

In terms of the four pre-specified core items, 15 of the 60 tools included none of the core items despite covering at least five domains, 16 covered one core item and 15 covered two items. The remaining 14 tools covered at least three core items and were considered to be the 'best' tools in our sample, two of which covered all four items. 104,105 It is interesting that of the six tools that included items in all six internal validity domains, 72,77,83,85,97,106 only one 85 included three of our four core items. None of the tools designed only for RCTs included three of the four core items.

'Best' tools

Fourteen tools were identified which covered at least five of the six internal validity domains and three of the four core items. Tables 9 and 10 itemise the pre-specified items covered by each tool.

Amongst the top 14 tools, the internal validity domain with the poorest coverage was analysis (four tools with zero items – CASP, 64 Fowkes, 107 Newcastle–Ottawa 66 and Weintraub 108), followed by blinding (missed by Bracken 104 and Zaza 86) and ascertainment (missed by Cowley 109 and Hadorn 102). The item most commonly missed was equal follow-up between groups (included by only two tools – Bracken 104 and Downs 85). Only three tools asked about use of intention-to-treat analysis (Cowley, 109 Thomas 65 and Vickers 110).

The two core domains were reasonably well covered. For the creation of groups domain, all of the tools except those specifically designed only for observational studies (Bracken, 104 CASP 64 and Newcastle–Ottawa 66) included an item on randomisation, but only two tools specifically considered the use of allocation concealment (Downs 85 and DuRant 99). Of the four core items, the most commonly missed was that relating to how allocation occurred; only eight tools included it. Ideally, we were looking for an item that asked how participants got into their respective groups, for example whether by clinician or patient preference, or by spatial or temporal assignment. All of the tools except Downs, 85 DuRant 99 and Hadorn 102 included the second pre-specified item – balancing of groups by design.

For the comparability of groups domain, the two pre-specified items – identification of prognostic factors and use of case-mix adjustment – were missed by only two tools 109,111 and one tool, 108 respectively. All of the tools except the CASP tool 64 and the Newcastle–Ottawa tool 66 asked whether baseline comparability had been assessed.

Our pre-specified items in the remaining six domains (Table 10) were, on the whole, not well covered, except perhaps for that relating to the selection of the study sample. Every tool included an item about the representativeness of the sample, and only four did not ask about the study inclusion/exclusion criteria (CASP, 64 Cowley, 109 Newcastle–Ottawa 66 and Thomas 65). One item in this domain that relates to internal validity – retrospective or prospective selection of the sample – was included by only two tools, Cowley 109 and Reisch. 111

The remaining five domains, concerning the quality of study reporting, were not well covered by the tools. The most commonly included item was one that considered clear specification of the interventions (nine tools). On the other hand, clear specification of the outcomes was included in only five tools.

Qualitative assessment of the 'best' tools

Of the best 14 tools, eight were judged to be unsuitable for use in a systematic review. 64,99,102,104,105,107,108,110 A description of which of the core criteria they covered, and our assessment of them, is provided in Appendix 4.

In summary, their unsuitability was largely related to the fact that they were not designed for use in a systematic review of effectiveness: one was published to guide the reporting of observational studies; 104 five were intended to help in the critical appraisal of research articles; 64,99,107,108,110 and one was developed for an epidemiological review. 105 Overall, these tools generally prompted some thinking about quality issues, but were not formatted in such a way as to allow an overall assessment of study quality or the comparison of quality across studies. Some 64,108 did conclude with a more general item requiring a judgement on the overall quality of the study, but little guidance was provided on how this judgement should be made. The Hadorn tool 102 was intended for use in systematic reviews, but the assessors queried the inclusion or phrasing of several of the items; for example, the emphasis on drug trials and the use of placebos was felt to be overly specific.

Six quality assessment tools were judged to be potentially useful for systematic reviews, 65,66,85,86,109,111 although in several cases some modifications would be useful. All but one of
