Professional Documents
Culture Documents
>L
ir,c<
J b O ___
ISBN 0-7279-1099-X
Typesct by A pekTypeseiters Ltd, Nailsea, Bristol
Printcd and bound in Great Britain by Latim erTrend Ltd, Plymouth
Contents
Preface
Acknowledgements
1 Introducing criticai appraisal
C la rify yo u r reasons for read in g
S p ecify yo u r info rm atio n n eed
Id en tify the relevan t repo rts
C ritic a lly ap p raise the p apers
C O N TF.N TS
Statistical significance
T h e p lay of chance
Probabiliry
T h e logic of statistical tests
C onfidence intervals
Pitfalls in the analysis
O utliers
Skew
N on-independence
S erendipity m asq u erad in g as hypothesis
B lack-box analyses
Bias
C onfounding
6 T h e s t a n d a r d a p p r a is a l q u e s tio n s
Are the aim s clearly stated?
Was the sam ple size justified?
Are the m easurem ents likely to be v alid and reliable?
Are the statistical m ethods described?
Did untow ard events occur d urin g the study?
Were the basic data ad eq u ately described?
Do the num bers add up?
Was the statistical significance assessed?
W hat do the m ain findings m ean?
How are n ull findings interpreted?
Are im portant effects overlooked?
How do the results com pare w ith previous reports?
W hat im plications does the study have for your practice?
Y- 7 A pp raisin g surveys
The essential questions
T he specific questions
R ationale for the essential questions
R ationale for the specific questions
T he com plete list for the appraisal o f surveys
CONTENTS
X8
A p p r a is in g c o h o rt s tu d ie s
T h e essen tial questions
T h e specific questions
R atio n ale for the essential questions
R atio n ale for the specific questions
T h e com plete list for the appraisal of cohort studies
9 A p p r a is in g c lin ic a i t r ia ls
T h e essential questions
T h e specific questions
R atio n ale for the essential questions
R atio n ale for the specific questions
T h e com plete list for the app raisal o f clinicai trials
10 A p p r a is in g c a s e - c o n t r o l s t u d ie s
T h e essen tial questions
T h e specific questions
R atio n ale for the essential questions
R atio n ale for the specific questions
T h e com plete list for the appraisal o f case-co ntrol studies
V
11 A p p r a is in g r e v ie w p a p e r s
T h e essen tial questions
T h e specific questions
R a tio n ale for the essential questions
R atio n ale for the specific questions
T h e com plete list for the appraisal o f review papers
In d ex
63
P reface
ix
A c k n o w le d g e m e n ts
1 Introducing criticai
app raisal
(N T R O D U C IN G C R IT IC A L A P PR A 1SA L
S p e c ify y o u r in fo rm a tio n n e e d
C larifyin g w hat you w ant to find out sh ould indicate the am ount
of inform ation required. M an y queries can be best answ ered from
current textbooks or review articles. B ut these w ill not contain the
m ost recen t studies, and will not contain the levei of detail of the
original papers. T h us, the read er should ask: ^X^hat kind of reports
do I w ant? How m uch detail do I need? H ow com prehensive do 1
need to be? How far back should I search? T h e answ ers to these
questions follow from the reasons for reading.
Is it o f interest?
W h y w as it done?
H ow w as it done?
W h at has it found?
W h a t are the im p lication s?
W h at else is of interest?
Q U E ST IO N S T O A SK W H EN RE A D IN G A PAPER
Q UESTIO N S T O A SK W H E N RE A D IN G A PAPER
3 Identifying the
research m ethod
Surveys
S u rv eys are used to jd e sc rib e h " w j ^
A sam ple of
in d iv id u ais is iden tifiedV T na^it're"c)b tain ed on each at m ore or
less the sam e time. T h e sam ple b ein g studied may be taken from
the gen eral population, or m ay be a highly selected one. For
exam p le, surveys could be c a rrie d out to establish the leveis of
seru m cholesterol in the gen eral population. However, surveys can
also be u sed to study specific groups such as pregnant w om en,
p h ysio th erap ists, o rp erso n s ag ed betw een 65 and 90 years. Surveys
7
ID EN TJ FY JN G TH E R E SE A R CH M E T H O D
Complications
M o st survevs do not have a sep arate control or gom parjson
group. T h u s, studies which have them are usu ally not surveys.
H ow ever, in the analysis of surveys one__subgroup in the sam ple
m ay be com pared with another (for exam ple, m en versus w om en,
or old versu s young). C om parisons are being m ade but there. is no
sense in w h ich one group is actin g as a control to another group. Ali
the in d ivid u ais have been selected at the sam e tim e and then
in tern ai com parisons are m ade.
Terms o f identification
U se o f the term survey in a p ap er should identify the m ethod, but
so m etim es the term is m istakenly used for what is really a cohort
study. Cross-sectional is a helpful term because it is seldom used
w ith an y other research m ethod. T he term s saniple and random
sample are unhelpful because they often app ear in the description of
the o th er research designs. T h ere are m any different w ays of
d raw in g a sam ple, described by the term s stratified, cluster, and
systematic. T hese term s are seldom used with the other research
m eth o d s, except stratified, which can be used in a clin icai trial.
C o h o r t studies
C o h o rt studies are used to find out w hat happens to patients. For
ex am p le, they could invesdgate how long patients w ith acute lowb ack p ain take to recover; or they could m onitor the natural history
1DENT1FVING T H E R E SE A R C H M ETHOD
Essential features
T h e d efin in g ch aracteristic of cohort studies is the elem ent of
tim e: in co h o rt stu d ies tim e flows forwards. A set of individuais is
id en tified at one p o in t in tim e, and followed up to a later time to
asc ertain w h at h as h ap p en ed . T he direction o f tim e is always
forw ard s. S tu d ies in w h ich individuais are selected at one point and
traced b ack w ard s to see how they w ere at some tim e previously are
n ot co h o rt studies.
Complications
C o h o rt stud ies can be read ily im agin ed as identifym g a group of
p atien ts an d fo llo w in g th em into the future. H ow ever, som e studies
iden tify a set of p atien ts at some tim e in the p ast and follow them
up to the p resen t. T h e se studies m ight at first ap p ear to be looking
b ack w ard s in tim e, b ut th ey are not. T im e flows forwards from the
p oint at w hich the p atien ts are identified.
Terms o f identification
Clinicai trials
C lin ic a i trials sh o u ld be the easiest m ethod to identify. T hey are
u sed to test w h eth er o n e h ealth care intervention is superior to
an o th er. C lin ic a i tria ls are often described in term s o f testing
d ru g s, b u t they can be u sed to investigate m an y different types of
h ealth care in terv en tio n : surgery, vaccin atio n, anti-pressure sore
m attresses, an d h ealth education. W h en a co m p letely new type of
treatm en t has b een developed it m ay be tested against a placebo
b ecau se th ere is no other treatm ent to com pare it against.
1DENT1FY1NG T H E R E SE A R C H M E T H O D
E s s e n tia l fe a tu r e s
C lin icai trials are always concerned w ith effectiveness. A
characteristic of w ell-conducted clinicai trials is that they iden tify a
set of patients with a diagnoscd disease, and then ran dom ly allocate
them to the new or current best treatm ent. T h e focus of the study
should be on the outcom e of the treatm ents, seeking the one w hich
is superior. C lin icai trials are also concerned w ith the side-effects of
treatm ents.
C o m plications
Som etim es cohort studies are used to assess effectiveness; in
such stu d ies a group of treated patients is follow ed up to see how
m any gain benefit. C ohort studies are a poor m ethod of assessing
a treatm en t and can be severely cnticised: it is difficult to m ake a
fair com parison betw een treatm ents in a cohort study.
T h e b rief outline o f clinicai trials stated that two treatm en ts were
being com pared. T h is is com m only the case, b ut in som e instances
m ore than two treatm ents can be investigated. D oing so adds to the
com plexity of the stu d y and its analysis, although the resultin g
study can still be a v alid clinicai trial.
T e rm s o f Identification
T h e term s effectiveness, efficacy, and evaluation or phrases like
assess the value o f or improve the outcome, are often used in papers on
clin icai trials. T h e term s double blvid and plucebo-controlled are
seldom used except in a clinicai tnal. RaridoiTi alloccition of patients
to treatm ents is essential for a fair com parison, and th us its
presence suggests that a study is a clinicai trial. (H owever, the term
random selection is m ore likely to refer to a su rv ey.)T h e sim ple term
outcome can feature in cohort studies as w ell as in clin icai trials.
/ Case-control studies
" '.' C ase-co n tro l studies ask w hat m akes a group of individuais
different. ften the group o f individuais w ill have som e disease, in
which case the question w ill be directed at the causes of their
disease. In other instances the individuais will have behaved in
som e u-ay^TucTs^failed to com ply with therap y or failed to attend
for a clin ic appointm ent.
10
1D EN TIFY 1N G T H E R E SE A R C H M E T H O D
Complications
C ase-co n tro l stu d ies often have an elem ent of um e m w hich a
backw ard s look is taken to p ast events. T h ey look to see w hether
curren t disease co uld have b een caused b y past events. T he
d irectio n ality of tim e is cru cial for distinguishing betw een cohort
studies an d case-co n tro l studies: cohorts look forwards,
control stud ies loo k backw ards.
T e rm s o f id e n t if ic a t io n
As w ell as ca se-cO n trol there are several other term s for this type
o f study, in c lu d in g c a s e - r e fe r r e n t , c a s e - c o m p a r a t o r and
ison T h e possession of a control group is not a defin g
ch aracteristic b ecause clin icai trials should also have one, an
cohort stu d ies often do. B ecau se the m ethod looks backw ards in
tim e it is so m etim es called a r e tr o sp ectiv e stu d y, but this term can be
used for co h o rt studies.
11
4 Interpreting the
results
Statistical significance
In the past som e papers sim p ly presented tables and graphs w ith
a description and exp lan atio n of the main findings. It is now
conventional for research studies to assess the statistical signi
ficance of the findings thro ugh statistical tests. T h e need for.
statistiaj.j e s t s arise sjo ec au se o f the pervasiveness o f the play of
chance.
IN T E R P R E T 1N G T H E R E SU LT S
run w e exp ect ap p ro x im ately equal num bers of boys and girls, the
p la y o f ch an ce m ean s th a t we seldom get a 50:50 sp lit in small
sam ples.
T h e effects of the p la y o f chance are seen everyw here in medicai
research. Sup p o se two treatm en ts are being com pared in a clinicai
trial. P atients w ill have b een ran dom ly allocated to the two groups.
R an d o m isation p rotects against system ati differences betw een, the
two gro up s, b ut it does not prevent d ifferen ce arisin g by chance.
F orexam p le, it co u ld h ap pen th at slighdy m ore o f the severely ill
p atien ts co uld have b een allo cated to one treatm ent group, creating
an ap p aren t difference b etw een the treatm ents even if there were
n o n e. In p ractice it is v ery u n u su al for the two groups in a clinicai
trial to be exactly the sam e: there are often sm all chance differences
b etw een them . L ess often, there are very large chance differences
b etw een the two groups.
T h e im p o rtan ce of the p la y o f ch ance lies in the extern tojwhich
it m igh t.h ave aiffectiHlfie^oljserxed. .results. So m etim es, wTiatKjjkS^
T ik e a n interestirig r'sTf'may fin ally prove to be a statistical^fluke^/
F o rtu n ately, statistical m eth ods allo w us to estim ate w heth er or not
the observed resu lts co u ld be due to the p lay of chance. C entral to
th ese m ethods is the co n cep t o f probability.
Probability
T h e p ro b ab ility o f th ro w in g a six (w ith a fair, six-sided dice) is
o n e in six. T h e p ro b ab ility of a sin gle ticket w in nin g the national
lo tte ry is one in 14 m illio n . P ro b ab ilities a r e j jim ply a w ay of
d esc rih in g how lik elv .it is that an event w ill h appen. T h e y are often
exp ressed as d ecim al fractio n s, w here one in six becom es 0 1 6 7 .
T h e in terp retatio n o f p ro b ab ilities is quite straightforw ard. W hen
an event has a very sm all p ro b ab ility, for exam ple, 0 0 0 0 1 , it is very
u n lik ely to h ap p en . W h en the p ro bability is large, say 0 9, the event
is v ery likely to h ap p en .
P ro b ab ilities v ary b etw een 0-0 an d 1-0, w here zero m eans an
event w ill n ever h ap p en and 1-0 m eans it is certain to happen.
T h u s, the p ro b ab ility th a t a h ealth y ad u h w ill even tually die is 1 0,
b ecau se we ali die so m etim e. In contrast, the p robability of the
ad u lt dying tom orrow is less than one in 100 000, i.e. 0 00001. It
is not qu ite zero b ecau se som e u n likely event, such as being run
over by a b u s, m igh t ju st h ap pen . B ut it is very sm all because
u n lik ely events are u n lik e ly to happen.
P ro babilities lie at the h eart of statistical tests. T h ey are often
term ed P -values, in w h ich the lette r P stan ds for probability. They
-s \ iJt_
1 3
IN TE R PRE TIN G T H E R E S U L T S
IN TE R PRE T1N G T H E R E SU L T S
every 20 sign ifican ce tests which have been carried out. (T his is
b ecau se P = 0 0 5 actu ally says that the chance alone could create
the result one tim e in 20.) T here are two corollaries. First, studies
w h ich have co n d ucted a m u ltitu d e of significance tests will
re g u larly en co u n ter puriously significant results. Second, sm aller
P -valu es, say P < 0-01 or even P < 0 -0 0 1 , give increased confidence
th at the resu lt w as not a chance affair.
C o n fid e n c e in t e r v a ls
C o n fid en ce in terv als provide an alternative way of assessing the
effects o f ch an ce. T h e y can also be m ore inform ative. Suppose a
clin icai tria l of two antihypertensive drugs showed that one lowered
d iasto lic blood pressure by an average of 15 mm H g, w hereas a
second low ered it b y an average of only 5 mm H g. T he average
difference o f 10 m m Hg seems im pressive, but we know that this
v alu e co u ld be influenced by chance. T h e im portant question is
w h eth er the tru e v alu e for the difference betw een treatm ents could
b e as low as zero (i.e . no difference). V^s^.tjdj.this from_die 9 5 /o
confidence in terval. It gives the range w ithin w hich we are 95%
certain th at the tru e value lies. If the antihypertensive trial h ad
given a ran g e of 3 to 17, we w ould say that the true value could be
as low as 3 or as h igh as 17. Zero lies outside this interval so we
co n clud e th at it is un likely to be the true value. T h is is equivalent
to o b tain in g a P -value of P < 0-0 5,. i.e. the result achieved statistical
sign ificance. (N ote th at if the confidence interval had been w ider,
say ran gin g from - 4 to 24, the interpretation w ould be different.
T h is ran ge in clu d es zero, so th at it could be the true value. We
w ould th en co n clude that we have no evidence that there is a
difference b etw een the treatm ents.)
C o n fiden ce intervals are alw ays interpreted in the same w ay.
w h eth er the research method w as a clin icai trial, a cohort study, or
w hatever. T h e y are inspected to test the proposal that there was no
effect, for exam p le, no difference betw een two groups. If the zero
difference lies w ith in the confidence interval we conclude th at
there w as no effect. If it lies outside the range, we exclude no effect
as being unlikely. T h is is equivalen t to saying that the result w as
statistically sign ificant.
T h e ad van tage of confidence intervals is that they do more than
in d icate w h eth er the result m igh t be a chance effect. T hey show,
allo w in g for the p lay of chance, how sm all and how large the tru e
size of effect m igh t be.
15
IN T E R P R E T IN G T H E R E SU L T S
Outliers
me
1'
fe S e tC ^ - V
coped w ith.
^ Skew
So m e m easurem ents, such as length ^
X h h la t a e s
grouped around an average value but w ith a lon g ta .l of h.ghi value s.
T he h ieh values cant be called outliers b ecau se th ey are not really
separate from the bulk of the data. In stead, th e y g rad u ally shade
away from the m ost com m on values. T he p resen ce of a lon g ta,l of
observations on one side of the average is c a lle d skew ness. Ske
acts in the sam e w ay as outliers to disto rt a u s u c ^ ests. lt
com m only^een when the observations
i r L r i L l ^ W
patients.
N T E R P R E T IN G T H E R E SU LT S
t nsfo ,n8 th r
, rc
* 1
zzz
; : c t>o, * * .
- * - *-*
how it w as co p ed w ith .
N o n - in d e p e n d e n c e
it w ould be
assu m ed th a t m e asu rm g
e ^
ch ild .T h e assum ption that
effect on th e m e asu rem en t o f * e next c
m of
spedal
r r ;
S e r e n d ip it y m a s f c e r a d i n g a s h y p o th e s is
f nr m ost research studies collect q u ite large am ounts o
M an y, if n o t m o st, re sear
been set
p to test a few
hjpothes^ Howev r,
mPay other avenues which can be xplored^ ^
s p
e c f i c
g ,
d etailed
not to exp lo re th e d ata lu i y.
Chance observations a p re se n te U s ,f * e , ^ J
w ere b ein g te ste .
of wo
e s te d Ein s u b s e q u e n t stu d ie s. A
b e u s e d t o g e n e ra te a n d te s , h y p o th e s e s .
single s e .
rv--.
IN T E R P R E T IN G T H E R E S U L T S
IN T E R P R E T IN G T H E R E S U L T S
B lack-box an alyses
A ppropriate statistical tests increase the credibihty of a paper.
However, it does not follow that the more sophisticated the tests
used the m ore authoritative the findings. P resent d ay C om puter
packages m ake it very easy to carry out com plex an alyses, even
when the u ser has an incom plete understandm g of the m ethod
being u sed . T h e m ore com plex analyses tend to m ake m ore
assum ptions about the data b ein g analysed. It is also m u ch easier to
make m istakes durin g the analysis.
Papers w h ich present only the results of some com p lex analysis
should be view ed w ith some suspicion. T h is approach makes. 1
more d ifficult to identify problem s in the data, such as skewness
and o u tliers. It is b etter if the results of m ore sim ple analyses are
presented first. T h en these can be checked to see w hether they
accord w ith the m ore com plex. If discrepancies are found they
should be explained in the text of the paper. U n exp lain e
discrep an cies cast doubt on the propriety of the analysis.
S e
= . ^
T^ i r
eed ; b er r ed
^
has occurred can assist in its Id e n tific a tio n .
b ias
Confounding
pressure tend s to
the n ew ^ P er'
observed relationship.
^
d
hypertension causes
5 Introduction to the
check lists
IN T R O D U C T IO N T O T H E C H E C K U S T S
H ow ever, the item s listed in the essen tial ap praisal secuon specify
S r m l flaw s w h ich c a n beset ea ch type o f m e th o d I f . stu V
IN T R O D U C T IO N TO
Table 5.1
T H E C H E C K L IS T S
A s s c s s m e n t o f t h e i m p o r t a n c e o f f ia w s
'
'
~Z
F la w s e n c o u n t e r e d
T v d c o f fla w
ly p e 01 n a w
(n a tu re a n d s iz e )
D ir e c t io n o f e f f e c t
Is th e f in d in g
o f th e s t u d y
^ g a te d ?
Design
Conduct
I n te r p r e ta tio n
6 The standard
appraisal questions
published
c/>mipnrp
m
w
hich
Inform
ation
is
p
resen
tea
f
the seq u en ce m
pap ers:
A re the aim s clearly stated?
W as the sam p le size justified?
,
Are the m easu rem en ts lik ely to be valid and reliable.
Are the statistical m ethods described?
D id u n to w ard events o ccur d u n n g the study.
W ere the b asic data ad eq u ately describe
D o the n u m b ers add up?
W as the statistical significance assessed.
W h at do th e m ain findings m ean?
H ow are n u ll findings interpreted?
A re im p o rtan t effects overlooked?
H ow do the results com pare w ith previous repoi:
W h at im p licatio n s does the study have for your practice.
?
tackled an important
T H E ST A N D A R D A P P R A ISA L Q U E STIO N S
T H E ST A N D A R D A P P R A ISA L Q U E ST IO N S
thus beyond the researchers control, but often such events signal a
study of poor quality.
, ,
"
S ub jects are som etim es lost from parts of a study, either b ecause
they tru ly d isap p eared , or because m easurem ents m ade on them
were not in clu d ed in the final report. M any papers present several
tables in w h ich the data are subdivided in different w ays. T h is
provides an oppo rtun ity to check for absent subjects and m issing
data. Ideally, the num b er of subjects in ali tab les should add up to
the value stated at the b eginn ing of the results sectio n (som etim es
the n u m b er of subjects is given in the m ethods). Inconsistencies in
the n um b er of subjects should be explained in the text. F ailure to
do so indicates som e sloppiness; the authors m ay not have checked
the tables for typ in g errors, or they may be u n co n cern ed about the
consequences of m issin g data. Sm all discrepancies (of the order of
1%) are u n likely to have m uch im pact on the findings, but large
discrepancies are a serious hazard w arning.
TH E STAN D ARD A P P R A IS A L Q U E ST IO N S
T H E ST A N D A R D A P P R A ISA L Q U E ST IO N S
fam iliar with a particular field should be more circum spect about
accepting the claims of a single paper.
29
7 Appraising surveys
T his chapter presents the issues which are relevant prim arily to
surveys. These are arranged in two parts: the brief essential
questions and the more detailed specific questions. An explanation
is given for each item. These lists are then combined, at the end of
the chapter, with the list of standard questions described m
Chapter 6. These lists provide a complete guide to the appraisal of
surveys.
A PPR A ISIN G SU R V E Y S
APPRA1S1NG SU R V E Y S
A PPR A 1SIN G SU RV EY S
A PPRA ISIN G SU R V E Y S
34
A P PR A ISIN G SU RV EY S
Conduct
D id untoiuard events occur during the study?
Analysis
Were the basic data adequately described?
Do the numbers add up?
Was the statistical significance assessed?
Interpretation
W h at do the m ain findings mean?
T h e qu estio ns in italics are the stan d ard ones which w ere described in C h ap ter 6.
35
8 Appraising cohort
studies
This chapter presents the issues which are relevant prim arily to
cohort studies. These are arranged in two parts: the brief essential
questions and the more detailed specific questions. An explanation
is given for each item. These lists are then combined, at the end of
the chapter, with the list of standard questions described in
Chapter 6 .These lists provide a complete guide to the appraisal of
cohort studies.
A P PR A ISIN G C O H O R T S T U D IE S
A P P R A IS IN G C O H O R T ST U D IE S
A PPRA 1SIN G C O H O R T S T U D IE S
A PPR A ISIN G C O H O R T S T U D IE S
control group, chosen because they live far from the plant, could
actually work at the plant. M easuring exposure can be difficult,
especially if the c o n c e rn is with chronic exposure, when individ
uais exposure leveis can change over time. T h u s, whenever
exposure/intervention is described, two key questions to be asked
are How accurately was exposure m easured and Could some of
the Controls have been exposed?.
A P PR A JSIN G C O H O R T ST U D IE S
41
A P PR A ISIN G C O H O R T S T U D IE S
Conduct
D id untoward events occur during the study?
Analysis
D id the analysis allow fo r the passage o f tim e?
Do the numbers add up?
Were the basic data adequately described?
Was statistical significance assessed?
Interpretation
W hat do the main findings mean?
W h a t else m ight in flu en ce the o b se rve d o u tco m e?
How are null findings interpreted?
A re importam effects overlooked?
How do the results compare with previous reports?
W hat implications does the study have fo r y o u r practice?
T h e questions in italics are the stan d ard ones which w ere describ ed in C hapter 6.
42
9 Appraising clinicai
trials
A PPRA ISIN G C L IN IC A L T R IA L S
A P P R A ISIN G C L IN IC A L T R JA L S
A PPR A ISIN G C L IN I C A L T R IA L S
46
A PPR A ISIN G C L IN IC A L T R IA L S
A PPRA ISIN G C L IN IC A I. T R JA L S
not negate the whole study. Careful statistical analysis can go a long
way to take account of such differences. Thus, when they exist, the
question is whether they have been allowed for in the analysis.
A PPR A ISIN G C L IN IC A L T R IA L S
Conduct
How was the randomisation carried out?
D id untward events occur during the study?
Analysis
Were the treatm ent groups com parable at baseline?
Were results analysed by intention to treat?
Was the statistical significance assessed?
Were the basic data adequately described?
Do the numbers add up?
Interpreta tion
W hat do the main findings mean?
How are nullfindings interpreted?
A re importam effects overlooked?
How do the results compare with previous repons?
What implications does the study have fo r your practice?
* T h e q u estio n s in itah cs are the stan d ard o nes w hich were d escrib ed in C h apter 6.
49
10 Appraising
case-control studies
50
A P P R A ISIN G C A S E -C O N T R O L ST U D IE S
APPRA ISIN G C A S E -C O N T R O L S T U D IE S
there are many forces of selection which determine where cases are
managed, it is best if both cases and Controls have been subject to
the same forces. Controls are usually selected to be sim ilar in terms
of age, scx, social class, and area of residence to the cases.
The obsession with the com parability of Controls follows from
the analysis of these studies: evidence about causes of disease
comes from a comparison of cases and Controls. If differing forces
of selection introduce differences between cases and Controls, these
might be mistaken for risk factors for the disease.
Were data collected in the same way for cases and Controls?
Having recruited cases and Controls, both must be asked about
past exposure to potentiai risk factors. This information should be
obtained in_the same way for each, whether by interview, postal
questionnaire or case-note review. But even using the same method
may not produce identical information gathering. If the data
collector knows which are cases and which Controls, then interviews or case-note searches m ay be influenced. Blinding to case
ojntrol status should be used where possible.
Another source of probems is recall bias: patients who have a
serious disease tend to review their past history in detail to find an
explanation for their illness. T hus they are more likely to report
events which the Controls m ight forget. Case-control studies are
bedevilled by biased information gathering. The details of the datacollection techniques need to be scrutinised carefully to determ ine
whether data collection was identical for cases and Controls.
A P P R A ISIN G C A S E -C O N T R O L ST U D IE S
54
A P PR A ISIN G C A S E -C O N T R O L S T U D IE S
Conduct
D id untow ard events occur during the study?
Analysis
Were the basic data adequately described?
Do the numbers add up?
W as th ere d ata -d red g in g ?
as the statistical significance assessed?
Interpretation
W hat do the main findings mean?
W h e re are the biases?
C o u ld th ere be co n fo u n d in g ?
H ow are null findings interpreted?
A re im portant effects overlooked?
Hoiu do the results compare tvith previous reports?
W hat implications does the study h ave fo r you r practice?
* T h e qu estio ns in italies are the stan d ard ones which were d escrib ed in C hapter 6.
55
11 Appraising review
papers
This chapter presents the issues which are relevant prim arily to
review papers and meta-analyses. These are arranged in two parts:
the brief essential questions and the more detailed specific
questions. An explanation is given for each item. These lists are
combined, at the end of the chapter, with those items from the
jist of standard questions which are relevant to this type of study.
These lists provide a complete guide to the appraisal of review
papers and meta-analyses.
A P P R A I S IN G
r e v ie w
PAPERS
A P P R A ISIN G REVIEW P A PE R S
a p p r a is in g
r e v ie w
papers
A P P R A ISIN G
r e v ie w p a p e r s
61
A P P R A IS IN G R E V IE W PAPERS
Conduct
W ere the d etailed stu d y designs review ed ?
W as m issing in fo rm a tio n sought?
Analysis
Were the basic data adequately described?
W as p u b lica tio n bias taken into accou n t?
W as h e tero g e n e ity o f effect investigated?
Interpretation
W hat do the main findings mean?
A re th e re o th e r findings w hich m e rit attention?
A re the con clu sio n s justified?
How do lhe findings compare zuiih previous reports?
W hat unplications does the study have fo r you r practice?
* T h e qu estio ns in italics are the standard ones w hich were describ ed in C hapter 6.
62
Index
features 1 0 - 1 1 ,5 0
generalisation o f results 51
limitations 52
questions about 2 3 -9 , 5 0 -5
terminology 11
case-referrent 11
causality 39
chance 1 2 -1 3
P-values and 14 -15
see also probability
chance observations 1 7 ,3 3 - 4
check lists 20-1
see also questions
clinicai trials
accounting for patients 4 4 - 5
appraisal of 4 3-9
bias 4 5, 46-7
choice o f subjects 45
complications 10
deviations from planned
treatment 48
features 9 - 1 0 ,4 3
ntention to trcat 48
outcome assessments 45
placebos 9
powerful evidence 58
questions about 2 3 -9 , 4 3 -9
randomisation of patients 44,
4 5 -6
relevance of outcomes 47
side-effects 1 0 ,4 8
terminology 10
treatment allocation 44
treatment description and
administration 46
of treatment effectiveness 10
treatment group comparability
4 7-8
IN D E X
cohort 9
cohort studies
ageing of participants 40
appraisal of 36-42
complications 9
control and comparison groups
9 ,3 7
design and aims of 39
exposure/intervention
measurcment 3 9 -4 0
features 8 -9 , 36
follow-up 3 7 -8
generalisation o f results 37
influences on outcome 4 0 -1
outcome measures 40
questions about 2 3 -9 , 3 6 -4 2
sampling 37
terminology 9
and treatment assessment 10
comparison groups
in cohort studies 9
in surveys 8
C o m p u t e r analysis 18
Computer searches, of research
papers 57
confidence intervals 15, 27
and null findings 28
confounding 19
in case-control studies 5 3-4
control groups
in case-control studies 11,
5 1-2
in cohort studies 9, 37
in surveys 8
cross-sectional 8
data
description 26
missing 25, 26
numerical discrepancies 26
see also sampling; subjects
data-dredging 54
design changes 25
doublc blind 10
cffcci
h e ic r o g e n e iiy o f 60
s iz e o f 2 4 , 2 7 , 2 9
effectw cn css 10
effic a c y 1 0
evaluation 1 0
64
effects overlooked? 28
implications for practice 29
interpretation o f 2 7 -8
null 28
see also generalisation; results
flaws
analysis pitfalls 1 6 - 1 9
evaluation of 2 1 - 2
finding and assessing 5
in sampling 31
in statistical procedures 3 3-4
use of check lists 2 0 - 1
Jollozu-up 9
follow-up, in cohort studies 3 7 -8
generalisation, o f results 34
case-control studies 51
cohort studies 37
surveys 30, 34
grab samples 31
heterogeneity o f effect 60
hypotheses 4
and chance observations 17
null 14
independence 17
information needs 2
intervention, measurement of
3 9 -4 0
likely case 2 0 -1
literature search 1-2
measuring techniques 2 4 -5
misclassification bias 53
multicentre studies 24
non-independence 17
non-response, see response rate
null findings 28
null hypotheses 14
observers, multiple 24
o n tco m e 9, 10
assessing vatu e o j 10
xmproving 10
outcomes, in cohort studies 40,
4 0 -1
follow-up data 38
IN D E X
outliers 16
pilot studies 25, 33
placebo-controlled 10
placebos 9
practice, implications o f study for
29
probability 1 3 - 1 4 , 1 4 - 1 5
see also chance
prospeciive 9
purpose o f research study, see aims
P-values 1 3 - 1 4 , 1 4 - 1 5
quasi-randomised treatments 44
questionnaires 4
questions
for al! research papers 23-9
about case-control studies
2 3 - 9 , 5 0 -5
check lists 2 0 -1
about clinicai trials 2 3 -9 , 4 3 -9
about cohort studies 2 3 -9 ,
3 6 -4 2
when reading research papers
3 -6
on review papers 5 6 -6 1
about surveys 2 3 -9 , 3 0 -5
random allocation 10
random isation, in clinicai trials
4 4 ,4 5 -6
random isation codes 4 5 - 6
random sample 8
random selection 10
reading
appraisal o f 2
clarifying reasons for 1
identifying reports 2
selectiviry ir. 2
specifying inform ation 2
steps in 1
recall bias 52
reliability 25
research papers
abstract 3, 5
aims 4
C o m p u t e r searches 57
discussion 5, 6
implications 5 -6
introduction 3 -4 , 6
m ethodology 4, 7
presentation o f data 4
references 6
sensitivity analyses 58
structure o f 3
title 3
unpublished 5 9 -6 0
weighting systems 58
see also case-control studies;
clinicai trials; cohort studies;
flaws; questions; reading;
results; review papers; surveys
response rate 32
follow-up o f non-responders 33
reasons for non-response 32
results
analysis pitfalls 1 6 -1 9
comparison with previous
reports 2 8 - 9
fulfilling study aims? 5
interpretation o f 5 ,1 2
presentation o f 4 -5
statistical significance 12 -15
see also findings; generalisation
retrospeciive 9, 11
review papers
appraisal o f 5 6 - 6 1
bias in 57
conclusions 6 0 - 1
focus o f 5 8 -9
heterogeneity o f effect 60
identification o f research papers
57
missing information 59
other findings? 60
publication bias 5 9 -6 0
quality o f research papers 57
questions about 5 6 -6 1
review o f study designs 59
summary of results 58
unpublished studies 5 9 -6 0
sample 8
sample size 1 2 - 1 3 , 2 4
sampling
cluster 8
in cohort studies 37
flawed 31
grab samples 31
selection bias 33
stratified
65
IN15BX
in surveys 7, 8, 31
sysiem aiic 8
selecon, see sampling
selection bias 33
sensitivity analyses 58
serendipity 17, 3 3 -4
side-effects 10, 48
significance tests 1415
multiple 54
on survey data 3 3 - 4
see also statistical tests
skew 1 6 - 1 7
statistical significance 1 2 - 1 5 , 2 6 -7
statistical tests
assumptions in 1 7 ,2 5
description o f methods 25
logic o f 1 4 -1 5
number and complexity of 25,
26
pitfalls in analysis 1 6 - 1 9
see also chance; probability;
significance tests
strafied sampling 8
studies, see research papers
subgroups, analysis by 17
subjects
missing 26
variability o f 26
survcillancc bias 53
survey 8
surveys
appraisal of 3 0 -5
complications 8
66
n
S8$
w
m
H ^V i
iP
Q
'M
A;*&
*+??*.
RH
hps
^fei
:friQ
#
?&
.giffv'