Professional Documents
Culture Documents
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Institute of Mathematical Statistics is collaborating with JSTOR to digitize, preserve and extend access to
Statistical Science.
http://www.jstor.org
StatisticalScience
1990,Vol. 5, No. 1, 2-34
Quantifying
Probabilistic
Expressions
FrederickMostellerand Cleo Youtz
Abstract.For 20 different
studies,Table 1 tabulatesnumericalaveragesof
opinionson quantitative
meaningsof 52 qualitativeprobabilistic
expressions.Populationswithdiffering
occupations,
mainlystudents,
physicians,
othermedicalworkers,and sciencewriters,contributed.
In spite of the
varietyof populations,formatof question,instructions,
and context,the
variationoftheaveragesformostoftheexpressions
was modest,suggesting
that theymightbe usefulforcodification.
One exceptionwas possible,
becauseit had distinctly
different
meaningsfordifferent
people.We report
newdata froma surveyofsciencewriters.
The effectofmodifiers
suchas
veryor negation(not,un-,im-,in-) can be describedapproximately
by a
simplerule.The modifiedexpressionhas probability
meaninghalfas far
fromtheappropriate
boundary(0 or 100) as thatoftheoriginalexpression.
This paperalso reviewsstudiesthatshowstabilityofmeaningsover20
years,mildeffects
oftranslation
intootherlanguages,context,smallorder
and effects
ofscale forreporting
on extremevalues.
effects,
The stemprobability
withmodifiers
givesa substantialrange6% to 91%
andthestemchancemightdo as welliftriedwithvery.The stemsfrequent,
probable,
likely,
and oftenwithmodifiers
produceroughly
equivalentsetsof
means,butdo notcoveras widea rangeas probability.
Extremevaluessuch
as alwaysand certainfallat 98% and 95%,respectively,
and impossible
and
neverat 1%.
The nextstepwillbe to offer
codifications
andsee howsatisfactory
people
findthem.
Keywordsandphrases:Quantifying
language,codifying
language,meaning
ofqualitativeexpressions.
INTRODUCTION
misunderstood,
whetherthe emitters
or receiversare
physicians,patients,scientists,science writers,or
In everyday
language,peopleapplytheexpressions
othercitizens.By associatingnumericalvalues with
alwaysand certainto eventsthatoccurin fewerthan
specificqualitativeexpressions
we mayultimately
im100%oftheiropportunities;
furthermore,
on average,
provecommunication.
In the long run we plan to
peopleregardveryhighprobability
as morelikelythan
proposesomecodification.
In thispaper,we wantto
almostcertain,
a surpriseto manyofus. Communicareportwhatsuchexpressions
currently
meanbysumtionsthatemployqualitativeexpressions
forfrequenmarizingresultsfrommanystudiesand givingnew
cies or rates of occurrencerun the risk of being
findings
fromsciencewriters.
In this firsttreatment,
our intendedreadersand
users are scientistsincludingstatisticians,
because
Frederick
Mostelleris RogerL Lee Professor
ofMaththeirlanguageoftenneeds interpretation
to wider
ematicalStatistics,Emeritusat Harvard University audiencesthan specialists.Statisticianshave special
and Directorofthe Technology
Assessment
Groupin
skills,interests,
and stakesin communication
of inthe HarvardSchoolof PublicHealth. Cleo Youtz is
formation
aboutprobabilities.
Theymayalso wishto
Mathematical
Assistantin theDepartment
ofStatistics participatein producing
a codification
ofprobability
at HarvardUniversity.
expressions.
As thisworkis stillinprogress,
have twomainforms.First,in some
readersare encourCodifications
aged to send commentson the paper to Frederick areasofworka fewstandardexpressions
maybe used
at theDepartment
for
all
Mosteller
to
of
belief
or relative
situations
express
degree
ofStatistics,
ScienceCenFor
Kent
codified
some
frequency.
ter,1 Oxford
example
[33]
Street,Cambridge,
Massachusetts
02138.
2
in theintelligence
fieldin termsofodds.
expressions
He used as a base almostcertain,chancesare good,
and chancesaboutevenand quotedoddsofat least9:1
in favor,at least 3:1 in favor,and 1:1 or a 50-50
withthereverseoddsgoingwith
chance,respectively,
not hapabout something
statements
corresponding
seem
pening.Whetherwe-likeit or not,statisticians
thatassociates
a similarcodification
tohavegenerated
significant
numbersless than 0.05 withstatistically
and numbersgreaterthan 0.05 withnotstatistically
significant.
wouldmerelyrelate
A secondformof codification
sets of probability
expressionsto certainrangesof
numbersso as to offersomewhatmoreprecisionin
Inforlanguageto thosewhowishto use thefindings.
mationin the presentpaper could be used forthis
dishereinvitespreliminary
purpose.Its presentation
thatcouldbe the basis foradcussionand criticism
ditional work before firmingup either form of
codification.
59.4%ofthephyForthebeta-blocker
antihistamine.
"lessthan1 perthousand,"
sicianschosethecategory
foran antiwhereas20.7 percentchosethiscategory
arises
thatthedifference
Mapes suggested
histamine.
are
because the side effectsfroman antihistamine
His
muchmildercomparedto thoseof,abeta-blocker.
pointis thatthemeaningofrarechangeswithcontext.
On the otherhand, physiciansmay actuallyhave
ofthe
different
oftherateofside effects
perceptions
twotypesofmedication.
PresidentGerald R. Ford said that a swine flu
epidemicin the 1976-1977season was "a veryreal
estimatesfrom
[23]gotprobability
Boffey
possibility."
fourexperts:2%, 10%, 35%, and "less than even."
Boffeyconcludedfromthese estimatesthat the
chancesoftheepidemicare"farlowerthantheofficial
rhetoric... would lead one to expect." Thus he sees a
betweenthemeaningofthe qualitative
disagreement
and an averageof
expression"veryreal probability"
perhaps20 to 25%. Mosteller[28]useddatafromCliff
[24] and Selvidge[17] to estimatea publicmeaning
at 29%.
of"veryrealpossibility"
thevariationin perThese twoexamplesillustrate
expressions.
ceptionofthemeaningofprobability
CONTEXT
It has oftenbeen pointedout that the context
the
counts,rates,oramountscan influence
employing
estimate.For an examplementionedto one of the
authorsby Leo Crespi,"a handfulof grapes"and "a
handfulofpeopleon thebeachat ConeyIsland"could
the firstin the range
estimates,
implyverydifferent
of 5 to 20 grapes,and the secondhundredsor even
ofcourse,complithousandsofpeople.This problem,
We think
expressions.
catesmattersforprobabilistic
ofthestudiesdiscussedhereas dealingwithprobabilfor
itiesin theregionfrom1 to 99%. The probabilities
veryrareevents,suchas atomicdisastersand regional
blackoutsfromfailureofthe powersystem,are very
to laymen;and,as weunderto communicate
difficult
stand,muchremainsto be learned.Communicating
problem
the idea ofverysmallrisksis an important
in theriskfield;it deservesand has its ownresearch
and Lichtenstein
[31]).
effort
(Slovic,Fischoff
Some authors have emphasizedthe differences
thatchangingcontextcan create.For example,in a
of propositions,
Tverskyand
studyof formulation
Kahneman [32] show verysubstantialchangesin
andan equivalentgambetweena certainty
preference
in termsof
the samepossibilities
ble by formulating
lives saved versuslives lost. Kahneman,Slovic and
Tversky[27]includearticlesbymanyauthorsshowing
effects
ofcontext.Ouremphasisis moreon the near
in Table 1,rather
ofopinionsas illustrated
constancy
side of
thoughthe right-hand
than the differences,
Unweighted Weighted
average
average
Unweighted Weighted
average
average
98
99
38
38
15
14
Notoften
13
13
Not veryoften
9814*,20
42
37
Possible
78
81
High chance
58
58
50
50
41
41
720
14
13
Poor chance
9017a* 9117t*,20
15
13
Low chance
66
66
Liable to happen
36
36
6620
Might happen
77
79
Usually
Always
965 9715a,
941,2
91
91
Almostalways
8919
885
95
97
81
86
81
947
9316a
1320
951,&
Almostcertain
61
Veryfrequent
Frequent
45
45
Notinfrequent
17
17
Infrequent
4520
91
84
91
81
1720
52
Veryinfrequent
4120
Veryhighprobability
16
High probability
1020 2013*
Moderate probability
Low probability
171
1620
Verylow probability
620
82
85
Verylikely
7910
746
69
69
Likely
17
16
Unlikely
616
879
858a,13*,2o
638b,8c
7013*,20
65lo
148b
Veryunlikely
82
85
Veryprobable
1513*
70
69
Probable
16
15
19
Unusually
28
26
Sometimes
87
1915b
1720
189 201,2,14*
65
69
17
Once in a while
39
37
Not unreasonable
22
22
Occasionally
328b
8013* 879,20
6212a
3920
7013*,20
2718b*
282o 293*
3816b
26
23
172
12
12
Seldom
Veryseldom
Rarely
1513* 166,20
181o 2014*
6,8*,18b*,20
873* 8818a*,18b*
571rc
6412b
5915b,
704a,20
16a,
7116a
73*
162,9
19
6017a*
732,18b*
2316
617,16b
743* 755
Veryrarely
61
Almostnever
50
50
As oftenas not
6218b*
647
Never
23*,8d
3,8*,18b*
420
75
instructions
or samplesizes.
list,a, b, c andd indicatedifferent
The subscripts
indicatestudieslistedin thereference
* Indicatesmedian.
185
126
420
62
, ab?20
8018b*
471
1715b 207,18a*
7818a*
506
3716c
5017b*
5920
793*
283*,4a
Very often -
Often
2212b,15a
2012a,,8a*
19
156
Veryimprobable
824b,2o
7420
1920
513* 620 1 10
85
7119
8518a*
11
138b
7015b
8415a
1017a*,17b*
129
3620
19
236 2510
796,10
1410
1320
5120 611
16
114*20
5013*,20
1912b
8020 871
56
Impossible
721o 8013*8220
3611b 3911a
1612a
278b
8120
55
1118b*
9015b 9120
Certain
91lo
86
16c, 19
1003*,,4a,4b,4c,
3820
1b
1516a
Table 1 illustratesdifferences
owingto samples,inor context.
structions
In comparingcontextresultswithno-context(in
isolation)resultsfor 22 expressions,Selvidge [17]
roundedto the nearest5% for
foundthatdifferences
5 for7 expressions,
medianswere0 for10expressions,
and 15 for1 expression.The
10 for4 expression's,
wasforappreciably,
whichwentfrom
largestdifference
25% in isolationto 10% in context.
Pepperand Prytulak[30] offeran elaborateinvestigationfor studyingeffectof contextby creating
andhigh-frequency
forframlow-,moderatecontexts
ing the same expressions,as well as evaluationin
isolation.The main sourceof divergencewas that
in thecontextofairplanecrashesand of
probabilities
ofearthquakes
werefarremovedfrom
theoccurrence
the numericalvalues giventhe same expressionsin
remarks
aboutstudentsmissingbreakfasts
ortheproportionofmenthatMiss Swedenthoughtfoundher
The paperillustrates
thatcontextcan push
attractive.
events
themeaninga goodway,butthatforordinary
are modest.We emphasizeagainthat
thedifferences
we are notconsidering
numerical
evaluationsofprobabilisticexpressionsin such contextsas veryrare
in shorttimeperiods.Contexttakes
eventsoccurring
severalformsin additionto the substantivetopic,
suchas typeof scale used and orderofpresentation.
in the sectionon
We discussthese mattersfurther
specialtopics.
SCIENCE WRITERS
We gathered
data on meaningsofthe52 qualitative
fromsciencewriters
expressions
through
a mailquesFor a situationwithoutcontext,
tionnaire.
thewriters
gaveestimatesforprobabilities
forthe52 expressions
and lowerand upperlimitstheythought
theirreaders
wouldset foreach expression.
The responseratewas
about37%. The averageresponsesare similarto those
fromotherstudiesreportedin Table 1. The relation
oftherangesreported
fortheestimatesis informative
in spiteof the low responserate. Some background
for the special study of the science writersand
the selectionof the 52 expressionsappears in the
Appendix.
RESULTS
Median, Quartiles and Variability
As oneexpects,extreme
expressions
likealwaysand
neverhave smallvariationas measuredbythe interquartilerange(IQR), the distancebetweenthe quartilesof the cumulativedistribution.
As a measureof
variability
of freguency
distributions
of the science
writers'"ownestimates,"
we use the IQR, becauseit
is lesssensitive
thanthestandarddeviation
toextreme
values. More centrallylocated expressions(nearer
50%) usuallyhavebroadervariation.
Examples of Distributions
Figure1 showsrelativefrequency
distributions
(using class intervalsof length10 centeredon certain
of5) forthesciencewriters'
multiples
personalchoices
forpossible,almostalways,and unlikely.We chose
theseexpressionsto give a notionof the varietyof
distributions
To get a feelingforthe
encountered.
behaviorand systematic
movement
ofdistributions
as
the modifiers
fora stemchange,runningdownthe
medianand quartilesin Table 2 fora givenstem
quicklyshowshowthemiddle50% ofthedistribution
moves.Figure1 showsthatpossiblehas a bimodal
distribution.
This bimodalityalreadysuggeststhat
as a qualitativeexpression.
possibleis unsatisfactory
We returnto it below.Otherexpressionswerenot
bimodal(exceptforsuch minorvariationsfora few
as expectedin histograms).
expressions
Relation of Variabilityto Level
The 11 expressionswithmedianprobabilities
between33 and 67 sortthemselvesinto groupswhose
differs.
variability
Group1 containsnumbersthateithercomparedirectlywiththe 50-50 situation(less than an even
chance,evenchance,betterthanevenchance)or compare the chanceof the eventwiththat of the nonevent(less oftenthannot,as oftenas not,moreoften
lead to smallinterquarthannot).These expressions
tile ranges.The six dots at the base of the middle
them.Essentially
thevalue
panelofFigure2 represent
50% is anchoring
the responses,especiallywitheven
chanceand as oftenas not. We mightget similar
anchoringeffectselsewhereif we asked how often
"aboutoncein n times"occurred.
respondents
thought
offer
andarethereGroup1 expressions
highprecision
for
forepotentially
codification.
attractive
F. MOSTELLER
AND C. YOUTZ
TABLE 2
Quartiles,median and interquartilerangeforthe sciencewriters'ownpreferredestimatesfor52 probabilityexpressions,pooledfrom
producedbyFormA and Form B
distributions
25%
Median
75%
IQR
10.3
5.3
19.7
10.1
24.8
19.6
14.5
14.3
7.5
.2
38.5
.3
50.2
.5
42.7
.3
High chance
Betterthan even chance
Even chance
Less than an even chance
Poor chance
77.5
53.3
49.7
39.6
8.4
80.4
57.6
50.0
40.2
10.3
89.1
60.2
50.2
45.0
19.7
11.7
6.9
.5
5.4
11.3
Low chance
Liable to happen
Mighthappen
5.0
59.8
19.9
9.8
68.2
37.6
12.8
77.7
50.1
7.8
17.9
30.2
Usually
65.6
75.1
82.2
16.7
9.9
17.5
9.9
23.5
17.4
25.0
15.3
37.6
26.1
35.0
22.4
52.6
16.3
17.5
12.5
29.1
12.5
9.8
7.4
3.2
20.0
15.1
10.2
4.9
27.7
25.0
17.5
7.7
15.2
15.1
10.1
4.5
Expression
25%
Median
75%
IQR
Always
Almostalways
99.6
89.7
99.7
91.7
99.8
95.2
.3
5.5
Not often
Not veryoften
Certain
Almostcertain
98.7
87.5
99.6
90.2
99.8
95.0
1.1
7.5
Possible
Impossible
Veryfrequent
Frequent
Not infrequent
Infrequent
Veryinfrequent
75.3
60.0
32.7
10.1
3.6
82.6
72.2
49.6
17.3
5.2
89.7
75.3
57.3
22.6
10.0
14.5
15.2
24.6
12.5
6.4
Veryhighprobability
High probability
Moderateprobability
Low probability
Verylow probability
89.8
77.1
40.1
7.8
1.9
92.5
82.3
52.4
15.0
4.9
95.2
87.2
58.7
22.3
7.6
5.4
10.1
18.5
14.5
5.7
Expression
Verylikely
Likely
Unlikely
Veryunlikely
80.1
62.6
9.8
2.7
87.5
71.1
17.2
5.0
90.2
77.6
22.7
9.8
10.1
15.0
13.0
7.1
Unusually
Sometimes
Once in a while
Not unreasonable
Veryprobable
Probable
Improbable
Veryimprobable
81.5
64.7
7.6
1.5
89.7
70.2
12.5
4.8
90.4
77.7
22.3
7.5
8.9
13.0
14.7
5.9
Occasionally
Now and then
Seldom
Veryseldom
Veryoften
Often
More oftenthan not
As oftenas not
Less oftenthan not
77.5
65.0
57.1
49.8
34.8
82.8
72.5
59.8
50.0
40.0
89.9
75.4
60.4
50.3
42.7
12.4
10.4
3.3
.6
7.9
Rarely
Veryrarely
3.6
1.2
7.2
3.0
10.0
5.0
6.5
3.8
Almostnever
Never
1.2
.1
2.9
.3
4.6
.4
3.4
.3
50-
40~
Almost
Always
~40-
Unlikely
20
Possible
20-
30
60X0i
IQ
I *Possible
4C-
30
0
A...
/4S
15 25 35 45 55 65 75 85 95
Estimate
(percenft
ofsciencewriters'own estimatesfor
FIG. 1. Frequencydistributions
threeexpressions,gouped by 10% intervalscenteredat multiplesof
5%.
has
moderate
probability)
Group2 (not infrequentt,
an IQR aboutthesize one wouldexpectfromextrapolationfromthetwoouterthirdsofFigure2.
fromnot
extravariation,increasing
Group3 offers
and mighthappentopossible.Becauseit
unreasonable
70
MediG17
100
expression,
seemeda logicalratherthana quantitative
we did notregardthefirstoftheseas a probabilistic
norwas mighthappena cheerfulchoice,
expression,
becauseofthe
buttheywereincludedforexamination
intheinterval33 to 67.Finally
ofexpressions
scarcity
possibleseemsexcellent,at firstblush,formakinga
from
but recallthebimodality
veryvaguestatement,
Figure1.
1.00
.80
o
S5
Acceptability
VeryUnlikey
~.60
.2020
40
Mode
IQR
Even chance
Always
Never
Impossible
Certain
50
96
2
4
95
99.4
98
97
93
92
5.9
6.3
5.4
6.9
8.1
Sometimes
31
71
64
47
44
51
42
mode
Largeacceptability
Likely
.40
Median
6p
80
in Percent x
Probabllity
100
LargeIQR
Liableto happen
Notinfrequent
Notunreasonable
Might happen
Possible
43
47
58
51
55
24.1
25.6
27.2
31.7
35.0
42.4
Withlargenon-response,
it is customary
in sample
to checkforchangesinresponsebetweenearly
surveys
and late respondents(Mosteller [29], page 215;
Bartholomew[21]; and Hansen and Hurwitz[26]).
For each expression,
we comparedtheaverageprobabeforethesecondmailbilityforthosewhoresponded
forthoseresponding
ingwiththeaverageprobability
afterward.
To look for some systematictrend,we
dividedexpressions
intohigh(averageprobability
over
50%) and low(therest).Thenwe scoredeach expressionas going"up"or going"down"fromearlyto late.
2 x 2 tableexhibiteda slighttendency
The resulting
for"high"expressions
to go "down"and "low"expressions to go "up." A chi-squaredtest correctedfor
fellalmostexactlyat the 0.05 level,3.86
continuity
insteadofthetabled3.84.Although
thismightsound
like a regressioneffect,the grouping"high"versus
"low"is chosenon thebasisofall respondents,
and so
theselectionexplanationwouldnotbe veryrelevant.
The averagereductionforthe "high"groupis 0.4%,
and theaverageincreaseforthe "low"groupis 0.5%.
Roughcalculationssuggestthat 0.4% is about one
standarddeviationfortheaverageofeithertheupper
or lowerhalfof the expressions,
and so the changes
observedare comparableto thosethatsamplingfluc-
Cliff[24] in a studyofthemeaningsofexpressions
in a largercontextfoundevidencethat ratingsof
expressions
weremodifiedin an approximately
multiplicativefashionwhenadverbswereintroduced
to
modify
adjectives.The ratingsfellon a scale havinga
neutralzeropointand runningfromnegative(unfavorable)numbersthroughpositive(favorable).For
example,verywickedhad a rating1.25 timesthatof
wicked.Thus the adverbsprovidedmultipliersobtainedby averagingthe effectsoverthe expressions
wheretheadverbswereused.
In a similarspirit,we want to providesummary
valuesoftheeffects
ofmodifiers
usedinthe52 expressions,thoughwe haveto deal witha probability
scale
runningfrom0 to 100%. As a simplemethodof
we use multipliers
summarizing
that relate to the
distanceto the nearerextreme,
thoughwe couldrephrasethe methodso as to treat50% like the zero
pointoftheCliffscale.
The modifiersformthreeclasses of expressions:
that is
(a) thosewhere"very"reducesa probability
less than 50%, (b) thosewhere"very"inoriginally
that is originally
creasesa probability
greaterthan
50%,and (c) thosewhere"not"(in-,im-,un-)changes
fromoriginally
than50% to less
an expression
greater
than 50%. We findit convenientto workwithdistances fromthe extremesand to relatethe initial
distancefromthenearerextreme(0% or 100%)to the
finaldistancefromits nearerextreme.Whenwe do
summarizes
the
this,a singlemultiplicative
parameter
If x is theinitialvalueand
effect
ofall themodifiers.
then
value,and k is themultiplier,
Xmodis themodified
relationsforthethreeclassesare:
theestimated
(a)
Xvery =
kx
where"very"reducestheestimate),
(expressions
(b)
100
Xvey
k(100 - x)
where"very"increasestheestimate),
(expressions
(c)
Xnot=
k(100- x)
ofnegation:not,in-,un,im-).
(expressions
thesinglevaluek = 1/2,works
Withtheseconventions,
wellforall threegroupsusingtheunweighted
averages
orthesciencewriters'
(orweighted
data) ofall studies
fromTable 1. (We used the averagelog of the ratio
ofthesecondcolumnto thefirstin Tables 4, 5 and 6
Class (a)
Infrequent
Lowprobability
Unlikely
Improbable
Seldom
Rarely
Xvery
17
16
17
16
12
9
7
6
11
7
7
4
TABLE
/2x
8.5
8
8.5
8
6
4.5
Frequent
Highprobability
Likely
Probable
Often
100 - xve,
100-x
45
16
31
30
35
19
9
18
18
15
TABLE
1/2(100- x)
22.5
8
15.5
15
17.5
Veryoften
Usually
100- x
xnot
1/2(100- x)
45
30
31
35
17
16
17
15
22.5
15
15.5
17.5
15
23
13
19
7.5a
11.5a
aSubstantial deviation.
Stabilityover Time
Simpson[18] studiedthevariability
ofquantitative
meaningsfor20 qualitativetermsovertimeby comparingresponsesto twoquestionnaires,
onecompleted
in 1942by 335 studentsand theotherin 1962by 395
students.He usedthemidpoint
oftherangegivenby
the studentforeach expressionas its locationvalue.
For the 20 terms,the averageabsolutedeviationbetweenthe meansof the midpointsforthe 1942 and
1962 responseswas only2.0%. Thus the resultsare
similar.
strikingly
Translation
Grigoriu
and Mihaescu[25] translated30 probabilityexpressions
intoRomanianequivalentsand found
"theaveragenumerical
valuesweresimilarfordifferentprofessional
groups[physicians,
medicalstudents,
and medicalrelatedprofessionals]
and veryclose to
the values reportedin the Englishliterature"(page
364).Among16 expressions
thatappearbothin their
studyand amongthe 52 entriesin Table 1, the unweightedaveragesgivenin Table 1 differed
by 10
percentage
pointsor morefor5 expressions.
Contextand Translation
Beyth-Marom
[22] reportsan elaborateset of exin
periments
carriedoutin a forecasting
organization
Israel with 30 expressionsin Hebrew.These were
translated
intoEnglish,and 8 areamongour52. With
respectto context,she foundmorevariationin the
whentheywere
numerical
evaluationsofexpressions
givenin a contextofthe likelihoodof futurespecific
eventsthanwhenmerelygivenas expressions
to be
evaluated(she speaks of presentingexpressionsin
in commonwithourstudy
isolation).The expressions
in isolationcloseto
producednumerical
probabilities
ours,in spiteofthetranslation
problem.
Order
Orderof presentation
is one kindof context,and
ourFormA versusFormB resultsoffersomeinformationon thistopic.Forfivehigh-probability
expressionswithaveragesbetween90 and 100,thegainwas
smallforFormA overFormB, averaging
just 0.3%.
Forthe 17 expressions
withaveragesbetween50 and
90%,thegainforFormA overFormB averaged2.5%.
For8 expressions
withaveragesbetween20 and 50%,
the gainaveraged1.3%. For 22 expressions
withaveragesbetween0 and 20%,theaveragegainwas negative,-0.3%. Thus,forthemostpart,FormA (listing
fromhighto low)led to slightly
higheraveragesthan
FormB (lowto high),exceptforthevery-low-probawhereFormB tendedto produce
bilityexpressions,
slightly
highervalues.We werepleasedthatorderhad
10
expressionsincludedin thisstudy
The probability
emergedfromsearchesand frompaperssentto us by
scholars.AugustineKong in 1983 made a computer
meanings
searchforpapersdealingwithquantitative
ofprobabilistic
He found18 articles,and
expressions.
theirreferences
led to others.Afterpublicationof
Kong,Barnett,Mostellerand Youtz [8],severalreaders kindlywroteus about additionalpapers. From
about40 articleswe foundmorethan300expressions,
Our
manybeingevaluatedbya sampleofrespondents.
colleagueTimothyReaganreviewedand sortedthem
into three categories:probability,frequency,and
other.AfterreviewingReagan's analysis,Lincoln
forthis
Moses and F. Mostellerchose52 expressions
study.One aim was to choose a set of expressions
whoseassociatednumerical
averageswouldcoverwell
the rangeof probabilities
(expressedin percentages)
from0 to 100%.
Respondents
in thestudiesthatwe summarize
Mostrespondents
andhealthworkinTable 1 werestudents,
physicians,
ers.Newdatainthecurrent
studycomefrommembers
of an associationof sciencewriters.Theirviewsare
inforcommunicate
becausesciencewriters
important
mationfromscientiststo the public,includingother
scientists.
ofthe governing
Withthe cooperation
bodyofthe
ofScienceWriting,
CouncilfortheAdvancement
Inc.,
membersoftheNationalAssociationof
we informed
of
theirnewsletter
ScienceWriters(NASW) through
the proposedsurveyof sciencewriters.Barbara J.
Culliton,Presidentof the Council,wrotea letterto
accompanyour questionnairesent to the NASW
members.
The Questionnaire
to givetheprobabilwereasked,first,
Respondents
ity (as a percentageexpressedto the nearestwhole
numberfrom0 to 100) that theypersonallywould
in the
attachto eachoftheexpressions.
(Respondents
studiesin Table 1 oftenansweredsuch a question.)
Second,theywereaskedto givetherangeofprobabilitiesthattheythoughttheirreaderswouldassociate
The rangegivessomeideaofthe
withthatexpression.
associatedwithan expression;substantial
variability
about meaningcould raise questions
disagreements
Althoughone does
aboutthe value forcodification.
fromsuchan inquiry,
notexpectdelicatedistinctions
to offer25-30 as a rangeas
it seemsverydifferent
opposedto 5-85,f6rinstance.
The questionnairegroupedthe expressionsby
stems,as in the set based on the stem likely:very
and veryunlikely.The stems
likely,likely,unlikely,
happen,likely,
werealways,certain,chance,frequent,
selprobable,rarely,
probability,
possible,
never,often,
meandom,and usually.Based on theirgrammatical
ings,the expressionswereorderedforeach stem.If
the orderhad been haphazard,or the itemsspread
respondents
conscientious
thequestionnaire,
through
wouldhaverequireda longtimeto completethetask,
becauseto be consistenttheywouldhave to huntup
theirestimatesforotherexpressionswiththe same
btem.
In FormA
We usedtwoformsofthequestionnaire.
expressionsforeach stem were orderedfromhigh
to low,and in FormB fromlow to high.
probability
mightmakea difference,
choicesofordering
Arbitrary
and a designwitha balancingapproachusingalso the
againstbias.We
reverseorderoffered
someprotection
and thencombinedthe
analyzedtheformsseparately
results.
The Mailings and Response Rate
to
On May 6, 1987,we mailed637 questionnaires
membersof the NASW in the United States and
mailingto the
Canada.On June4 we senta follow-up
Fromthetwomailings
475whohadnotyetresponded.
withmainlyusableresponses;
we received238 replies2
orreturned
undelivered
about5% wereeitherreturned
blankbythesciencewriters.
Editingthe Responses
recontrolled
conditions,
Underthemostcarefully
and
spondentsdo as theyplease, as experimenters
surveyscientistswell know.Althoughwe asked the
some
to giveestimatesin wholenumbers,
respondents
gaveanswerssuchas 0.001,0.02,<50, >70, 1+, 60-,
10-20, whichrequiredediting.Some changeswere
simple.For example,we changed0.0001to 0 and 1020 to the midpoint,15. Whenwe could not make a
we changedtheresponseto a
reasonableadjustment,
blank.
itis a familiar
In workwithscalesinquestionnaires,
findingthat respondentsoccasionallyget turned
ofthe numberthey
aroundand givethe complement
intend.For example,in ourstudyalwaysmighthave
produceda personalestimateof 2% and a rangeof
0-5%, whereasthe respondentintended98% and
95-100%. When such errorsoccurredon the more
11
12
performance.
uponphotointerpreter
ofancillaryinformation
AmericanInstitutesforResearch,InstituteforResearchin
AIR-20131-12/70-FR.
Office,
Washington
Psychobiology,
n = 20:enlistedU.S. Armyimageinterpreters.
[11] MAPES, R. E. A. (1979). Verbaland numericalestimatesof
contexts.SocialScienceand Mediin therapeutic
probability
cine13A 277-282.
withchlor"Sideeffects
givenexpression
na = 29: physicians
givenexpression
arefrequent."
amphenicol
nb = 33:physicians
withneomycin
sulphatearefrequent."
"Sideeffects
[12] NAKAO, M. A. and AXELROD, S. (1983).Numbersare better
of frequency
haveno place
thanwords:Verbalspecifications
Amer.J. Med.74 1061-1065.
in medicine.
Means read from
nb =106 physicians.
na = 103 physicians.
chart.
[13] REAGAN, R. T., MOSTELLER, F. and YOUTZ, C. (1989).The
quantitativemeaningsof verbal probabilityexpressions.
74 433-442.
J.Appl.Psychology
in a psychology
course,Stanford
n = 115: undergraduates
University.
and proba[14] REYNA, V. F. (1981).The languageofpossibility
and Cognition
Memory
ofnegationon meaning.
bility:Effects
9 642-650.
n = 41 adultvolunteers.
[15] ROBERTS, D. E. and GUPTA, G. (1987). To the editor.New
EnglandJ. Med. 316 550.
na = 45 house staff.nb = 24 attendingphysicians.
ofwords.
themeanings
[16] ROBERTSON, W. 0. (1983).Quantifying
J.Amer.Med.Assoc.249 2631-2632.
at the
na = 53: Seattlephysicians.
nb= 80: graduatestudents
School of BusinessAdministraof Washington's
University
Orthopedic
tion.n0= 40: BoardofTrusteesat theChildren's
Hospitaland MedicalCenter,Seattle.
to rareevents.
probabilities
[17] SELVIDGE, J. (1972). Assigning
GraduateSchoolofBusinessAdministraPh.D. dissertation,
tion, George F. Baker Foundation, Harvard Univ.
Subjectswere HarvardBusinessSchool studentsin MBA
program.na = 59: Estimates made on basis of a statement
without
context.nb=127: Contextswereprovided.
Also in Mosteller,F. (1977). Assessingunknownnumbers:
andPublicPolicy
In Statistics
OrderofmagnitudTe
estimation.
(W. B. Fairleyand F. Mosteller,eds.) 163-184.AddisonWesley,Reading,Mass.
in meanings
forquantitative
[18] SIMPSON, R. H. (1963).Stability
over20 years.Q. J. Speech49 146-151.
terms:A comparison
secondary
technologists,
nurses,laboratory
n = 51:physicians,
and engineers.
schoolteachers,
fromsciencewriters.
estimates
study,
[20] Current
to
fromexpression
Variessomewhat
n = 230:sciencewriters.
211-237.
expression,
[21] BARTHOLOMEW, D. J. (1961).A methodofallowingfor"notAppl.Statist.10 52-59.
at-home"
bias in samplesurveys.
[22] BEYTH-MAROM, R. (1982).How probableis probable?A nuJ. Foreexpressions.
ofverbalprobability
mericaltranslation
casting1 257-269.
ofa decisionhowthenation
[23] BOFFEY, P. M. (1976).Anatomy
declaredwar on swine flu.Science 192 636-641.
Psychol.Rev. 66 27as multipliers.
[24] CLIFF, N. (1959).Adverbs
on quantisymposium
a four-paper
44. Thispaperstimulated
ofadverbson themeaningofadjectives:
fication
oftheeffect
Chance1 (3) 32-51(1988).
[25] GRIGORIU, B. D. andMIHAESCU, T. (1988).Evaluareanumerfolositein limbajulmedical.
de probabilitate
ica a expresiilor
Rev. Med. Chir. Soc. Med. Nat. Iasi 92 361-364.
bridgeUniv.Press,Cambridge.
the "possibil[28] MOSTELLER, F. (1976).Swineflu:Quantifying
ity."Science 192 1286, 1288.
errors.In Interna[29] MOSTELLER, F. (1978). I. Non-sampling
tional Encyclopedia of Statistics (W. H. Kruskal and J. M.
under
risk.In Judgment
perceived
versusfears:Understanding
Uncertainty:Heuristics and Biases (D. Kahneman, P. Slovic
Comment
HerbertH. Clark
In the last fewyears,Mosteller,Youtz and their
and frequency
colleagueshave lookedat probability
freimprobable,
suchas usual,verylikely,
expressions
quentand as oftenas not.Their interestis in how
technicalinthesetermsare used in communicating
Stanford
ofPsychology,
HerbertH. Clarkis Professor
94305.
California
Stanford,
University,