Professional Documents
Culture Documents
RandomVariablesandDiscrete
ProbabilityDistributions
352
RandomVariableandProbabilityDistribution
Inthebackground,thereisarandomexperiment.As
wediscussed,accompanyingthisexperiment,we
have:
ASampleSpace(allpossibleoutcomesofthe
experiment)
Aprobability(assignedtoeachoutcomeinthe
experiment)
Wenowaddtheconceptofarandomvariable anda
probabilitydistribution.
353
RandomVariable: Arandomvariableisafunction
thatassignsanumbertoeachoutcomeofthe
experiment.
Therearetwomaintypesofrandomvariables:
DiscreteRandomVariables
ContinuousRandomVariables
Thedistinctionbasicallydependsontherangeof
valuesthattherandomvariablecantake.
Adiscreterandomvariable:isonethatcantakeon
acountablenumberofvalues.
Acontinuousrandomvariableisonethatcantake
onanuncountablenumberofvalues.
354
Example:Experimentisflippingacoin10times,and
letX=#ofheadsobservedintheexperiment. Thisisa
discreterandomvariable,sinceXcanonlytakeon
thevalues{0,1,2,,10},whichisfiniteandtherefore
countable.
Example:Supposetheexperimentismeasuringthe
timetocompleteatask,andletX=totaltimetaken.
Thisisacontinuousrandomvariable:Sincetimeis
continuous,therangeofvaluesthatXcantakeisa
continuum,andthereforeuncountable.
355
Tohelpunderstandthedistinctionbetween
countableanduncountablesets,keepinmind:
a) Thesetofallintegernumbersiscountable.
b) Thesetofallrealnumbersisuncountable(a
continuum).
356
ProbabilityDistribution
Aprobabilitydistributiondescribesthevalues
thatarandomvariablecantake,alongwith
theprobabilityassociatedwitheachvalue.
Aprobabilitydistributioncanbesummarized
byatable,aformulaoragraph.
InChapter6wefocusontheprobability
distributionofadiscreterandomvariable.
357
IfXisadiscreterandomvariable,itsprobability
distributionsimplyrepresentstheprobabilitythatX
cantakeoneachoneofitspossiblevalues.
Weuseuppercase todenotearandomvariable,
andweuselowercase todenoteaparticularvalue
thatthisrandomvariablecantake.
Werepresenttheprobabilitythattherandom
variableXwillequalxasP(X=x) or,moresimply,
P(x).
358
RequirementsofaDiscreteProbabilityDistribution
Asaresultoftheconditionsrequiredofaprobability
(nonnegative,theymustaddto1),theprobability
distributionPofadiscreterandomvariablemust
satisfy:
Wherethenotation
DenotesthesumofP(x)overallpossiblevaluesx
thattherandomvariableXcantake.
359
Example7.1:TheStatisticalAbstractofthe
UnitedStatesispublishedannually.Itcontainsa
widevarietyofinformationbasedonthecensus
aswellasothersources.
Itsgoalistoprovideinformationaboutavariety
ofdifferentaspectsofthelivesofthecountrys
residents.
Oneofthequestionsaskshouseholdstoreport
thenumberofpersonslivinginthehousehold.
Thefollowingtablesummarizesthedata.
Developtheprobabilitydistributionofthe
randomvariabledefinedasthenumberof
personsperhousehold.
360
NumberofPersons
1
2
3
4
5
6
7ormore
NumberofHouseholds(millions)
31.1
38.6
18.8
16.2
7.2
2.7
1.4
Total116.0
361
Probabilitydistributionscanbeestimatedfrom
relativefrequencies.
x
P(x)
1
31.1/116.0=.268
2
38.6/116.0=.333
3
18.8/116.0=.162
4
16.2/116.0=.140
5
7.2/116.0=.062
6
2.7/116.0=.023
7ormore
1.4/116.0=.012
Total
1.000
362
Whatistheprobabilitythereare4ormore
persons inanygivenhousehold?
x
1
2
3
4
5
6
7ormore
P(x)
.268
.333
.162
.140
.062
.023
.012
Example7.2: Amutualfundsalespersonhas
arrangedtocallonthreepeopletomorrow.
Basedonpastexperiencethesalespersonknows
thatthereisa20%chanceofclosingasaleoneach
call.
Determinetheprobabilitydistributionofthenumber
ofsalesthesalespersonwillmake.
Therandomvariableofinterestisthenumberof
salesmade. Noticethattheexperimentconsistsof
callingthreedifferentclients,andthattheoutcomes
ofeachcallareindependentofeachother.
Foreachcall,letSdenotetheeventsuccess(i.e.
closingasale).PastexperiencesaysthatP(S)=.20.
Therefore,SC isnotclosingasale,andP(SC)=.80
364
Wecandevelopingtheprobabilitydistribution
usingaProbabilityTree.
Sales Call 1
Sales Call 2
Sales Call 3
(.2)(.2)(.8)= .032
P(S)=.2
P(S)=.2
P(SC)=.8
P(S)=.2
SSS
P(SC)=.8
P(S)=.2
S S SC
S SC S
P(SC)=.8
P(S)=.2
S SC SC
SC S S
P(SC)=.8
P(S)=.2
SC S SC
SC SC S
P(SC)=.8
SC SC SC
P(SC)=.8
P(S)=.2
P(SC)=.8
X
3
2
1
0
P(x)
.23 = .008
3(.032)=.096
3(.128)=.384
.83 = .512
Population/ProbabilityDistribution
Thediscreteprobabilitydistributiondescribes
apopulation.
Sincewehavepopulations,wecandescribe
thembycomputingvariousparameters.
Twoofthepopulationparameterswestudied
previouslyare:populationmean and
populationvariance.
366
PopulationMean(ExpectedValue)ofaDiscrete
RandomVariable
Ourgeneraldefinitionofthepopulationmeanis
IfweknowthattherandomvariableXisdiscrete,we
canreexpressintermsoftheprobability
distributionofX.Wehave:
ThisparameterisalsocalledtheexpectedvalueofX
andisrepresentedbyE(X).
367
PopulationVarianceofaDiscreteRandomVariable
Thepopulationvarianceofadiscreterandomvariable
canbeexpressedsimilarly.Itistheweightedaverageof
thesquareddeviations fromthemean:
Asbefore,thereisashortcutformulation
Thestandarddeviationisthesameasbefore:
368
Example7.1(continued): Findthemean,variance,and
standarddeviationforthepopulationofX=numberof
personsperhousehold.Assumethatthecategory7or
moreisactually7.
ThemeanE(X)isgivenby:
E ( X ) 1 P(1) 2 P(2) ... 7 P(7)
=1(.268)+2(.333)+3(.162)
+4(.140)+5(.062)+6(.023)
+7(.012)
=2.513
369
ThevarianceofXisgivenby:
ThestandarddeviationofXis
= 1.958 1.399
370
LawsofExpectedValue
TherearecertainpropertiesoftheExpectedValue
thatareusefultoknow.Letcbeaconstant.Then:
(1)E(c)=c
Inwords:Theexpectedvalueofaconstant(c)isjust
thevalueoftheconstant.
(2)E(X+c)=E(X)+c, and(3)E(cX)=cE(X)
Inwords:Wecanpullaconstantoutofthe
expectedvalueexpression(eitheraspartofasum
witharandomvariableXorasacoefficientofrandom
variableX).
371
Example7.4: Monthlysaleshaveameanof
$25,000andastandarddeviationof$4,000.
Profitsarecalculatedbymultiplyingsalesby30%and
subtractingfixedcostsof$6,000.
Findthemean monthlyprofit.
1)Describetheproblemstatementinalgebraicterms:
saleshaveameanof$25,000 E(Sales)=25,000
profitsarecalculatedby
Profit=.30(Sales) 6,000
372
Findthemean monthlyprofit.
E(Profit) =E[.30(Sales) 6,000]
=E[.30(Sales)] 6,000 [byrule#2]
=.30E(Sales) 6,000 [byrule#3]
=.30(25,000) 6,000=1,500
Thus,themeanmonthlyprofitis$1,500
373
LawsofVariance
Asbefore,letcbeaconstant.Then:
1. V(c)=0
Inwords:Thevarianceofaconstantiszero(makes
totalsense).
2. V(X+c)=V(X)
Inwords:Thevarianceofarandomvariableanda
constantisjustthevarianceoftherandomvariable
(per1above).
3. V(cX)=c2V(X)
Inwords:Thevarianceofarandomvariableanda
constantcoefficientisthecoefficientsquared times
thevarianceoftherandomvariable.
374
Example7.4(continued):Monthlysaleshaveameanof
$25,000andastandarddeviationof$4,000.Profitsare
calculatedbymultiplyingsalesby30%andsubtracting
fixedcostsof$6,000.
Findthestandarddeviation ofmonthlyprofits.
1)Describetheproblemstatementinalgebraicterms:
saleshaveastandarddeviationof$4,000
V(Sales)=4,0002 =16,000,000
(remembertherelationshipbetweenstandarddeviation
andvariance)
profitsarecalculatedby Profit=.30(Sales) 6,000
375
2)Thevariance ofprofitis=V(Profit)
=V[.30(Sales) 6,000]
=V[.30(Sales)]
[byrule#2]
[byrule#3]
=(.30)2V(Sales)
=(.30)2(16,000,000)=1,440,000
Again,standarddeviation isthesquareroot of
variance,sostandarddeviationofProfit=
(1,440,000)1/2 =$1,200
376
Example7.4(summary):Monthlysaleshavea
meanof$25,000andastandarddeviationof
$4,000.Profitsarecalculatedbymultiplying
salesby30%andsubtractingfixedcostsof
$6,000.
Findthemean andstandarddeviation of
monthlyprofits.
Themean monthlyprofitis$1,500
Thestandarddeviation ofmonthlyprofitis
$1,200
377
BivariateDistributions
Uptonow,wehavelookedatunivariate distributions,
thatis,probabilitydistributionsinone variable.
Asyoumightguess,bivariatedistributions are
probabilitiesofcombinationsoftwo variables.
Bivariateprobabilitydistributionsarealsocalledjoint
probability.
AjointprobabilitydistributionofXandYisatableor
formulathatliststhejointprobabilitiesforall pairs of
valuesxandy,andisdenotedP(x,y).
P(x,y)=P(X=xandY=y)
378
Asyoumightexpect,therequirementsfora
bivariatedistribution aresimilartoa
univariate distribution,withonlyminor
changestothenotation:
forallpairs(x,y)
379
Example7.5: XavierandYvettearerealestate
agents.LetXdenotethenumberofhousesthat
XavierwillsellinamonthandletYdenotethe
numberofhousesYvettewillsellinamonth.
Ananalysisoftheirpastmonthlyperformanceshas
thefollowingjointprobabilities(bivariateprobability
distribution).
380
MarginalProbabilities
Asbefore,wecancalculatethemarginalprobabilities by
summingacrossrowsanddowncolumnstodetermine
theprobabilitiesofXandYindividually:
For example: the probability that Xavier sells 1 house = P(X=1) =0.50
381
DescribingtheBivariateDistribution
Wecandescribethemean,variance,andstandard
deviationofeachvariable inabivariatedistribution
byworkingwiththemarginalprobabilities
382
PopulationCovariance
Wecanexpressthepopulationcovariance of
twodiscretevariablesintermsoftheirjoint
probabilitydistribution as:
oralternativelyusingthisshortcutmethod:
383
CoefficientofCorrelation
The(population)coefficientofcorrelationis
calculatedinthesamewayasdescribed
earlier
384
Example7.6
X
0
0
0
1
1
1
2
2
2
0.7
Y
0
1
2
0
1
2
0
1
2
0.5
Probability
0.12
0.21
0.07
0.42
0.06
0.02
0.06
0.03
0.01
X - (x)
-0.7
-0.7
-0.7
0.3
0.3
0.3
1.3
1.3
1.3
Y - (y)
-0.5
0.5
1.5
-0.5
0.5
1.5
-0.5
0.5
1.5
[X - (x)][Y-(y)]
0.042
-0.074
-0.074
-0.063
0.009
0.009
-0.039
0.020
0.020
-0.150
386
SumofTwoVariables
Thebivariatedistributionallowsustodevelop
theprobabilitydistributionofany combinationof
thetwovariables. Ofparticularinterestisthe
sum oftwovariables.
IfweconsiderourexampleofXavierandYvette
sellinghouses,wecancreatetheprobability
distributionofX+Y:
Fromhere,wecananswerquestionslikewhatis
theprobabilitythattwohousesaresold?
P(X+Y=2)=P(0,2)+P(1,1)+P(2,0)
=.07+.06+.06=.19
387
Whatistheprobabilitythatatleastonehouseis
soldbetweenXavierandYvette?
Thisis:
Pr(X+Y1)=P(X+Y=1)+Pr(X+Y=2)+Pr(X+Y=3)+Pr(X+Y=4)
=0.63+0.19+0.05+0.01=0.88
Whatistheprobabilitythatthenumberofhouses
soldis2or3?
Thisis:
Pr(2X+Y3)=0.19+0.05=0.24
388
SumofTwoVariables
Likewise,wecancomputetheexpectedvalue,
variance,andstandarddeviationofX+Yintheusual
way
E(X+Y)=0(.12)+1(.63)+2(.19)+3(.05)+4(.01)=1.2
V(X+Y)=(0 1.2)2(.12)++(4 1.2)2(.01)=.56
7.389
LawsofExpectedValueandVariancefortheSumof
TwoRandomVariables
Previously,westatedLawsforexpectedvaluesand
variancesinvolvingarandomvariableXanda
constantc.
Wealsohavelawsinvolvingthesumoftworandom
variables:
1.
E(X+Y)=E(X)+E(Y)
2.
V(X+Y)=V(X)+V(Y)+2COV(X,Y)
IfXandYareindependent,COV(X,Y)=0andthus(2)
becomes:
V(X+Y)=V(X)+V(Y)
390
Example7.5(continued):Wehadalreadyderivedthe
marginaldistributionsofXandYbefore.Wehave:
WehadobtainedE(X+Y)andV(X+Y)byderivingthe
distributionofX+Y.ButwecanusetheLawsofsumsof
randomvariables:
E(X+Y)=E(X)+E(Y)=.7+.5=1.2
V(X+Y)=V(X)+V(Y)+2COV(X,Y)
=.41+.45+2(.15)=.56
391
LawsofExpectationandVarianceforLinear
CombinationsofTwoRandomVariables
Letcanddbetwoconstants.Wecangeneralizethe
lawsofexpectationandvariancefromthesumX+Yto
anylinearcombination
cX+dY
Wehave:
1. E(cX+dY)=cE(X)+dE(Y)
2.
V(cX+dY)=c2V(X)+d2V(Y)+2cdCOV(X,Y)
IfXandYareindependent,COV(X,Y)=0andthus(2)
becomes:
V(cX+dY)=c2V(X)+d2V(Y)
392
Example:PortfolioDiversificationandAssetAllocation.
Consideraninvestorwhoformsaportfolio,consistingofonly
twostocks,byinvesting$4,000inonestockand$6,000ina
secondstock.Supposethattheresultsafter1yearare:
OneYearResults
Stock
1
2
Total
Initial
Investment
$4,000
$6,000
$10,000
ValueofInvestment
AfterOneYear
$5,000
$5,400
$10,400
OR
Percentageinvestedin
firststock
RateofReturn
onInvestment
R1 =.25(25%)
R2 =.10(10%)
Rp =.04(4%)
Percentageinvestedin
secondstock
MeanandVarianceofaPortfolioofTwoStocks
TheportfolioRp isdefinedas:
Rp =w1 R1 +w2 R2
wherew1 andw2 aretheproportionsorweights
ofinvestments1and2.
Usingthelawsforlinearcombinationsof
randomvariables,wehave:
E(Rp)=w1 E(R1)+w2 E(R2)
V(Rp)=w12 V(R1)+w22 V(R2)+2w1w2 COV(R1,R2)
E(R1)andE(R2)aretheexpectedvaluesofR1andR2,
while1 and2 aretheirstandarddeviations,and is
thecoefficientofcorrelation.
394
Example7.8:Aninvestorhasdecidedtoforma
portfoliobyputting25%ofhismoneyinto
McDonaldsstockand75%intoCiscoSystems
stock.
Theinvestorassumesthattheexpectedreturns
willbe8%and15%,respectively,andthatthe
standarddeviationswillbe12%and22%,
respectively.
a) Findtheexpectedreturnontheportfolio.
b) Computethestandarddeviationofthereturns
ontheportfolioassumingthat:
i. thetwostocksreturnsareperfectlypositively
correlated
ii. thecoefficientofcorrelationis.5
iii. thetwostocksreturnsareuncorrelated
395
a) Theexpectedvaluesofthetwostocksare
E(R1)=.08 andE(R2)=.15
Theweightsarew1 =.25andw2 =.75.
Thus,
E(R2)=w1E(R1)+w2E(R2)
=.25(.08)+.75(.15)
=.1325
(anexpectedportfolioreturnof13.25%)
396
Next,notethatthestatementoftheproblem
didnotgiveusthecovariancebetweenR1
andR2directly
However,itgaveusthestandarddeviationsof
R1andR2,anditaskedustosolvethe
problemunderthreedifferentscenarios
regardingthecorrelationbetweenR1andR2.
Recallingtheformulaforcorrelation,ifwe
havethecorrelationandthestandard
deviations,wecanrecoverthecovariance
398
Recallthat:
and,therefore:
399
Therefore,inthethreecorrelationscenariostobe
considered,wehave:
(i)
10.12 0.22=0.0264
(ii)
0.50.120.22=0.0132
(iii)
0
400
Recallthat
V(Rp)=w12 V(R1)+w22 V(R2)+2w1w2 COV(R1,R2)
Therefore:
(i)If =1:
V(Rp)=0.252 0.122+0.752 0.222+2 0.25 0.75 0.0264=0.0380
(ii)If =0.5:
V(Rp)=0.252 0.122+0.752 0.222+2 0.25 0.75 0.0132=0.0331
(iii)If =0:
V(Rp)=0.252 0.122+0.752 0.222+0=0.0281
401
PortfolioDiversificationinPractice
Theformulasintroducedinthissection
requirethatweknowtheexpectedvalues,
variances,andcovariance(orcoefficientof
correlation)oftheinvestmentswere
interestedin.
Thequestionarises,Howdowedetermine
theseparameters?Themostcommon
procedureistoestimatetheparametersfrom
historicaldata,usingsamplestatistics.
402
PortfolioswithMoreThanTwoStocks
Wecanextendtheformulasthatdescribethemean
andvarianceofthereturnsofaportfoliooftwo
stockstoaportfolioofanynumberofstocks.
MeanandVarianceofaPortfolioofkStocks
k
w E(R )
E(Rp )=
i 1
V(Rp )=
i 1
w i2 i2 2
w w COV(R , R )
i
i 1 ji 1
PortfolioswithMoreThanTwoStocks
Whenkisgreaterthan2thecalculationscanbe
tediousandtimeconsuming.
Forexample,whenk=3,weneedtoknowthe
valuesofthethreeweights,threeexpected
values,threevariances,andthreecovariances.
Whenk=4,therearefourexpectedvalues,four
variancesandsixcovariances.[Thenumberof
covariances requiredingeneralisk(k1)/2.]
404
BinomialDistribution
Thebinomialdistributionisaspecificexampleofa
discreteprobabilitydistribution.
Thebinomialdistribution istheprobabilitydistribution
thatresultsfromdoingabinomialexperiment.
Binomialexperimentshavethefollowingproperties:
1) Fixednumberoftrials,representedasn.
2) Eachtrialhastwopossibleoutcomes,asuccessand
afailure.
3) P(success)=p(andthus:P(failure)=1p),foralltrials.
4) Thetrialsareindependent,whichmeansthatthe
outcomeofonetrialdoesnotaffecttheoutcomesof
anyothertrials.
405
SuccessandFailurearejustlabels fora
binomialexperiment,thereisnovaluejudgment
implied.
Forexampleacoinflipwillresultineitherheads
ortails.Ifwedefineheadsassuccessthen
necessarilytailsisconsideredafailure.
Otherbinomialexperimentnotions:
Anelectioncandidatewinsorloses
Anemployeeismaleorfemale
406
BinomialRandomVariable
Therandomvariableofabinomialexperimentisdefinedasthe
numberofsuccessesinthentrials,andiscalledthebinomial
randomvariable.
Example:flipafaircoin10times
1)Fixednumberoftrials n=10
2)Eachtrialhastwopossibleoutcomes {heads(success),tails(failure)}
3)P(success)=0.50;P(failure)=10.50=0.50
4)Thetrialsareindependent (i.e.theoutcomeofheadsonthefirstflip
willhavenoimpactonsubsequentcoinflips).
Henceflippingacointentimesisabinomialexperimentsince
allconditionsweremet.
407
BinomialRandomVariable
Thebinomialrandomvariablecounts thenumberof
successesinn trialsofthebinomialexperiment.It
cantakeonvaluesfrom0,1,2,,n.Thus,itsa
discrete randomvariable.
Tocalculatetheprobability associatedwitheach
valueofX,weusecombinatorics:
forx=0,1,2,,n
408
Combinatoricsisabranchofmathematicswhichhelpsus
countdiscretestructures(outcomesinourcase).
n! denotesthefactorialoftheintegern.Itisdefinedas:
,with 0!=1
Combinatoric theorytellsusthatthetotalnumberof
differentways inwhichwecanhavexsuccessesinn
trialsis:
Byindependenceofeachtrial,theprobabilityofany
outcomethatyieldsxsuccessesinntrialsis:
Pr(xsuccesses)
Pr(nxfailures)
409
Thus,wehavethattheprobabilityofanyoutcomethat
yieldsxsuccessesinntrialsis:
Inaddition,thereareatotalof
suchoutcomes.
Thus,addinguptheprobabilitiesofallsuchoutcomes,
weobtainthebinomialprobabilityformula:
410
Example: Aquizconsistsof10multiplechoice
questions.Eachquestionhasfivepossible
answers,onlyoneofwhichiscorrect.
Supposeastudentplanstoguesstheanswer to
eachquestion.
Whatistheprobabilitythatthestudentgetsno
answerscorrect?
Whatistheprobabilitythatthestudentgetstwo
answerscorrect?
411
Thiscanbeseenasabinomialexperiment,
where:
a) n=numberoftrials=10
b) successofeachtrialis``correctlyguessing
theanswer.
c) Eachanswerisindependentoftheothers.
Sinceeachanswerisguessedandthereare
fivechoicesforeachquestion,wehave
P(success)=1/5=.20
412
Thus,wehaveabinomialexperimentwhere
n=10,andP(success)=.20
Whatistheprobabilitythatthestudentgets
noanswers correct?ThisisP(X=0):
Whatistheprobabilitythatthestudentgets
twoanswers correct?Thatis,P(X=2):
CumulativeProbability
Thusfar,wehavebeenusingthebinomialprobability
distributiontofindprobabilitiesforindividualvalues
ofx.
ThecumulativedistributionofXatthevaluexis
definedasP(Xx).Thatis,theprobabilitythatXis
lessthanorequaltothevaluex.
Supposethequizisfailediflessthan50%ofthe
answersarerespondedcorrectly.Findtheprobability
thatthestudentfailsthequiz.ThisisPr(X 4).Thisis
givenby:
P(X4)=P(0)+P(1)+P(2)+P(3)+P(4)
415
P(X4)=P(0)+P(1)+P(2)+P(3)+P(4)
WealreadyknowP(0)=.1074andP(2)=.3020.Using
thebinomialformulatocalculatetheothers:
P(1)=.2684,P(3)=.2013,andP(4)=.0881
WehaveP(X4)=.1074+.2684++.0881=.9672
Thus,itsabout97%probablethatthestudentwillfail
thetestusingtheluckstrategyandguessingat
answers
416
BinomialTable
Calculatingbinomialprobabilitiesbyhandistediousand
errorprone.Thereisaneasierway.RefertoTable1 in
AppendixB.
Theprobabilitieslistedinthetablesarecumulative,
i.e.P(Xk) k istherowindex;thecolumnsofthetable
areorganizedbyP(success)=p
n = 10
k
0
1
2
3
4
5
6
7
8
9
0.01
0.9044
0.9957
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
0.05
0.5987
0.9139
0.9885
0.9990
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
0.1
0.3487
0.7361
0.9298
0.9872
0.9984
0.9999
1.0000
1.0000
1.0000
1.0000
0.2
0.1074
0.3758
0.6778
0.8791
0.9672
0.9936
0.9991
0.9999
1.0000
1.0000
0.25
0.0563
0.2440
0.5256
0.7759
0.9219
0.9803
0.9965
0.9996
1.0000
1.0000
0.3
0.0282
0.1493
0.3828
0.6496
0.8497
0.9527
0.9894
0.9984
0.9999
1.0000
0.4
0.0060
0.0464
0.1673
0.3823
0.6331
0.8338
0.9452
0.9877
0.9983
0.9999
0.5
0.0010
0.0107
0.0547
0.1719
0.3770
0.6230
0.8281
0.9453
0.9893
0.9990
0.6
0.0001
0.0017
0.0123
0.0548
0.1662
0.3669
0.6177
0.8327
0.9536
0.9940
0.7
0.0000
0.0001
0.0016
0.0106
0.0473
0.1503
0.3504
0.6172
0.8507
0.9718
0.75
0.0000
0.0000
0.0004
0.0035
0.0197
0.0781
0.2241
0.4744
0.7560
0.9437
0.8
0.0000
0.0000
0.0001
0.0009
0.0064
0.0328
0.1209
0.3222
0.6242
0.8926
0.9
0.0000
0.0000
0.0000
0.0000
0.0001
0.0016
0.0128
0.0702
0.2639
0.6513
0.95
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0010
0.0115
0.0861
0.4013
0.99
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0043
0.0956
417
Whatistheprobabilitythatthestudentgetsno
answers correct?i.e.whatisP(X=0),given
P(success)=.20 andn=10?
n = 10
k
0
1
2
3
4
5
6
7
8
9
0.01
0.9044
0.9957
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
0.05
0.5987
0.9139
0.9885
0.9990
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
0.1
0.3487
0.7361
0.9298
0.9872
0.9984
0.9999
1.0000
1.0000
1.0000
1.0000
0.2
0.1074
0.3758
0.6778
0.8791
0.9672
0.9936
0.9991
0.9999
1.0000
1.0000
0.25
0.0563
0.2440
0.5256
0.7759
0.9219
0.9803
0.9965
0.9996
1.0000
1.0000
0.3
0.0282
0.1493
0.3828
0.6496
0.8497
0.9527
0.9894
0.9984
0.9999
1.0000
0.4
0.0060
0.0464
0.1673
0.3823
0.6331
0.8338
0.9452
0.9877
0.9983
0.9999
0.5
0.0010
0.0107
0.0547
0.1719
0.3770
0.6230
0.8281
0.9453
0.9893
0.9990
0.6
0.0001
0.0017
0.0123
0.0548
0.1662
0.3669
0.6177
0.8327
0.9536
0.9940
0.7
0.0000
0.0001
0.0016
0.0106
0.0473
0.1503
0.3504
0.6172
0.8507
0.9718
0.75
0.0000
0.0000
0.0004
0.0035
0.0197
0.0781
0.2241
0.4744
0.7560
0.9437
0.8
0.0000
0.0000
0.0001
0.0009
0.0064
0.0328
0.1209
0.3222
0.6242
0.8926
0.9
0.0000
0.0000
0.0000
0.0000
0.0001
0.0016
0.0128
0.0702
0.2639
0.6513
0.95
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0010
0.0115
0.0861
0.4013
0.99
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0043
0.0956
Whatistheprobabilitythatthestudentgetstwo
answers correct?i.e.whatisP(X=2),given
P(success)=.20 andn=10?
n = 10
k
0
1
2
3
4
5
6
7
8
9
0.01
0.9044
0.9957
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
0.05
0.5987
0.9139
0.9885
0.9990
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
0.1
0.3487
0.7361
0.9298
0.9872
0.9984
0.9999
1.0000
1.0000
1.0000
1.0000
0.2
0.1074
0.3758
0.6778
0.8791
0.9672
0.9936
0.9991
0.9999
1.0000
1.0000
0.25
0.0563
0.2440
0.5256
0.7759
0.9219
0.9803
0.9965
0.9996
1.0000
1.0000
0.3
0.0282
0.1493
0.3828
0.6496
0.8497
0.9527
0.9894
0.9984
0.9999
1.0000
0.4
0.0060
0.0464
0.1673
0.3823
0.6331
0.8338
0.9452
0.9877
0.9983
0.9999
0.5
0.0010
0.0107
0.0547
0.1719
0.3770
0.6230
0.8281
0.9453
0.9893
0.9990
0.6
0.0001
0.0017
0.0123
0.0548
0.1662
0.3669
0.6177
0.8327
0.9536
0.9940
0.7
0.0000
0.0001
0.0016
0.0106
0.0473
0.1503
0.3504
0.6172
0.8507
0.9718
0.75
0.0000
0.0000
0.0004
0.0035
0.0197
0.0781
0.2241
0.4744
0.7560
0.9437
0.8
0.0000
0.0000
0.0001
0.0009
0.0064
0.0328
0.1209
0.3222
0.6242
0.8926
0.9
0.0000
0.0000
0.0000
0.0000
0.0001
0.0016
0.0128
0.0702
0.2639
0.6513
0.95
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0010
0.0115
0.0861
0.4013
0.99
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0043
0.0956
Whatistheprobabilitythatthestudentfails
thequiz?i.e.whatisP(X4),given
P(success)=.20 andn=10?
n = 10
k
0
1
2
3
4
5
6
7
8
9
0.01
0.9044
0.9957
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
1.0000
0.05
0.5987
0.9139
0.9885
0.9990
0.9999
1.0000
1.0000
1.0000
1.0000
1.0000
0.1
0.3487
0.7361
0.9298
0.9872
0.9984
0.9999
1.0000
1.0000
1.0000
1.0000
0.2
0.1074
0.3758
0.6778
0.8791
0.9672
0.9936
0.9991
0.9999
1.0000
1.0000
0.25
0.0563
0.2440
0.5256
0.7759
0.9219
0.9803
0.9965
0.9996
1.0000
1.0000
0.3
0.0282
0.1493
0.3828
0.6496
0.8497
0.9527
0.9894
0.9984
0.9999
1.0000
0.4
0.0060
0.0464
0.1673
0.3823
0.6331
0.8338
0.9452
0.9877
0.9983
0.9999
0.5
0.0010
0.0107
0.0547
0.1719
0.3770
0.6230
0.8281
0.9453
0.9893
0.9990
0.6
0.0001
0.0017
0.0123
0.0548
0.1662
0.3669
0.6177
0.8327
0.9536
0.9940
0.7
0.0000
0.0001
0.0016
0.0106
0.0473
0.1503
0.3504
0.6172
0.8507
0.9718
0.75
0.0000
0.0000
0.0004
0.0035
0.0197
0.0781
0.2241
0.4744
0.7560
0.9437
0.8
0.0000
0.0000
0.0001
0.0009
0.0064
0.0328
0.1209
0.3222
0.6242
0.8926
0.9
0.0000
0.0000
0.0000
0.0000
0.0001
0.0016
0.0128
0.0702
0.2639
0.6513
0.95
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0010
0.0115
0.0861
0.4013
P(X4)=.9672
420
0.99
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0001
0.0043
0.0956
UsingtheBinomialProbabilityTable
Thebinomialtablegivescumulativeprobabilitiesfor
P(Xk).
However,wemaybeinterestedinprobabilitiesof
thetype:
a) Pr(X=k)
b) Pr(Xk)
c) Pr(k1 Xk2)(withk1 <k2)
Wecancomputetheseprobabilitiesfrom
cumulativeprobabilities,weexplainhownext
421
IfXisdiscrete,wecanobtainP(X=k) fromP(Xk)
andP(Xk1)by:
P(X=k)=P(Xk) P(Xk1)
Likewise,forprobabilitiesgivenasP(Xk),we
have:
P(Xk)=1 P(Xk1)
Finally,wecancomputePr(k1 Xk2) as:
Pr(k1 Xk2)=Pr(Xk2) Pr(Xk1 1)
422
Example:Problem7.93. Theleadingbrandof
dishwasherdetergenthasa30%marketshare.Asample
of25dishwasherdetergentcustomerswastaken.What
istheprobabilitythat10offewercustomerschosethe
leadingbrand?
Thisisanexampleofabinomialrandomvariable:
X=#ofcustomerswhoboughtleadingdishwasherbrand
Theunderlyingexperimentconsistsof:
n=25trials
p=Prob(Success)=0.30
TheproblemasksforP(X10).UsingTable1inthe
Appendix,wehaveP(X10)=0.9022
423
Example:Problem7.97. Itisbelievedthat10%ofall
votersintheUnitedStatesconsiderthemselvesas
Independent.Asurveyasked25peopletoidentify
themselvesasDemocrat,Republicanor
Independent.
a) Whatistheprobabilitythatnoneofthepeoplein
thesurveyareIndependent?
b) Whatistheprobabilitythatfewerthanfivepeople
areIndependent?
c) Whatistheprobabilitythatmorethantwopeople
areIndependent?
424
Onceagain,thisisanexampleofabinomialrandom
variable
X=#ofIndependentvotersinthesurvey
Theunderlyingexperimentconsistsof
n=25trials
p=Prob(success)=0.10
Theproblemasks:
a) Pr(X=0)
b) Pr(X4)
c) Pr(X 3)
425
UsingTable1,wehave:
a) Pr(X=0)=0.0718
b) Pr(X4)=0.9020
c) HowaboutPr(X3)?RecallthatTable1only
displaysprobabilitiesoftheformPr(Xx).
Wehave
Pr(X 3)=1 Pr(X2)=1 0.5371=0.4629
426
BinomialDistributioninExcel
ExcelcancomputePr(X=x)andPr(Xx)whenXisa
binomialrandomvariable.
Thecommandis:
BINOM.DIST(number_s,trials,probability_s,cumulative)
Where:
Number_s :Thenumberofsuccessesintrials.
Trials :Thenumberofindependenttrials.
Probability_s :Theprobabilityofsuccessoneachtrial.
Cumulative :IfcumulativeisTRUE,thenBINOM.DIST
returnsthecumulativedistributionfunctionPr(X x);if
FALSE,itreturnstheprobabilityfunctionPr(X=x).
427
ExpectedValueandVariancefortheBinomial
Distribution
Asyoumightexpect,statisticianshavedeveloped
generalformulasforthemean,variance,and
standarddeviationofabinomialrandomvariable.
Theyare:
Asinpreviouscases,youneedtorememberthese
formulas
428
Example7.93(continued):Constructanintervalsuch
that,withprobabilityatleast75%itwillincludethe
actualnumberofcustomersthatchoosetheleading
brand.
OK,toanswerthisquestion,wegobacktoChebysheffs
Theorem,whichstatesthat:Theprobabilitythata
randomvariableXlieswithinkstandarddeviationsof
itsmeanisatleast:
Sincewewantthistobeatleast75%,wefirstneedto
findtheksuchthat
1
Thisyields
0.75
.
429
Next,recallthatintheexamplewehaven=25 andp=0.30
Therefore,usingtheexpectationandvarianceformulas
forBinomialrandomvariables,wehave:
E[X]=np =7.5 and
Therefore,anintervalthatwillinclude,withatleast75%
probability,theactualnumberofcustomerswhowill
choosetheleadingbrandis:
[7.5 2*2.291,7.5+2*2.291]=[2.918,12.082]
Thus,wecansaythatwithprobabilityatleast75%,
between3and12peoplewillchoosetheleadingbrand
inthisexample.
430
PoissonDistribution
NamedforSimeonPoisson,thePoissondistribution
isadiscreteprobabilitydistributionandreferstothe
numberofevents (a.k.a.successes)withinaspecific
timeperiodorregionofspace.
Forexample:
i.
Thenumberofcarsarrivingataservicestationin1
hour.(Theintervaloftimeis1hour.)
ii. Thenumberofflawsinaboltofcloth.(Thespecific
regionisaboltofcloth.)
iii. Thenumberofaccidentsin1dayonaparticular
stretchofhighway.(Theintervalisdefinedbyboth
time,1day,andspace,theparticularstretchof
highway.)
431
DifferencebetweenBinomialandPoisson
Randomvariables:
Abinomialrandomvariableisthenumberof
successesinagivennumberoftrials,whereas
aPoissonrandomvariableisthenumberof
successesinanintervaloftimeorinaspecific
regionofspace.
432
ThePoissonExperiment
Likeabinomialexperiment,aPoissonexperiment
hasfourdefiningcharacteristicproperties:
i. Thenumberofsuccessesthatoccurinany
intervalisindependent ofthenumberof
successesthatoccurinanyotherinterval.
ii. Theprobabilityofasuccessinanintervalisthe
sameforallequalsizeintervals
iii. Theprobabilityofasuccessisproportionalto
thesizeoftheinterval.
iv. Theprobability ofmorethanonesuccessinan
intervalapproaches0astheintervalbecomes
smaller.
433
ThePoissonrandomvariable isthenumberof
successesthatoccurinaperiodoftimeoran
intervalofspaceinaPoissonexperiment.
successes
E.g.Onaverage,96 trucksarriveataborder
crossingeveryhour.
time period
E.g.Thenumberoftypographicerrorsinanew
textbookeditionaverages1.5 per100pages.
successes (?!)
interval
434
PoissonProbabilityDistribution
TheprobabilitythataPoissonrandomvariable
assumesavalueofx isgivenby:
ande isthenaturallogarithmbase.
TheexpectedvalueandvarianceofaPoisson
randomvariableXaregivenby:
435
Example7.12:Astatisticsinstructorhasobserved
thatthenumberoftypographicalerrorsinnew
editionsoftextbooksvariesconsiderablyfrom
booktobook.Aftersomeanalysisheconcludes
thatthenumberoferrorsisPoissondistributed
withameanof1.5per100pages.
Theinstructorrandomlyselects100pagesofa
newbook.Whatistheprobabilitythatthereare
notypos?
436
Thatis,whatisP(X=0)giventhat=1.5?
Howtoproceed?
First,notethatwearenowtalkingaboutan
intervalof400pages.
Intheoriginalstatementoftheproblem,we
weretoldthattheexpectednumberoftypos
inanintervalof100pageswas=1.5.
Therefore,theexpectednumberoftyposin
anintervalof400pagesis4*1.5=6
Thus,whenwedealwithanintervalof400
pages,wemustuse=6inthePoisson
distributionformula.
438
Fora400pagebook,whatistheprobability
thatthereareno typos?
P(X=0)=
439
Fora400pagebook,whatistheprobabilitythat
therearefiveorless typos?
P(X5)=P(0)+P(1)++P(5)
Thisisrathertedioustosolvemanually.Abetter
alternativeistorefertoTable2 inAppendixB
k=5,=6,andP(Xk)=.446
there is about a 45% chance there are 5 or less typos
440
Characterizearangeofvaluesthatwillincludetheactual
numberoftyposfoundina400pagebookwith
probabilityatleast90%.
Again,wecanuseChebysheffs Theorem.Sincewewant
thistobeatleast75%,wefirstneedtofindtheksuch
that
Thisyields
Next,recallthat,ifXisaPoissonrandomvariable,then
).
E[X]=and V(X)= (andtherefore,
IfXisthenumberoftyposin400pages,then
and
441
Thus,theintervalis:
[6 3.162*2.449,6+3.162*2.449]=[1.743,13.743]
Wecannothaveanegativenumberoftypos,
sowecantruncatetheintervalatzero.
Thus,wecanstatethatwithprobabilityat
least90%,therewillbebetween0and13
typosina400pagebook.
442
PoissonDistributioninExcel
ExcelcancomputePr(X=x)andPr(Xx)whenX
isaPoissonrandomvariable.
Thecommandis:
POISSON.DIST(x,mean,cumulative)
Where:
x:Thenumberofevents(successes).
mean :Theexpectednumberofsuccessesper
interval.
Cumulative :IfcumulativeisTRUE,then
POISSON.DISTreturnsthecumulativedistribution
functionPr(Xx);ifFALSE,itreturnsthe
probabilityfunctionPr(X=x).
443
Example:Exercise7.117: Thenumberofbank
robberiesthatoccurinalargeNorthAmerican
cityisPoissondistributedwithameanof1.5
perday. Findtheprobabilitiesofthefollowing
events:
a) Threeormorebankrobberiesoccurinaday.
a) Between10and15robberiesoccurduringa
5dayperiod.
444
ThisisPr(X3).Again,touseTable2,weneedto
expressthisintermsofaprobabilityofthetype
Pr(Xk).Notethat:
Pr(X3)=1Pr(X2)
whereXisaPoissonrandomvariablewith=1.5
ThisisPr(10X15).Wehave:
Pr(10X15)=Pr(X15) Pr(X9)
whereXisPoissonwith=1.8*5=7.5
445
UsingTable2inAppendixB,if=1.5,then
Pr(X2)=0.8088
andthereforePr(X3)=1Pr(X2)=0.1912.
Andif=7.5,then
Pr(X15)=0.9954 andPr(X9)=0.7764
andtherefore
Pr(10X15)=Pr(X15) Pr(X9)=0.219
446