Professional Documents
Culture Documents
Dispersion
Knowingthemeanisnotenough
Knowingthemeanisnotenough
MeasuresofDispersionAvailable
Rangedefinitionandadvantage
Therangesimplytellsusthehighestvalueinthe sampleminusthelowestvalue.
Exampleabove:$129=3;$155=10.
Advantage:Easytocalculate,intuitive.
Range(cont'd)
RangeUsage
Oneuseofrangeisqualitycontrol,whereabsolute minimamayneedtobesetforsafetystandards.
MeanAbsoluteDeviation
MAD =
X i X N
MeanAbsoluteDeviationExample
WagesinSouth Dakota(Mean$10)
Wage 5 7 11 12 15 |510| |710|
X i X
5 2
=5 =3
5 7
3 5
11 12 15
Avg: 10
MeanAbsoluteDeviationExample
WagesinNorthDakota (Mean$10)
Wage 9 9.50 10 9.50 12
X i X
|9.510|=0.5 |9.510|=0.5
Avg: 10
MeanAbsoluteDeviation Advantages
Usesinformationfromallobservations Notasaffectedbyoutliersastherange
EveryobservationaffectstheMADequally
Relativelyintuitive(comparedtowhat'scoming....)
Absolutevalueturnsouttohaveadifficultproperty:
|X|
MeanAbsoluateDeviation Disadvantage
X 2
Variance
i =1 MAD =
X i X N =
2 i= N i= 1
2 Xi X N
Samplevs.PopulationVariance
Noticethatsampleandpopulationvarianceare calculateddifferently!
i= N i =1 i=n i =1
PopulationVariance: SampleVariance:
2/ N Xi X
2 / n1 X iX
VarianceExample
WagesinSouth Dakota(Mean$10) 2
Wage 5 7 11 12 15 (510)2 =25 (710)2 =9 (1110)2 =1 (1210)2 =4 (1510)2 =25 Sum: 64 2:64/(51)=16
5 25 25 9 4 1 7 1112 15
X iX
Avg: 10
VarianceExample
WagesinNorth Dakota(Mean$10)
Wage 9 9.50 10 9.50 12
2 X iX
(910)2
=1
Avg:10
2:5.5/(51)=1.375
VarianceAdvantage
VariancehasandadvantageoverMAD:
X2 X 2 2 |X| X 1 0 1 2
0
2
DisadvantagesofVariance
StandardDeviation
MAD,Variance,andStandard DeviationCompared
SOUTHDAKOTA: Range MAD 10 3.2 NORTHDAKOTA: Range MAD 3 0.8
Variance 16 Std.Dev. 4
ThestandarddeviationandtheMADareusuallyof thesameorderofmagnitude.
TheRuleofThumb
TheRuleofThumb
Thetypicalcase:
Example:NorhDakota.=10,=1.17
Chebyshev'sTheorem
Atleastfraction 11/k2ofdata
k k
+k
Chebyshev'sTheorem(cont'd)
Atleastfraction 11/k2ofdata
k k
+k
Chebyshev'sTheorem
k=2
=10
+k=18
InterQuartileRange
1. Writethedataoutinorder 2. Breakitintofourparts,eachwithanequalnumber ofobservations 3. Pickthetopnumberinthefirstpart,andthetop numberinthethirdpart 4. Subtracttheformerfromthelatter
InterQuartileRangeExample
Agesof16students:
23 21 23 19 23 20 17 21 24 37 21 20 19 22 18 24
InterQuartileRangeExample
Agesof16students:
Inorder,thisis:
23 21 23 19 23 20 17 21 24 37 21 20 19 22 18 24
17 18 19 19 20 20 21 21 21 22 23 23 23 24 24 37
InterQuartileRangeExample
Agesof16students:
Inorder,thisis:
4obs.
23 21 23 19 23 20 17 21 24 37 21 20 19 22 18 24
17 18 19 19 20 20 21 21 21 22 23 23 23 24 24 37
First(bottom)Quartile
4obs.
SecondQuartile
4obs.
4obs.
ThirdQuartile
Fourth(top)Quartile
InterQuartileRangeExample
Agesof16students:
Inorder,thisis:
4obs.
23 21 23 19 23 20 17 21 24 37 21 20 19 22 18 24
2319=4
4obs.
17 18 19 19 20 20 21 21 21 22 23 23 23 24 24 37
First(bottom)Quartile
4obs.
SecondQuartile
4obs.
ThirdQuartile
Fourth(top)Quartile
IQR:AdvantagesandDisadvantages
Skewness
Theskewnesslooksatthecubeofthedifference fromthemean:
i= N i=1
Xi X N
Skewness=
N X i X 3
i =1
i= N
N 1 N 2
PositiveVs.NegativeSkewness
Mean
Median
Median
Mean