
Zeena Jarrar

Kiker 6
Data Exploration Mini Project
For my data exploration project, I chose to focus on the amount of money my population spends on clothes in a year. The values are amounts in US dollars, and my population is statistics students at LASA. The units are US dollars because we all live in the US and therefore all use US dollars. Using Google Forms, I created a survey with the question "How much money do you spend on clothes in a year?" and posted the form link to a Facebook group called LASA Statistics 2015-16. I wanted to collect this type of data because I wanted to compare how much money people are willing to spend on clothes with the amount they are able to spend on clothes.
Using RStudio, I loaded my data and computed the summary statistics with several commands. For the sample size, I counted the number of people who answered my form. For the five-number summary I ran fivenum(spent$money), and for the mean I ran mean(spent$money). I read the median from the five-number summary. For the range I subtracted the minimum amount of money spent from the maximum. For the standard deviation I ran sd(spent$money), and for the variance I squared the standard deviation. For the IQR I subtracted Q1 from Q3, both taken from the five-number summary.
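As a rough sketch of the same calculations, R's built-in functions can produce each statistic directly. The vector below is a hypothetical five-value stand-in, not the survey responses, and the variable names are assumptions made for illustration.

```r
# Hypothetical stand-in values, NOT the actual survey responses
money <- c(50, 200, 310, 500, 4000)

n <- length(money)                 # sample size
five <- fivenum(money)             # min, lower hinge, median, upper hinge, max
avg <- mean(money)                 # mean
med <- median(money)               # median (same as five[3])
rng <- diff(range(money))          # range = max - min
s <- sd(money)                     # sample standard deviation
v <- var(money)                    # variance, equal to s^2
hinge_spread <- five[4] - five[2]  # IQR taken from the five-number summary

print(five)
print(c(n = n, mean = avg, median = med, range = rng, sd = s, variance = v,
        IQR = hinge_spread))
```

R also has an IQR() function, but it is based on quantiles rather than the hinges from fivenum(), so the two can differ slightly on some data sets.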

Sample Size: 30 people
> fivenum(spent$money)
5-Number Summary: $50, $200, $310, $500, $4000

> mean(spent$money)
Mean: $580.53
Median: $310
Range: $3950
> sd(spent$money)
Standard Deviation: $844.50
> 844.4965*844.4965
Variance: 713174.30 (dollars squared)
IQR: $300
> myboxplot <- boxplot(spent$money)
> myboxplot$out
Outliers: $1000, $1000, $2500, $2000, $4000
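The outliers listed by boxplot() follow from its default rule of flagging points more than 1.5 times the hinge spread beyond the hinges. The sketch below redoes that arithmetic from the reported five-number summary; the variable names are assumptions made for illustration.

```r
# Hinges reported in the five-number summary above
q1 <- 200
q3 <- 500
hinge_spread <- q3 - q1                  # 300, the IQR reported above

# Default boxplot fences: 1.5 hinge spreads beyond each hinge
upper_fence <- q3 + 1.5 * hinge_spread
lower_fence <- q1 - 1.5 * hinge_spread

# Outliers reported by myboxplot$out
reported <- c(1000, 1000, 2500, 2000, 4000)

print(c(lower = lower_fence, upper = upper_fence))
print(all(reported > upper_fence))       # every reported outlier is past the upper fence
```

Because the lower fence is negative and no one can spend less than $0, this rule can only flag outliers on the high end for this data.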

> hist(spent$money, main = "Money Spent on Clothes per Year", xlab = "Amount Spent ($)")

> boxplot(spent$money, main = "Money Spent on Clothes per Year", ylab = "Amount Spent ($)")

> stem(spent$money)

  The decimal point is 3 digit(s) to the right of the |

  0 | 111111222222333334444
  0 | 5557
  1 | 00
  1 | 
  2 | 0
  2 | 5
  3 | 
  3 | 
  4 | 0

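Once 100 is added to every data point in the set, the mean and median each go up by exactly 100, while the standard deviation stays exactly the same. The range, variance, and IQR are also unchanged, since adding a constant shifts every value without changing the spread.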
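The effect of adding a constant can be checked on any small data set. This is a minimal sketch using a hypothetical five-value vector (not the survey responses); the names x and shifted are assumptions made for illustration.

```r
# Hypothetical stand-in values, NOT the actual survey responses
x <- c(50, 200, 310, 500, 4000)
shifted <- x + 100                   # add 100 to every data point

# Measures of center move up by exactly 100
print(mean(shifted) - mean(x))
print(median(shifted) - median(x))

# Measures of spread do not change
print(isTRUE(all.equal(sd(shifted), sd(x))))
print(diff(range(shifted)) == diff(range(x)))
```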

Sample Size: 30 people
> fivenum(data.plus.100$money)
5-Number Summary: $150, $300, $410, $600, $4100
> mean(data.plus.100$money)
Mean: $680.53
Median: $410
Range: $3950
> sd(data.plus.100$money)
Standard Deviation: $844.50

Variance: 713174.30 (dollars squared)
IQR: $300
> bp100 <- boxplot(data.plus.100$money, main = "Money Spent on Clothes per Year", ylab = "Amount Spent ($)")
> bp100$out
Outliers: $1100, $1100, $2600, $2100, $4100

> hist(data.plus.100$money, main = "Money Spent on Clothes per Year", xlab = "Amount Spent ($)")

> boxplot(data.plus.100$money, main = "Money Spent on Clothes per Year", ylab = "Amount Spent ($)")

> stem(data.plus.100$money)

  The decimal point is 3 digit(s) to the right of the |

  0 | 22222233333344444
  0 | 55556668
  1 | 11
  1 | 
  2 | 1
  2 | 6
  3 | 
  3 | 
  4 | 1


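When I increased every value in my original data by 50%, the mean, median, and standard deviation all increased by 50% as well compared with the original data. The variance, which is the standard deviation squared, grew by a factor of 1.5^2 = 2.25.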
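Multiplying every value by a constant can be checked the same way. This is a minimal sketch using a hypothetical five-value vector (not the survey responses); the names x and scaled are assumptions made for illustration.

```r
# Hypothetical stand-in values, NOT the actual survey responses
x <- c(50, 200, 310, 500, 4000)
scaled <- x * 1.5                    # increase every data point by 50%

# Center and spread are both multiplied by 1.5
print(mean(scaled) / mean(x))
print(median(scaled) / median(x))
print(sd(scaled) / sd(x))

# Variance is multiplied by 1.5^2 = 2.25
print(var(scaled) / var(x))
```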

Sample Size: 30 people
> fivenum(fifty$money)
5-Number Summary: $75, $300, $465, $750, $6000
> mean(fifty$money)
Mean: $870.80
Median: $465
Range: $5925
> sd(fifty$money)
Standard Deviation: $1266.75
> 1266.75*1266.75
Variance: 1604643 (dollars squared)
IQR: $450
> bpf <- boxplot(fifty$money, main = "Money Spent on Clothes per Year", ylab = "Amount Spent ($)")
> bpf$out
Outliers: $1500, $1500, $3750, $3000, $6000

> hist(fifty$money, main = "Money Spent on Clothes per Year", xlab = "Amount Spent ($)")

> boxplot(fifty$money, main = "Money Spent on Clothes per Year", ylab = "Amount Spent ($)")

> stem(fifty$money)

  The decimal point is 3 digit(s) to the right of the |

  0 | 111112333333455555666788
  1 | 055
  2 | 
  3 | 08
  4 | 
  5 | 
  6 | 0

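Assuming that my original data is normally distributed (though it's not), the percent that is greater than 5 dollars above the mean is about 49.6% when the z-score is rounded to 0.01 for the z-table (the unrounded z-score of about 0.006 gives about 49.8%). I get each percentage by looking up the z-score that R outputs in my z-table. The percent that is between 3 dollars below my mean and 2 dollars above my mean is about 0.2%, which is effectively 0%, because my standard deviation is so large. The number of dollars required to be in the top 10% is $1661.49 and higher.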

> 580.53+5
[1] 585.53
> (585.53-580.53)/844.50
[1] 0.005920663
> 3/844.5
[1] 0.003552398
> 2/844.5
[1] 0.002368265
> 1.28*844.5
[1] 1080.96

> 1080.96+580.53
[1] 1661.49
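As a cross-check on the z-table lookups, the same normal-model quantities can be computed with R's pnorm() and qnorm(). This is a sketch that uses only the mean and standard deviation reported above; the variable names are assumptions made for illustration. Because R does not round the z-scores to two decimals, the results come out near 49.8%, 0.2%, and $1663, slightly different from the table-based answers.

```r
# Summary statistics reported above
mu <- 580.53
sigma <- 844.50

# Share of a normal distribution more than 5 dollars above the mean
p_above <- 1 - pnorm(mu + 5, mean = mu, sd = sigma)

# Share between 3 dollars below and 2 dollars above the mean
p_between <- pnorm(mu + 2, mean = mu, sd = sigma) -
  pnorm(mu - 3, mean = mu, sd = sigma)

# Dollar amount that marks the start of the top 10%
cutoff <- qnorm(0.90, mean = mu, sd = sigma)

print(round(c(p_above = p_above, p_between = p_between, cutoff = cutoff), 4))
```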

Based on the statistical analysis of my data, I have come to the conclusion that most people spend about $200 to $500 a year on their clothes. I was expecting my data to look more like the results for the data increased by 50%, but it turned out to be much lower than I had assumed. People who spent $1000 or more are all considered outliers in this data because most people spent around $300. Most people either don't have enough money to spend over $1000 on clothes or they just spend their money on things more important than clothes.
