Professional Documents
Culture Documents
Aspen Curtis
Mr. Price
MATH 1040
09 January 2018
Introduction
This project is determining the proportions of the different colors of Skittles within a
Procedure
2. Divide the Skittles into groups of Red, Orange, Yellow, Green, and Purple
5. Create a pie-chart
6. Collect the number summaries from four other classmates, five including yours
Categorical Data
Red 0.191
Orange 0.222
Yellow 0.175
Green 0.206
Purple 0.206
Curtis 2
Group Statistics
57 67 59 58 60 301
proportion
red yellow orange green purple
0.189 0.223 0.196 0.193 0.199
Curtis 3
Observation
I was really quite surprised with the results of the proportion of different colored Skittles.
I truly thought that some of the colors would make up more of a majority, but in reality all of the
colors are approximately the same. Even after taking samples from four of my classmates, the
results evened out more. Both graphs looked so similar and the proportions were so similar.
Quantitative Data
Number of Bags: 29
Mean: 59
Observation
The frequency histogram is pretty normally distributed. There are no known outliers with
the samples collected. The overall data is slightly lower than mine- which is what I was
expecting to be-but being that my bag of candies was the maximum of the class total, overall it
Reflection
Categorical data is types of data that can be divided into groups whereas quantitative data
expresses a certain quantities or range. So, in this experiment, the categorical data of the different
colors of skittles showed proportions of what was in the bag, whereas the quantitative took only
the individual amounts of each person’s bag of Skittles, with no care to what color or category it
fit into. Graphs like pie and pareto charts will give the best representation of the categorical data
because it shows the individual categories and their proportions. Box plots, histograms, and bar
graphs display quantitative data the best because it displays the numerical and general
used the proportions to express the certain categories from the bags. When using calculations to
find mean, median, and standard deviation will show how the bag size differs throughout the
Confidence Intervals
A confidence interval gives a parameter in which a specified probability lies and it serves
to the original categorical data, one can see that both my and my groups combined data lie in that
interval.
The 95% interval for the true mean number of candies per bag means that the true mean
will lie between 60 and 60.4, which again, if looking at both my and my groups summaries,
some numbers lie between that interval, making those numbers closer to the true mean.
Hypothesis Testing
This is when one conducts a test on an assumption about a population parameter, and can
.05 Significance Level Test Claim 20% all Skittles are Red: Fail to Reject the Null
.01 Significance Level Test Claim of Mean of 55 Skittles: Fail to Reject the Null
be rejected. It is the same with the mean of 55 skittles, as close as the p value does come, it is
Reflection
In order to do an interval estimate, the data must have a sample statistic, a specific
confidence level, and a margin of error. To do a hypothesis test one must have a normality of
data, failure outcomes, level of significance and a null hypothesis. All of my samples met the
conditions for both interval estimates and hypothesis testing. Some errors could have included
type 1-rejecting the null when it should not have been, type 2-failing to reject the null when it it
should have been rejected, and incorrect data. The sampling could have been taken on a larger
scale, giving a bigger sample which would display more accurate data. In conclusion, I have
Curtis 7
discovered how math can be applied to real world problems, which gives math and formulas