Professional Documents
Culture Documents
Percentiles: The th percentile of a group of numbers is that value below which lie %
( percent) of the numbers in the group. The position of the th percentile is given by
( + ) / , where is the number of data points.
Quartiles: The percentage points that break down the data set into quartersrst
quarter, second quarter, third quarter, and fourth quarter.
Powered by statisticsforbusinessiuba.blogspot.com 1
International University IU
+ Interquartile Range = 3rd Quartile 1st Quartile
Example The following data are numbers of passengers on ights of Delta Air Lines between San
Francisco and Seattle over 33 days in April and early May.
128, 121, 134, 136, 136, 118, 123, 109, 120, 116, 125, 128, 121, 129, 130,
131, 127, 119, 114, 134, 110, 136, 134, 125, 128, 123, 128, 133, 132, 136,
134, 129, 132
Find the lower, middle, and upper quartiles of this data set. Also nd the 10th, 15th, and
(Hint: Use ( + ) / )
Powered by statisticsforbusinessiuba.blogspot.com 2
International University IU
3. MODE/MEAN/VARIANCE/STANDARD DEVIATION
Sample Population
Mode
The mode of the data set
is the value that occurs
most frequently.
Mean
The mean of a set of
x= x /n = x /N
observations is their
average.
Variance
The variance of a set of
observations is the
s = (x x) /(n 1) = (x ) /N
average squared
deviation of the data
points from their mean.
Standard Deviation
Powered by statisticsforbusinessiuba.blogspot.com 3
International University IU
Step 06:
Step 07:
Powered by statisticsforbusinessiuba.blogspot.com 4
International University IU
Example Case of population Case of sample
The future Euroyen is the price of the The daily expenditure on food by a
Japanese yen as traded in the European traveler, in dollars in summer 2006, was as
futures market. The following are 30-day follows:
Euroyen prices on an index from 0 to 17.5, 17.6, 18.3, 17.9, 17.4, 16.9, 17.1,
100%: 17.1, 18.0, 17.2, 18.3, 17.8, 17.1, 18.3,
99.24, 99.37, 98.33, 98.91, 98.51, 99.38, 17.5, 17.4.
99.71, 99.21, 98.63, 99.10. Find the mean, standard deviation, and
Find the mean, standard deviation, and variance.
variance, viewed as a population.
(Hint: Use the calculator) (Hint: Use the calculator)
Solution It is not necessary to order the data from smallest to largest in both cases
Powered by statisticsforbusinessiuba.blogspot.com 5
International University IU
The result we can get is 0.414 The result we can get is 0.466
Finally, to calculate the Finally, to calculate the sample
population variance, we use variance, we use the following
the following formula: formula:
= (0.414) 0.172 = (0.466) 0.217
Chebychevs Theorem
STEP 01: Determine the sample mean ( ) and the sample standard deviation ( )
STEP 02: Choose the rule of Chebyshevs theorem and determine the value of
STEP 03: Calculate the interval
STEP 04: Determine the percentage of observations lying into the specified range
(Divide the number of observations lying into the specified range
by the total number of observations in the data set)
STEP 05: Draw a conclusion
Powered by statisticsforbusinessiuba.blogspot.com 6
International University IU
Empirical Rule
Condition: The empirical rule can apply if the distribution of the data is mound-shaped
that is, if the histogram of the data is more or less symmetric with a single mode or high
point.
1. Approximately 68% of the observations will be within 1 standard deviation of the mean.
2. Approximately 95% of the observations will be within 2 standard deviations of the mean.
3. A vast majority of the observations (all, or almost all) will be within 3 standard
deviations of the mean.
STEP 01: Draw the histogram of the data and check the condition that the
distribution of the data is mound-shaped
If the distribution of the data is mound-shaped, follow the next five steps.
Powered by statisticsforbusinessiuba.blogspot.com 7
International University IU
Example Check the applicability of Chebyshevs theorem and the empirical rule for the following
data set
12.5, 13, 14.8, 11, 16.7, 9, 8.3, 1.2, 3.9, 15.5, 16.2, 18, 11.6, 10, 9.5
We found that:
the sample mean = .
the sample standard deviation = .
From the data set itself, we see that there are 14 of 15 observations in the set,
. = . % are within the specified range, so the rule that at least
three-quarters will be within range is satisfied.
Powered by statisticsforbusinessiuba.blogspot.com 8
International University IU
5. BOX PLOT
Introduction
+ A box plot (also called a box-and-whisker plot) is another way of looking at a data set
in an effort to determine its central tendency, spread, skewness, and the existence of
outliers
Powered by statisticsforbusinessiuba.blogspot.com 9
International University IU
THE ELEMENTS OF A BOX PLOT
Powered by statisticsforbusinessiuba.blogspot.com 10
International University IU
Example Construct a box plot for the following data set
5, 8, 6, 9, 17, 24, 10, 5, 6, 13, 5, 3, 6, 12, 11, 10, 9, 10, 14, 15
= 20
The median is the observation in position (20 + 1)50/100 = 10.5, which is 9.5.
The lower quartile is the observation in position (20 + 1)25/100 = 5.25, which is 6.
The upper quartile is the observation in position (20 + 1)75/100 = 15.75, which is 12.75.
The smallest observation is 3.
The largest observation is 24.
Table 1
Table 2
Box Plot
Conclusion:
Based on the box plot, we can see that the distribution of the data is relatively symmetric.
And there is one suspected outlier, 24.
Powered by statisticsforbusinessiuba.blogspot.com 11