Professional Documents
Culture Documents
Central
Tendency
Dr. Tapan Kr. Dutta
Panskura Banamali College
Please
reset to
silent mode
all the
mobile
phone in
this session
Thank you
Predict Mean,
Median and Mode
from the given
dataset by the
shortcut method
Standard Notation
Mean X
Stand. Dev. S
Variance S2 2
Size n N
Numerical Data
Properties & Measures
Numerical Data
Properties
Central
Variation Shape
Tendency
Mean Range Skew
Median Variance
Mode Standard Deviation
Example - 1
Several different
measures of
central tendency
are defined below.
The mean of a sample or a
population is computed by adding all
of the observations and dividing by
the number of observations.
Returning to the example of the five
women, the mean weight would
equal (X1 + X2 + X3 + X4 + X5)/5
= Mean.
To find the median, we arrange the
observations in order from smallest
to largest value. If there is an odd
number of observations, the median
is the middle value. If there is an
even number of observations, the
median is the average of the two
middle values.
The mode is the most
frequently appearing value
in the population or
sample. Suppose we draw
a sample of five women
and measure their weights.
How do I know
which measure of
central tendency
to use?
MEAN
Use the mean to describe the middle of a
set of data that does not have an outlier.
Advantages:
Most popular measure in fields such as
business, engineering and computer
science.
It is unique - there is only one answer.
Useful when comparing sets of data.
Disadvantages:
Affected by extreme values (outliers)
Mean
1. Measure of Central Tendency
2. Most Common Measure
3. Acts as Balance Point
4. Affected by Extreme Values
(Outliers)
5. Formula (Sample Mean)
n
X
i X X X
X i 1 1 2 n
n n
MEDIAN
Use the median to describe the middle of a
set of data that does have an outlier.
Advantages:
Extreme values (outliers) do not affect
the median as strongly as they do the mean.
Useful when comparing sets of data.
It is unique - there is only one answer.
Disadvantages:
Not as popular as mean.
Median
1. Measure of Central Tendency
2. Middle Value In Ordered Sequence
If Odd n, Middle Value of Sequence
If Even n, Average of 2 Middle Values
n +1 5 +1
Positioning Point 3
2 2
Median = 22.6
Median Example
Even-Sized Sample
Raw Data:10.3 4.9 8.9 11.7 6.3 7.7
Ordered: 4.9 6.3 7.7 8.9 10.3 11.7
Position: 1 2 3 4 5 6
n +1 6 +1
Positioning Point 3 .5
2 2
7.7 + 8.9
Median 8.3
2
MODE
Use the mode when the data is non-numeric or when
asked to choose the most popular item.
Advantages:
Extreme values (outliers) do not affect the mode.
Disadvantages:
Not as popular as mean and median.
Not necessarily unique - may be more than one
answer
When no values repeat in the data set, the mode is
every value and is useless.
When there is more than one mode, it is difficult to
interpret and/or compare.
Mode
1. Measure of Central Tendency
2. Value That Occurs Most Often
3. Not Affected by Extreme Values
4. May Be No Mode or Several Modes
5. May Be Used for Numerical &
Categorical Data
Mode Example
No Mode
Raw Data: 10.3 4.9 8.9 11.7 6.3 7.7
One Mode
Raw Data: 6.3 4.9 8.9 6.3 4.9 4.9
More Than 1 Mode
Raw Data: 21 28 28 41 43 43
Central Tendency Solution*
Median
Raw Data: 17 16 21 18 13 16 12 11
Ordered: 11 12 13 16 16 17 18 21
Position: 1 2 3 4 5 6 7 8
n 1 8 1
Positionin g Point 4 .5
2 2
16 16
Median 16
2
What will happen to the
measures of central tendency if
we add the same amount to all
data values, or multiply each
data value by the same amount?
Data Mean Mode Median
Original Data Set: 6, 7, 8, 10, 12, 14, 14, 15, 16, 20 12.2 14 13
Add 3 to each data
9, 10, 11, 13, 15, 17, 17, 18, 19, 23 15.2 17 16
value
Multiply 2 times each
12, 14, 16, 20, 24, 28, 28, 30, 32, 40 24.4 28 26
data value
7 8 9 10 7 8 9 10
Numerical Data
Properties & Measures
Numerical Data
Properties
Central
Variation Shape
Tendency
Mean Range Skew
Median Variance
Mode Standard Deviation
Variance &
Standard Deviation
1. Measures of Dispersion
2. Most Common Measures
3. Consider How Data Are Distributed
4. Show Variation About Mean (X or
) X = 8.3
4 6 8 10 12
Sample Variance Formula
n
c h 2
X X
i
S 2 i 1
n 1
Sample Variance Formula
c h 2
X X n - 1 in denominator!
i (Use N if Population
S 2 i 1 Variance)
n 1
c X
1
X
h c
2
X
2
X
h 2
c X
n
X
h 2
n 1
Sample Standard Deviation
Formula
S S 2
n
cXi X h
2
i 1
n 1
cX 1 X h cX
2
2 X h 2
c
Xn X h 2
n 1
Variance Example
Raw Data:10.3 4.9 8.9 11.7 6.3 7.7
n n
c h
2
X X X
i i
S 2 i 1 where X i 1 8 .3
n 1 n
S2
a f a f a
10 . 3 8 . 3 2 4 . 9 8 . 3 2 7 .7 8 . 3 2 f
6 1
6 . 368
Exercise
Youre a financial
analyst. You have
collected the following
closing stock prices of
new stock issues: 17,
16, 21, 18, 13, 16, 12,
11.
What are the variance
and standard deviation
of the stock prices?
Variation Solution*
Sample Variance
Raw Data: 17 16 21 18 13 16 12 11
n n
cXi X h Xi
2
i 1 i 1
S2 where X 15.5
n 1 n
S 2
a17 15.5 f a16 15.5 f
2 2
a
11 15.5 f 2
8 1
1114
.
Variation Solution*
Sample Standard Deviation
n
cXi X h
2
i 1
S S
2
1114
. 3.34
n 1
68, 59, 57, 64, 52, 60, 62, 57, 61,
61, 71, 51, 59, 54, 65, 67, 54, 62,
58, 60, 54, 62, 65, 71, 63, 60, 61,
56, 67, 64, 57, 61, 60, 62, 59, 57,
64, 58, 61, 63, 62, 62, 60, 58, 67,
63, 64, 61, 60, 65, 67, 70, 58, 51,
61, 62, 65, 52, 60, 55, 63, 62, 60,
67, 55, 62, 61, 64, 57, 59.
By the informations
from above data
following frequency
table can be made
21
7
3