Professional Documents
Culture Documents
STATISTICS
Mean of Grouped Data
The mean (or average) of observations, is the sum of the values of all the
observations divided by the total number of observations.
If x1, x2,. . ., xn are observations with respective frequencies f1, f2, . . ., fn, then
this means observation x1 occurs f1 times, x2 occurs f2 times, and so on.
Now, the sum of the values of all the observations = f1x1 + f2x2 + . . . + fnxn,
and sum of the number of observations = f1 + f2 + . . . + fn.
So, the mean x of the data is given by
f1x1 + f2x2 + . . . + fnxn
x =
f1 + f2 + . . . + fn
n
∑ fixi
x= i= 1
∑
n
i= 1
fi
Solution: To find the mean marks, we require the product of each xi with
the corresponding frequency fi.
Marks Obtained Number of Students fixi
xi fi
10 1 10
20 1 20
36 3 108
40 4 160
50 3 150
56 2 112
60 4 240
70 4 280
72 1 70
80 1 80
88 2 196
92 3 276
95 1 95
Total
∑ fi =30 ∑ fixi =1779
Now, for each class-interval, we require a point which would serve as the
representative of the whole class. It is assumed that the frequency of each class-
interval is centred around its mid-point. So the mid-point (or class mark) of each
class can be chosen to represent the observations falling in the class.
Upper class limit + Lower class limit
Class mark =
2
∑ fixi 1860
x = = = 62 Mean
∑ fi 30
This method of finding the mean is known as the Direct Method.
Here, 59.3 is the exact mean,
while 62 is an approximate mean.
The third step is to find the product of di with the corresponding fi, and take the sum
of all the fi di’s.
Important to Note:
• The step-deviation method will be convenient to apply if all the di’s have a
common factor.
• The mean obtained by all the three methods is the same.
• The assumed mean method and step-deviation method are just simplified
forms of the direct method.
f 1− f 0
Mode = + × h
2 f1− f 0 − f 2
where = lower limit of the modal class,
h = size of the class interval (assuming all class sizes to be equal),
f1 = frequency of the modal class,
f0 = frequency of the class preceding the modal class,
f2 = frequency of the class succeeding the modal class.
7− 3
So, Mode = 40+ 15 = 52
14 − 3 − 6
The mean is the most frequently used measure of central tendency because it
takes into account all the observations, and lies between the extremes, i.e., the
largest and the smallest observations of the entire data. It also enables us to
compare two or more distributions.
However, extreme values in the data affect the mean.
In problems where individual observations are not important, and we wish to find
out a ‘typical’ observation, the median is more appropriate, e.g., finding the typical
productivity rate of workers, average wage in a country, etc. These are situations
where extreme values may be there. So, rather than the mean, we take the median
as a better measure of central tendency.
In situations which require establishing the most frequent value or most popular
item, the mode is the best choice, e.g., to find the most popular T.V. programme
being watched.
Remarks :
1. There is a empirical relationship between the three measures of central tendency :
3 Median = Mode + 2 Mean
may not be the same on both the axes.