You are on page 1of 5

Finish Line & Beyond

STATISTICS
Mean of Grouped Data
The mean (or average) of observations, is the sum of the values of all the
observations divided by the total number of observations.

If x1, x2,. . ., xn are observations with respective frequencies f1, f2, . . ., fn, then
this means observation x1 occurs f1 times, x2 occurs f2 times, and so on.
Now, the sum of the values of all the observations = f1x1 + f2x2 + . . . + fnxn,
and sum of the number of observations = f1 + f2 + . . . + fn.
So, the mean x of the data is given by
f1x1 + f2x2 + . . . + fnxn
x =
f1 + f2 + . . . + fn
n

∑ fixi
x= i= 1


n
i= 1
fi

Example: The marks obtained by 30 students of Class X of a certain school in a


Mathematics paper, consisting of 100 marks are presented in table below. Find the
mean of the marks obtained by the students.
Marks Obtained 10 20 36 40 50 56 60 70 72 80 88 92 95
Xi
Number of Students 1 1 3 4 3 2 4 4 1 1 2 3 1
fi

Solution: To find the mean marks, we require the product of each xi with
the corresponding frequency fi.
Marks Obtained Number of Students fixi
xi fi
10 1 10
20 1 20
36 3 108
40 4 160
50 3 150
56 2 112
60 4 240
70 4 280
72 1 70
80 1 80
88 2 196
92 3 276
95 1 95
Total
∑ fi =30 ∑ fixi =1779

www.excellup.com ©2009 send your queries to enquiry@excellup.com


Finish Line & Beyond
∑ fixi 1779
x= = = 59.3
∑ fi 30

Direct Method of Finding Mean:


In most of our real life situations, data is usually so large that to make a meaningful
study it needs to be condensed as grouped data. So, we need to convert given
ungrouped data into grouped data and devise some method to find its mean.
Let us convert the ungrouped data of Example 1 into grouped data by forming
class-intervals of width, say 15. Remember that, while allocating frequencies to each
class-interval, students falling in any upper class-limit would be considered in the
next class, e.g., 4 students who have obtained 40 marks would be considered in the
classinterval 40-55 and not in 25-40. With this convention in our mind, let us form a
grouped frequency distribution table.

Class Intervals 10 – 25 25 – 40 40 – 55 55 – 70 70 – 85 85 – 100


Number of Students 2 3 7 6 6 6

Now, for each class-interval, we require a point which would serve as the
representative of the whole class. It is assumed that the frequency of each class-
interval is centred around its mid-point. So the mid-point (or class mark) of each
class can be chosen to represent the observations falling in the class.
Upper class limit + Lower class limit
Class mark =
2

Class Interval Number of Students Class Mark Fixi


(fi) (xi)
10 -25 2 17.5 35.0
25 – 40 3 32.5 97.5
40 – 55 7 47.5 97.5
55 – 70 6 62.5 375.0
70 – 85 6 77.5 465.0
85 – 100 6 92.5 555.0
Total fi= 30 fixi = 1860.0

∑ fixi 1860
x = = = 62 Mean
∑ fi 30
This method of finding the mean is known as the Direct Method.
Here, 59.3 is the exact mean,
while 62 is an approximate mean.

Assumed Mean Method:


Sometimes when the numerical values of xi and fi are large, finding the product
of xi and fi becomes tedious and time consuming. So, for such situations, let us think
of a method of reducing these calculations.
Nothing can be done with the fi’s, but each xi can be changed to a smaller number
to make easier calculations.
The first step is to choose one among the xi’s as the assumed mean, and denote

www.excellup.com ©2009 send your queries to enquiry@excellup.com


Finish Line & Beyond
it by ‘a’. Also, to further reduce our calculation work, we may take ‘a’ to be that xi
which lies in the centre of x1, x2, . . ., xn. So, we can choose a = 47.5 or a = 62.5.

Let us choose a = 47.5.


The next step is to find the difference di between a and each of the xi’s, that is,
the deviation of ‘a’ from each of the xi’s.
i.e., di = xi – a = xi – 47.5

The third step is to find the product of di with the corresponding fi, and take the sum
of all the fi di’s.

Class Number of Class Mark di = xi – 47.5 fidi


Interval Students (fi) (xi)
10 – 25 2 17.5 -30 -60
25 – 40 3 32.5 -15 -45
40 -55 7 47.5 0 0
55 – 70 6 62.5 15 90
70 – 85 6 77.5 30 180
85 – 100 6 92.5 45 270
Total
∑ fi = 30 ∑ fidi = 435

So, the mean of Deviations


∑ fidi = 435
d= = 14.5
∑ fi 30
Since d is obtained by subtracting a from xi so x can be obtaines as follows:
x = d+a
= 14.5 + 47.5 = 62
This method is called Assumed Mean Method

Step Deviation Method of Finding Mean:


In the above method all the values of di are multiples of 15 which is equivalent to
class size. So to make calculations more simple let us divide all the values of di by
15 to arrive at smaller numbers for fidi

Class Number of Class di = xi – xi − a fiui


Interval Students (fi) Mark (xi) 47.5 ui=
h
10 – 25 2 17.5 -30 -2 -4
25 – 40 3 32.5 -15 -1 -3
40 -55 7 47.5 0 0 0
55 – 70 6 62.5 15 1 6
70 – 85 6 77.5 30 2 12
85 – 100 6 92.5 45 3 18
Total
∑ fi = 30 ∑ fiui =29

www.excellup.com ©2009 send your queries to enquiry@excellup.com


Finish Line & Beyond
∑ fiui = 29
u=
∑ fi 30
29
x =a+h u = 47.5 + 15 × = 62
30
(Since u was arrived at after dividing d by the class size so a+d =a+hu)
This method is called Step Deviation Method.

Important to Note:
• The step-deviation method will be convenient to apply if all the di’s have a
common factor.
• The mean obtained by all the three methods is the same.
• The assumed mean method and step-deviation method are just simplified
forms of the direct method.

Mode of Grouped Data


A mode is that value among the observations which occurs most often, that is, the
value of the observation having the maximum frequency. It is possible that more
than one value may have the same maximum frequency. In such situations, the data
is said to be multimodal.

 f 1− f 0 
Mode =  +   × h
 2 f1− f 0 − f 2 
where  = lower limit of the modal class,
h = size of the class interval (assuming all class sizes to be equal),
f1 = frequency of the modal class,
f0 = frequency of the class preceding the modal class,
f2 = frequency of the class succeeding the modal class.

Let us calculate the mode from the above table:


 =40
h = 15
f1 = 7
f0 = 3
f2 = 6

 7− 3 
So, Mode = 40+   15 = 52
 14 − 3 − 6 

Median of Grouped Data


Median is a measure of central tendency which gives the value of the middle-most
observation in the data. First step to find the median is to group data in ascending
order.
n+ 1
If the middle observation is at odd place then median =
2

www.excellup.com ©2009 send your queries to enquiry@excellup.com


Finish Line & Beyond
n n
+ +1
If the middle observation is at the even place then median = 2 2
2
That is, it is the average of the middle and successor of middle observations.

Let us take data from the first table:

Marks Obtained Frequency Cumulative frequency


10 1 1
20 1 2
36 3 5
40 4 9
50 3 12
56 2 14
60 4 18
70 4 22
72 1 23
80 1 24
88 2 26
92 3 29
95 1 30

Here number of Observations is 30


So, Median will be average of the 15th and the 16th observations.
15th Observation is 60
16th Observation is 70
Average = 65 Median

The mean is the most frequently used measure of central tendency because it
takes into account all the observations, and lies between the extremes, i.e., the
largest and the smallest observations of the entire data. It also enables us to
compare two or more distributions.
However, extreme values in the data affect the mean.

In problems where individual observations are not important, and we wish to find
out a ‘typical’ observation, the median is more appropriate, e.g., finding the typical
productivity rate of workers, average wage in a country, etc. These are situations
where extreme values may be there. So, rather than the mean, we take the median
as a better measure of central tendency.

In situations which require establishing the most frequent value or most popular
item, the mode is the best choice, e.g., to find the most popular T.V. programme
being watched.
Remarks :
1. There is a empirical relationship between the three measures of central tendency :
3 Median = Mode + 2 Mean
may not be the same on both the axes.

www.excellup.com ©2009 send your queries to enquiry@excellup.com

You might also like