Stat 2

Presentation of Data
Q.1. What are different methods of presentation of Data?
Ans. (i) Classification
(ii) Tabulation
(iii) Diagrams
(iv) Graphs.
Q.2. What is Classification?
Ans. Classification is the process of arranging the data into relatively homogeneous groups
or classes according to their resemblances and affinities.
Q.3. What is Tabulation?
Ans. The systematic arrangement of data in the form of rows and columns for the purpose
of comparison and analysis is known as tabulation.
Q.4. What is an array?
Ans. An arrangement of data in ascending or descending order is called an array.
Q.5. What are the main parts of a table?
Ans. (i) Title
(ii) Box-head
(iii) Stub
(iv) Body
(v) Prefatory Note
(vi) Foot note
(vii) Source Note.
Q.6. What is frequency distribution?
Ans. A frequency distribution is a tabular arrangement of the data that shows the
distribution of observations among different classes.
Q.7. What are class limits?
Ans. The class limits are defined as the values of the variables, which explain the classes.
Q.8. What do you mean by open-end class?
Ans. A class that has no lower class limit or no upper class limit is called an
open-end class.
Q.9. What are class boundaries?
Ans. The class boundaries are the exact values, which break up one class from another
class.
Q.10. What do you mean by class marks (or mid points)?
Ans. A class mark is average value of the lower and upper class limits or class boundaries.
Q.11. What is class interval?
Ans. The difference between the upper and lower class boundaries is called class interval
or class width.
Q.12. What is class frequency?
Ans. The number of values falling in a specified class is called class frequency or
frequency.
Q.13. What is relative frequency?
Ans. The frequency of a class divided by the total frequency is called relative frequency.
Q.14. What is Histogram?
Ans. A histogram is a set of adjacent rectangles for a frequency distribution such that the
area of each rectangle is proportional to the corresponding class frequency.
Q.15. What is a frequency polygon?
Ans. A frequency polygon is a many-sided closed figure that represents a frequency
distribution.
Q.16. How is a frequency polygon constructed?
Ans. It is constructed by plotting the mid points and corresponding frequencies and then
connecting them by straight line segments.
Q.17. What is ogive?
Ans. The cumulative frequency polygon is called ogive.
Q.18. What is a chart?
Ans. A chart is a device used for representing simple statistical data in a simple, clear
and effective manner.
Q.19. What is ungrouped data?
Ans. The fresh data that have been collected for the first time are called ungrouped data.
Q.20. What do you mean by grouped data?
Ans. When ungrouped data are arranged into classes or groups with their
respective frequencies, they are called grouped data.
Q.21. For which distribution is the graph of the frequency distribution bell-shaped?
Ans. For a symmetrical distribution the graph is bell-shaped.
Q.22. Name some graphs of frequency distribution.
Ans. Histogram, polygon, frequency curve, ogive.
Q.23. Name some charts / diagrams.
Ans. Simple bar diagram, sub-divided bar diagram, Multiple bar diagram, Pie chart etc.
Q.24. What is the mid point of class 20-24?
Ans. Mid point is 22.
Q.25. Write the formula of angle of sector used in pie chart.
Ans. Angle of sector = $\frac{\text{Component part}}{\text{Total}} \times 360^\circ$
Example 1: The following data shows the number of children in different families
of a small locality:
1, 2, 4, 3, 0, 1, 2, 3, 1, 1, 0, 2, 1, 0, 2, 3, 0, 0, 1, 3.
Make a frequency distribution. Also find relative frequencies.
Solution:
Range = Maximum value – Minimum value = 4 – 0 = 4

Number of children   Tally      Number of families (f)   r.f = f/Σf
0                    /////      5                        5/20 = 0.25
1                    ///// /    6                        6/20 = 0.30
2                    ////       4                        4/20 = 0.20
3                    ////       4                        4/20 = 0.20
4                    /          1                        1/20 = 0.05
Σ                    ----       20                       1.00
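The tally in Example 1 can be checked with a few lines of Python. This is only a sketch; the variable names (`children`, `freq`, `rel_freq`) are illustrative and not part of the notes.

```python
from collections import Counter

# Number of children in each of the 20 families (data from Example 1).
children = [1, 2, 4, 3, 0, 1, 2, 3, 1, 1, 0, 2, 1, 0, 2, 3, 0, 0, 1, 3]

freq = Counter(children)        # value -> frequency f
total = sum(freq.values())      # total frequency, Σf

# Relative frequency of each value: f / Σf
rel_freq = {x: freq[x] / total for x in sorted(freq)}

for x in sorted(freq):
    print(x, freq[x], round(rel_freq[x], 2))
```

The relative frequencies always add up to 1, which is a quick check on the table.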
Example 2: The following data shows the ages of 50 cancer patients admitted in
Shaukat Khanum Memorial Cancer Hospital, Lahore:
48 29 39 32 54 33 44 36 38 31
46 30 20 44 47 39 42 35 33 47
31 35 34 42 41 42 43 35 32 35
43 36 37 45 46 41 25 27 26 40
38 41 44 47 45 45 52 43 44 43
Make a frequency distribution. Also find class boundaries and mid points.
Solution:
The following steps are involved in constructing a frequency distribution.
i) Range = Maximum value – Minimum value
= 54 – 20 = 34
ii) Approximate number of classes
No. of classes = 1 + 3.322 log n = 1 + 3.322 log(50)
= 1 + 3.322 (1.6990) = 6.6441 ≅ 7 (approximately)
iii) Width of class interval
$h = \frac{\text{Range}}{\text{No. of classes}} = \frac{34}{7} = 4.8571 \cong 5$ (approximately)
iv) Group the entire data with an interval of 5 each and write down the classes in the first
column under the heading “Ages”. Count the actual number falling in each interval putting a
tally (/) in the proper interval for each value. Count the number of tallies for each interval
and write down in the next column, these are frequencies denoted by f.
Ages             Tally                 f     Class boundaries   Mid point (X)
(Class limits)
20 – 24          /                     1     19.5 – 24.5        22
25 – 29          ////                  4     24.5 – 29.5        27
30 – 34          ///// ///             8     29.5 – 34.5        32
35 – 39          ///// ///// /         11    34.5 – 39.5        37
40 – 44          ///// ///// /////     15    39.5 – 44.5        42
45 – 49          ///// ////            9     44.5 – 49.5        47
50 – 54          //                    2     49.5 – 54.5        52
Σ                ----                  50    ----               ----
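The steps of Example 2 can be sketched in Python as follows. The names `ages`, `k`, `h` and `table` are illustrative, and rounding the number of classes and the width up with `math.ceil` is an assumption that happens to reproduce the values 7 and 5 used above.

```python
import math

# Ages of the 50 patients (data from Example 2).
ages = [48, 29, 39, 32, 54, 33, 44, 36, 38, 31,
        46, 30, 20, 44, 47, 39, 42, 35, 33, 47,
        31, 35, 34, 42, 41, 42, 43, 35, 32, 35,
        43, 36, 37, 45, 46, 41, 25, 27, 26, 40,
        38, 41, 44, 47, 45, 45, 52, 43, 44, 43]

n = len(ages)
value_range = max(ages) - min(ages)             # 54 - 20
k = math.ceil(1 + 3.322 * math.log10(n))        # approximate number of classes
h = math.ceil(value_range / k)                  # class width

table = []
for i in range(k):
    lower = min(ages) + i * h                   # lower class limit
    upper = lower + h - 1                       # upper class limit
    f = sum(lower <= a <= upper for a in ages)  # class frequency
    # (limits, frequency, class boundaries, mid point)
    table.append((lower, upper, f, lower - 0.5, upper + 0.5, (lower + upper) / 2))

for row in table:
    print(row)
```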
Example 3: The following data show the scores made by Pakistani cricketers
against New Zealand in a one-day match. Draw a simple bar chart of the following
data:
Cricketers   Inzmam   Waseem   Shahid   Saeed   Imran   Razzaq
Scores       54       47       26       30      25      23
Solution:
[Figure: Simple Bar Chart. Vertical axis: Scores (0 to 60); horizontal axis: Cricketers (Inzmam, Waseem, Shahid, Saeed, Imran, Razzaq).]
Example 4: The following data show the production of wheat in different localities of
the Punjab for the years 1987 to 1989.
Production in Kg (thousands)
Year           1987   1988   1989
Locality I     500    600    800
Locality II    600    700    700
Locality III   200    400    500
i) Make a multiple bar chart
ii) Make a component bar chart
iii) Make a percentage bar chart.
Solution: i)
[Figure: Multiple Bar Chart. Vertical axis: Production (0 to 900); horizontal axis: Years (1987, 1988, 1989); one bar per year for each of Locality I, Locality II and Locality III.]
ii)
Year           1987   1988   1989
Locality I     500    600    800
Locality II    600    700    700
Locality III   200    400    500
Total          1300   1700   2000
[Figure: Component Bar Chart. Vertical axis: Production (0 to 2500); horizontal axis: Years (1987, 1988, 1989); each bar is divided into Locality I, Locality II and Locality III.]
iii)
Percentage Production in Kg (thousands)

Year           1987                      1988                      1989
Locality I     500/1300 × 100 = 38.5     600/1700 × 100 = 35.3     800/2000 × 100 = 40.0
Locality II    600/1300 × 100 = 46.1     700/1700 × 100 = 41.2     700/2000 × 100 = 35.0
Locality III   200/1300 × 100 = 15.4     400/1700 × 100 = 23.5     500/2000 × 100 = 25.0
Total          100                       100                       100
[Figure: Percentage Component Bar Chart. Vertical axis: Production (0% to 100%); horizontal axis: Years (1987, 1988, 1989).]
Example 5: Data are available on the total production of urea fertilizer and
its use on different crops. Total production of urea is 200 (thousand Kg) and its
consumption for the crops wheat, sugarcane, maize and lentils is 75, 80, 30
and 15 (thousand Kg) respectively. Make an appropriate diagram to represent
these data.
Solution:
Angle of sector: $\theta = \frac{\text{Component part}}{\text{Total}} \times 360^\circ$

Crops       Fertilizer (thousand Kg)   Angle of sector
Wheat       75                         75/200 × 360 = 135°
Sugarcane   80                         80/200 × 360 = 144°
Maize       30                         30/200 × 360 = 54°
Lentils     15                         15/200 × 360 = 27°
Total       200                        360°
[Figure: Pie Diagram with sectors Wheat 135°, Sugarcane 144°, Maize 54°, Lentils 27°.]
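The sector angles of Example 5 follow directly from the formula for the angle of a sector. A minimal Python sketch (the dictionary name `usage` is illustrative):

```python
# Urea consumption in thousand Kg for each crop (data from Example 5).
usage = {"Wheat": 75, "Sugarcane": 80, "Maize": 30, "Lentils": 15}

total = sum(usage.values())

# Angle of sector = component part / total * 360 degrees
angles = {crop: kg * 360 / total for crop, kg in usage.items()}

for crop, angle in angles.items():
    print(crop, angle)
```

The angles must add up to 360 degrees, which is a useful check before drawing the diagram.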
Example 6: Make a histogram from the following data:

Marks f
86 – 90 6
91 – 95 4
96 – 100 10
101 – 105 6
106 – 110 3
111 – 115 1
Solution:
Marks       f    Class boundaries
86 – 90     6    85.5 – 90.5
91 – 95     4    90.5 – 95.5
96 – 100    10   95.5 – 100.5
101 – 105   6    100.5 – 105.5
106 – 110   3    105.5 – 110.5
111 – 115   1    110.5 – 115.5
Σ           30   ----
[Figure: Histogram. Vertical axis: frequency (0 to 12); horizontal axis: Class Boundaries (85.5, 90.5, 95.5, 100.5, 105.5, 110.5, 115.5).]
Measures of Central Tendency
Q.1. Define Average. What are its important types?

Ans. Average
An average is a single value, which represents the data. Averages are also called
“measures of central tendency” or “measures of location.”
Types of Averages
The important types of averages are given below:
(i) Arithmetic Mean or Mean
(ii) Median
(iii) Mode.
Q.2. Write down the properties of good average.

Ans. The properties of a good average are given below:
(i) It should be clearly defined.
(ii) It should be easy to calculate.
(iii) It should be simple to understand.
(iv) It should be based on all the observations.
(v) It should not be affected by extreme values.
(vi) It should be capable of mathematical treatment.
Q.3. Define Arithmetic Mean.

Ans. Arithmetic Mean (A.M.)
Arithmetic mean or simply mean is defined as the sum of all the values in a data set
divided by the number of values.
Let X1, X2, X3, ..., Xn be n observations; then the arithmetic mean, denoted by $\bar{X}$, is given by:
$\bar{X} = \frac{X_1 + X_2 + \dots + X_n}{n} = \frac{\sum X}{n}$
Q.4. Write down the properties of Arithmetic Mean.

Ans. The important properties of arithmetic mean are given below:
(i) The sum of deviations of observations from their mean is zero, i.e.,
$\sum (X - \bar{X}) = 0$ for ungrouped data
$\sum f(X - \bar{X}) = 0$ for grouped data
(ii) The mean of a constant is the constant itself, i.e., if X = a then $\bar{X} = a$.
(iii) The arithmetic mean is affected by change of origin and scale. It means that if we add
or subtract a constant from all the values or multiply or divide all the values by a
constant, the mean is affected by the respective change, i.e.,
If $Y = X \pm a$, then $\bar{Y} = \bar{X} \pm a$
If $Y = a \pm bX$, then $\bar{Y} = a \pm b\bar{X}$
If $Y = \frac{X}{a}$, then $\bar{Y} = \frac{\bar{X}}{a}$, where a ≠ 0
(iv) The sum of squares of the deviations of the observations from their mean is minimum,
i.e., $\sum (X - \bar{X})^2$ is minimum.
(v) If n1 values have mean $\bar{X}_1$, n2 values have mean $\bar{X}_2$, and so on up to nk values with mean
$\bar{X}_k$, then the mean of all the values is called the combined mean. It is denoted by $\bar{\bar{X}}$ or $\bar{X}_c$
and is given below:
$\bar{X}_c = \frac{n_1\bar{X}_1 + n_2\bar{X}_2 + \dots + n_k\bar{X}_k}{n_1 + n_2 + \dots + n_k} = \frac{\sum n_i\bar{X}_i}{\sum n_i}$
Q.5. Define weighted arithmetic mean. In what circumstances is it preferred to

ordinary mean and why?
Ans. Weighted Arithmetic Mean
Sometimes all observations are not of equal importance. To show the importance of
each observation we assign to it a value called its weight. If n observations X1, X2, ..., Xn
have the respective weights W1, W2, ..., Wn, then the weighted arithmetic mean, denoted
by $\bar{X}_w$, is obtained as
$\bar{X}_w = \frac{W_1X_1 + W_2X_2 + \dots + W_nX_n}{W_1 + W_2 + \dots + W_n} = \frac{\sum WX}{\sum W}$
When all the values in the data are not of equal importance, it is preferred to the ordinary
mean because it gives relative importance to all the values.
Q.6. Define Median.
Ans. Median: Median is defined as the central value of the arranged data. It is a positional
average denoted by $\tilde{X}$.
$\tilde{X}$ = $\left(\frac{n+1}{2}\right)$th value, for ungrouped data and grouped data (discrete)
$\tilde{X} = l + \frac{h}{f}\left(\frac{n}{2} - C\right)$, for grouped data (continuous)
Median class or median group = the class containing the $\left(\frac{n}{2}\right)$th value

l = lower class boundary of the median class.
h = class interval of the median class.
f = frequency of the median class.
C = cumulative frequency of the class preceding the median class.
Q.7. Define Mode. Write down its methods of calculation.
Ans. Mode: Mode is defined as the most frequent value of the data. It is denoted by $\hat{X}$.
Methods of Calculation of Mode

(i) Mode (for ungrouped data):
In ungrouped data mode is found by inspection. For example, the mode of 2, 8, 7, 3, 9, 3
is 3.
(ii) Mode for frequency distribution (Discrete):
The value corresponding to the maximum frequency.
(iii) Mode for frequency distribution (Continuous):
$\hat{X} = l + \frac{f_m - f_1}{(f_m - f_1) + (f_m - f_2)} \times h$
fm = frequency of the modal class.
l = lower class boundary of the modal class.
f1 = frequency preceding the modal class.
f2 = frequency following the modal class.
h = class interval or width of the modal class.
Q.8. Define harmonic mean?
Ans. Harmonic mean is defined as the reciprocal of the mean of the reciprocals of the
observations
Q.9. Define G.M.
Ans. The geometric mean is defined as the nth root of the product of n positive values.
Q.10(a) What do you mean by unimodal, bimodal, multimodal distributions?
(b) When is it not possible to find the mode?
Ans. (a) Unimodal Distribution: A distribution having a single mode is called a unimodal
distribution.
Bimodal Distribution: A distribution having two modes is called a bimodal distribution.
Multimodal Distribution: A distribution having more than two modes is called a multimodal
distribution.
(b) If each value occurs the same number of times, then it is not possible to find the mode.
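TOPIC: UNGROUPED DATA / GROUPED DATA

Arithmetic Mean
Direct method: $\bar{Y} = \frac{\sum Y}{n}$ (ungrouped); $\bar{Y} = \frac{\sum fY}{\sum f}$ (grouped)
Deviation method: $\bar{Y} = A + \frac{\sum D}{n}$ (ungrouped); $\bar{Y} = A + \frac{\sum fD}{\sum f}$ (grouped), where D = Y – A
Step deviation / coding method: $\bar{Y} = A + \frac{\sum u}{n} \times h$ (ungrouped); $\bar{Y} = A + \frac{\sum fu}{\sum f} \times h$ (grouped), where $u = \frac{Y - A}{h}$
Geometric Mean
$G.M = \sqrt[n]{Y_1 \times Y_2 \times \dots \times Y_n} = \text{antilog}\left(\frac{\sum \log Y}{n}\right)$ (ungrouped); $G.M = \text{antilog}\left(\frac{\sum f \log Y}{\sum f}\right)$ (grouped)
Harmonic Mean
$H.M = \frac{n}{\sum \frac{1}{Y}}$ (ungrouped); $H.M = \frac{\sum f}{\sum \frac{f}{Y}}$ (grouped)
Median
$\tilde{Y}$ = the value of the $\left(\frac{n+1}{2}\right)$th item (ungrouped); $\tilde{Y} = l + \frac{h}{f}\left(\frac{n}{2} - C\right)$ (grouped)
Quartiles (k = 1, 2, 3)
$Q_k$ = the value of the $\frac{k(n+1)}{4}$th item (ungrouped); $Q_k = l + \frac{h}{f}\left(\frac{kn}{4} - C\right)$ (grouped)
Mode
Observation method / inspection method (ungrouped); $\hat{Y} = l + \frac{f_m - f_1}{(f_m - f_1) + (f_m - f_2)} \times h$ (grouped)
Note: The formulae of median, quartiles, deciles and percentiles for a discrete frequency
distribution are the same as those for ungrouped data.
Weighted Arithmetic Mean:
$\bar{Y}_w = \frac{\sum WY}{\sum W}$
For symmetrical distribution:
Mean = Median = Mode
For a moderately skewed distribution:
Mode = 3 Median – 2 Mean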
Example 1: Find arithmetic mean of the following data:
102, 104, 106, 108, 110.
(i) By direct method (ii) By short-cut method
Solution:
X          D = X – A (A = 100)
102        2
104        4
106        6
108        8
110        10
ΣX = 530   ΣD = 30
Arithmetic mean:
(i) Direct method: $\bar{X} = \frac{\sum X}{n} = \frac{530}{5} = 106$
(ii) Short-cut method: $\bar{X} = A + \frac{\sum D}{n} = 100 + \frac{30}{5} = 100 + 6 = 106$
Example 2: Find the average age from the following frequency distribution of ages of
50 patients:
Ages    No. of patients
20-24   1
25-29   4
30-34   8
35-39   11
40-44   15
45-49   9
50-54   2
Solution:
Ages    f    X     fX
20-24   1    22    22
25-29   4    27    108
30-34   8    32    256
35-39   11   37    407
40-44   15   42    630
45-49   9    47    423
50-54   2    52    104
Σ       50   ----  1950
Average age:
$\bar{X} = \frac{\sum fX}{\sum f} = \frac{1950}{50} = 39$
Hence the average age of patients is 39 years.
Example 3: Find the arithmetic mean from the given information:
(i) D = X – 39, ΣD = 240 and n = 10
(ii) $u = \frac{X - 57}{5}$, Σu = 23 and n = 20
(iii) X = 10 + 5u, Σfu = –46 and n = 125
Solution:
(i) ΣD = 240, n = 10
D = X – 39. Comparing with D = X – A gives A = 39.
Arithmetic mean: $\bar{X} = A + \frac{\sum D}{n} = 39 + \frac{240}{10} = 39 + 24 = 63$
(ii) Σu = 23, n = 20
$u = \frac{X - 57}{5}$. Comparing with $u = \frac{X - A}{h}$ gives A = 57, h = 5.
Arithmetic mean: $\bar{X} = A + \frac{\sum u}{n} \times h = 57 + \frac{23}{20} \times 5 = 57 + 5.75 = 62.75$
(iii) X = 10 + 5u, Σfu = –46, n = 125
X – 10 = 5u, so $u = \frac{X - 10}{5}$. Comparing with $u = \frac{X - A}{h}$ gives A = 10, h = 5, n = Σf = 125.
Arithmetic mean: $\bar{X} = A + \frac{\sum fu}{\sum f} \times h = 10 + \left(\frac{-46}{125}\right) \times 5 = 10 + (-1.84) = 8.16$
Example 4: Calculate the weighted mean of the following data:
Items           Expenditure   Weight
Food            290           7.5
Rent            54            2.0
Clothing        98            1.5
Fuel            75            1.0
Miscellaneous   75            0.5
Solution:
Items           X      W      WX
Food            290    7.5    2175
Rent            54     2.0    108
Clothing        98     1.5    147
Fuel            75     1.0    75
Miscellaneous   75     0.5    37.5
Σ               -----  12.5   2542.5
Weighted mean:
$\bar{X}_w = \frac{\sum WX}{\sum W} = \frac{2542.5}{12.5} = 203.4$
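The weighted mean of Example 4 can be reproduced with a short Python sketch. The list names are illustrative; the items are in the order Food, Rent, Clothing, Fuel, Miscellaneous.

```python
# Expenditure (X) and weight (W) for each item, in the order given in Example 4.
expenditure = [290, 54, 98, 75, 75]
weights = [7.5, 2.0, 1.5, 1.0, 0.5]

sum_wx = sum(w * x for w, x in zip(weights, expenditure))   # ΣWX
sum_w = sum(weights)                                        # ΣW

weighted_mean = sum_wx / sum_w
ordinary_mean = sum(expenditure) / len(expenditure)

print(weighted_mean, ordinary_mean)
```

Comparing the two results shows how strongly the heavily weighted item (Food) pulls the weighted mean above the ordinary mean.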
Example 5: The ungrouped data are given below:

45, 30, 35, 40, 44, 32, 42, 37
Calculate geometric mean using:
i) Basic definition ii) log formula
Solution:
i) G.M using basic definition
$G.M = \sqrt[n]{Y_1 \times Y_2 \times \dots \times Y_n} = \sqrt[8]{45 \times 30 \times 35 \times 40 \times 44 \times 32 \times 42 \times 37} = 37.76$

ii) Using log formula
Y     log Y
45    1.6532
30    1.4771
35    1.5441
40    1.6021
44    1.6434
32    1.5051
42    1.6232
37    1.5682
Σ     12.6164
$G.M = \text{antilog}\left(\frac{\sum \log Y}{n}\right) = \text{antilog}\left(\frac{12.6164}{8}\right) = \text{antilog}(1.57705) = 37.76$
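Both routes to the geometric mean in Example 5 can be compared numerically. The sketch below assumes Python 3.8 or later (for `math.prod`); the names `gm_root` and `gm_log` are illustrative.

```python
import math

values = [45, 30, 35, 40, 44, 32, 42, 37]   # data from Example 5
n = len(values)

# i) Basic definition: nth root of the product of the n values.
gm_root = math.prod(values) ** (1 / n)

# ii) Log formula: antilog of the mean of the base-10 logarithms.
gm_log = 10 ** (sum(math.log10(v) for v in values) / n)

print(round(gm_root, 2), round(gm_log, 2))
```

The two results agree up to floating-point rounding; the log formula is the safer choice when the product of the values would be very large.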
Example 6: The following data have been obtained from a frequency distribution using
$u = \frac{Y - 136.5}{2}$. Show that G.M is less than A.M.
u   –4   –3   –2   –1   0    1    2   3
f   2    5    8    18   22   13   8   4
Solution:
u     f     fu     Y = 2u + 136.5   log Y    f log Y
–4    2     –8     128.5            2.1089   4.2178
–3    5     –15    130.5            2.1156   10.5780
–2    8     –16    132.5            2.1222   16.9776
–1    18    –18    134.5            2.1287   38.3166
0     22    0      136.5            2.1351   46.9722
1     13    13     138.5            2.1414   27.8382
2     8     16     140.5            2.1477   17.1816
3     4     12     142.5            2.1538   8.6152
Σ     80    –16    -----            -----    170.6972
$u = \frac{Y - 136.5}{2}$
2u = Y – 136.5
Y = 2u + 136.5,
A = 136.5, h = 2
Arithmetic Mean
$\bar{Y} = A + \frac{\sum fu}{\sum f} \times h = 136.5 + \left(\frac{-16}{80}\right) \times 2 = 136.5 + (-0.4) = 136.1$
$G.M = \text{antilog}\left(\frac{\sum f \log Y}{\sum f}\right) = \text{antilog}\left(\frac{170.6972}{80}\right) = \text{antilog}(2.1337) = 136.05$
A.M = 136.1, G.M = 136.05
It shows that G.M is less than A.M, i.e. G.M < A.M.
Example 7: Find harmonic mean for the following grouped data:

Class boundaries   f
0 – 4              2
4 – 8              5
8 – 12             7
12 – 16            8
16 – 20            7
20 – 24            4
24 – 28            1
Solution:
C.B        f     Y     f/Y
0 – 4      2     2     1.0000
4 – 8      5     6     0.8333
8 – 12     7     10    0.7000
12 – 16    8     14    0.5714
16 – 20    7     18    0.3889
20 – 24    4     22    0.1818
24 – 28    1     26    0.0385
Σ          34    ----  3.7139
$H.M = \frac{\sum f}{\sum \frac{f}{Y}} = \frac{34}{3.7139} = 9.1548$
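The grouped harmonic mean of Example 7 can be checked without rounding the intermediate column. In this Python sketch the names `boundaries`, `freqs` and `mids` are illustrative.

```python
# Class boundaries and frequencies from Example 7.
boundaries = [(0, 4), (4, 8), (8, 12), (12, 16), (16, 20), (20, 24), (24, 28)]
freqs = [2, 5, 7, 8, 7, 4, 1]

# Mid point Y of each class.
mids = [(low + high) / 2 for low, high in boundaries]

sum_f = sum(freqs)                                        # Σf
sum_f_over_y = sum(f / y for f, y in zip(freqs, mids))    # Σ(f/Y)

harmonic_mean = sum_f / sum_f_over_y
print(round(harmonic_mean, 2))
```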
Example 8: Find median from the following data:
(i) c, a, b
(ii) 88.03, 94.50, 94.90, 95.05, 84.60
(iii) 87, 91, 89, 88, 89, 91, 87, 92, 90, 98.
Solution:
(i) The data in an array:
a, b, c
Median = $\left(\frac{n+1}{2}\right)$th value = $\left(\frac{3+1}{2}\right)$th value
= 2nd value = b
(ii) The data in an array:
Sr. No.   1       2       3       4       5
Values    84.60   88.03   94.50   94.90   95.05
Here n = 5
Median = $\left(\frac{n+1}{2}\right)$th value = $\left(\frac{5+1}{2}\right)$th value
= 3rd value = 94.50
(iii) The data in an array:
Sr. No.   1    2    3    4    5    6    7    8    9    10
Values    87   87   88   89   89   90   91   91   92   98
Here n = 10,
Median = $\left(\frac{n+1}{2}\right)$th value = $\left(\frac{10+1}{2}\right)$th value
= (5.5)th value = $\frac{\text{5th value} + \text{6th value}}{2} = \frac{89 + 90}{2} = 89.5$
Example 9: Find the median from the following data of heights of students:
C.I         Frequency
86 – 90     6
91 – 95     4
96 – 100    10
101 – 105   6
106 – 110   3
111 – 115   1
Solution:
C.I         Class boundaries   f     c.f
86 – 90     85.5 – 90.5        6     6
91 – 95     90.5 – 95.5        4     10
96 – 100    95.5 – 100.5       10    20
101 – 105   100.5 – 105.5      6     26
106 – 110   105.5 – 110.5      3     29
111 – 115   110.5 – 115.5      1     30
Σ           ----               30    ----
$\text{Median} = l + \frac{h}{f}\left(\frac{n}{2} - C\right)$
$\frac{n}{2}$th value = $\frac{30}{2}$th value = 15th value
∴ Median class is 95.5 – 100.5
$\text{Median} = 95.5 + \frac{5}{10}(15 - 10) = 95.5 + 2.5 = 98.0$
Example 10: Find mode for the following data:
91, 89, 88, 87, 89, 91, 87, 92, 90, 98, 95, 97, 96, 100, 101, 96, 98, 99, 98, 100, 102,
99, 101, 105, 103, 107, 105, 106, 107, 112.
Solution: Since the most frequent value of the data is 98, Mode = 98.
Example 11: Find mode for the following frequency distribution of heights of
students:
Heights          Frequency
86 ≤ X ≤ 90      6
91 ≤ X ≤ 95      4
96 ≤ X ≤ 100     10
101 ≤ X ≤ 105    6
106 ≤ X ≤ 110    3
111 ≤ X ≤ 115    1
Solution:
Heights          C.I         Class boundaries   f
86 ≤ X ≤ 90      86 – 90     85.5 – 90.5        6
91 ≤ X ≤ 95      91 – 95     90.5 – 95.5        4
96 ≤ X ≤ 100     96 – 100    95.5 – 100.5       10
101 ≤ X ≤ 105    101 – 105   100.5 – 105.5      6
106 ≤ X ≤ 110    106 – 110   105.5 – 110.5      3
111 ≤ X ≤ 115    111 – 115   110.5 – 115.5      1
$\text{Mode} = l + \frac{f_m - f_1}{(f_m - f_1) + (f_m - f_2)} \times h$
Since the maximum frequency is 10, 95.5 – 100.5 is the modal class.
$l = 95.5,\ h = 5,\ f_m = 10,\ f_1 = 4,\ f_2 = 6$
$\text{Mode} = 95.5 + \frac{10 - 4}{(10 - 4) + (10 - 6)} \times 5$
= 95.5 + 3.0 = 98.5
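The modal-class formula of Example 11 can be wrapped in a small function. This is a sketch: `grouped_mode` is a hypothetical helper name, and treating a missing neighbouring class as frequency 0 is an assumption that the notes do not discuss.

```python
def grouped_mode(boundaries, freqs):
    """Mode of a continuous frequency distribution by the modal-class formula."""
    i = freqs.index(max(freqs))                   # position of the modal class
    low, high = boundaries[i]
    fm = freqs[i]                                 # frequency of the modal class
    f1 = freqs[i - 1] if i > 0 else 0             # frequency preceding it
    f2 = freqs[i + 1] if i < len(freqs) - 1 else 0  # frequency following it
    h = high - low                                # class width
    return low + (fm - f1) / ((fm - f1) + (fm - f2)) * h


boundaries = [(85.5, 90.5), (90.5, 95.5), (95.5, 100.5),
              (100.5, 105.5), (105.5, 110.5), (110.5, 115.5)]
freqs = [6, 4, 10, 6, 3, 1]

mode = grouped_mode(boundaries, freqs)
print(round(mode, 2))
```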
Measures of Dispersion
Q.1. What do you mean by dispersion?

Ans. Dispersion means the variability of values about the measures of central tendency.
Q.2. What are important types of dispersion?
Ans. (i) Range (ii) Quartile Deviation
(iii) Mean Deviation (iv) Standard Deviation
(v) Variance.
Q.3. Differentiate between absolute dispersion and relative dispersion.
Ans. An absolute measure of dispersion is expressed in the same units as the original
data. A relative measure of dispersion is independent of the units of measurement.
Q.4. Define range.
Ans. Range is defined as the difference between maximum and minimum values of the data.
Q.5. Define quartile deviation.
Ans. It is defined as “Half of the difference between upper and lower quartiles”.
Q.6. When does range become zero?
Ans. For constant observations, range is zero.
Q.7. Define mean deviation.
Ans. It is defined as
“The arithmetic mean of the absolute values of the deviations from any average.”
Q.8. Define variance.
Ans. It is defined as “The arithmetic mean of squares of deviations of values from their mean.”
Q.9. What will be the variance of 3, 3, 3, 3, 3, 3?
Ans. Zero.
Q.10. Define standard deviation.
Ans. It is defined as :
“The positive square root of the arithmetic mean of squares of deviations from their mean.”
Q.11. If s.d. = 3, then what will be the variance?
Ans. Variance will be 9.
Q.12. If S.D. of 2, 4, 6, 8, 10 is 2.83, then what will be S.D. of 102, 104, 106, 108, 110?
Ans. The S.D. of 102, 104, 106, 108, 110 will be 2.83.
Q.13. Is variance affected by change of scale?
Ans. Yes, variance is affected by the change of scale.
Q.14. Is variance of negative values negative?
Ans. No, variance is always non-negative.
Q.15. What is the utility of standard deviation?
Ans. It has great practical utility in sampling and statistical inference.
Q.16. What is the relationship between mean, median and mode for positively skewed
and negatively skewed distribution.
Ans. For positively skewed distribution
Mean > Median > Mode
For negatively skewed distribution
Mean < Median < Mode
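Topic: Absolute Dispersion / Relative Dispersion

Range
$R = Y_m - Y_0$; Coefficient of Range = $\frac{Y_m - Y_0}{Y_m + Y_0}$
Quartile Deviation or Semi Inter Quartile Range
$Q.D = \frac{Q_3 - Q_1}{2}$; Coefficient of Q.D = $\frac{Q_3 - Q_1}{Q_3 + Q_1}$
Mean Deviation
From the mean: $M.D_{\bar{Y}} = \frac{\sum |Y - \bar{Y}|}{n}$ (ungrouped); $M.D_{\bar{Y}} = \frac{\sum f|Y - \bar{Y}|}{\sum f}$ (grouped)
From the median: $M.D_{\tilde{Y}} = \frac{\sum |Y - \tilde{Y}|}{n}$ (ungrouped); $M.D_{\tilde{Y}} = \frac{\sum f|Y - \tilde{Y}|}{\sum f}$ (grouped)
Standard Deviation
$S.D = \sqrt{\frac{\sum (Y - \bar{Y})^2}{n}}$ (ungrouped); $S.D = \sqrt{\frac{\sum fY^2}{\sum f} - \left(\frac{\sum fY}{\sum f}\right)^2}$ (grouped); Coefficient of S.D = $\frac{S.D}{\text{Mean}}$
Variance
$S^2 = (S.D)^2$; Coefficient of Variation = $\frac{S.D}{\text{Mean}} \times 100$
Co-efficient Measures of Skewness

Karl Pearson's: $S.K = \frac{\text{Mean} - \text{Mode}}{S.D}$ or $S.K = \frac{3(\text{Mean} - \text{Median})}{S.D}$
Bowley's or Quartile: $S.K = \frac{Q_3 + Q_1 - 2\,\text{Median}}{Q_3 - Q_1}$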
Symmetrical Distribution:
S.K = 0
Mean = Median = Mode
Positively Skewed Distribution:

S.K > 0
Mean > Median > Mode
Negatively Skewed Distribution:

S.K < 0
Mean < Median < Mode
Example 1: Find range for each of the following data.

i) 12, 6, 7, –3, 15, 10, 18, 5 & –24
ii) 19, 3, 8, 9, 7, 8, 10, 12, 18 & 21
iii) 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4
Solution:
i) 12, 6, 7, –3, 15, 10, 18, 5 & –24
Range = Ym – Y0
Maximum value = Ym = 18
Minimum value = Y0 = –24
Range = 18 – (–24) = 18 + 24 = 42
ii) 19, 3, 8, 9, 7, 8, 10, 12, 18 & 21
Range = Ym – Y0
Maximum value = Ym = 21
Minimum value = Y0 = 3
Range = 21 – 3 = 18
iii) 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4
Range = Ym – Y0
Maximum value = Ym = 4
Minimum value = Y0 = 4
Range = 4 – 4 = 0
The range of a constant is zero.
Example 2: Find range for the following frequency distribution.
Groups   f
70-74    2
75-79    5
80-84    12
85-89    18
90-94    7
Also find coefficient of dispersion.
Solution:
Groups   f     C.B
70-74    2     69.5 – 74.5
75-79    5     74.5 – 79.5
80-84    12    79.5 – 84.5
85-89    18    84.5 – 89.5
90-94    7     89.5 – 94.5
Range = Ym – Y0
Ym = Upper class boundary of the highest class = 94.5
Y0 = Lower class boundary of the lowest class = 69.5
Range = 94.5 – 69.5 = 25.0
Co-efficient of dispersion (Range) = $\frac{Y_m - Y_0}{Y_m + Y_0} = \frac{94.5 - 69.5}{94.5 + 69.5} = \frac{25.0}{164.0} = 0.1524$
Example 3: Find lower quartile, upper quartile & quartile deviation from the given data:
Groups   Frequency
70-74    2
75-79    5
80-84    12
85-89    18
90-94    7
Solution:
Groups   Frequency   C.B           C.f
70-74    2           69.5 – 74.5   2
75-79    5           74.5 – 79.5   7
80-84    12          79.5 – 84.5   19
85-89    18          84.5 – 89.5   37
90-94    7           89.5 – 94.5   44
Σ        44          -----         -----
Lower quartile:
$Q_1 = l + \frac{h}{f}\left(\frac{n}{4} - C\right) = 79.5 + \frac{5}{12}(11 - 7) = 79.5 + \frac{20}{12} = 79.5 + 1.67 = 81.17$
Upper quartile:
$Q_3 = l + \frac{h}{f}\left(\frac{3n}{4} - C\right) = 84.5 + \frac{5}{18}(33 - 19) = 84.5 + \frac{70}{18} = 84.5 + 3.89 = 88.39$
$Q.D = \frac{Q_3 - Q_1}{2} = \frac{88.39 - 81.17}{2} = \frac{7.22}{2} = 3.61$
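The quartile formula used in Example 3 is the median formula with n/2 replaced by kn/4, so one helper covers both. In this Python sketch `grouped_quantile` is a hypothetical name, and `p` is the fraction of the total frequency (0.25 for Q1, 0.5 for the median, 0.75 for Q3).

```python
def grouped_quantile(boundaries, freqs, p):
    """Value below which a fraction p of a grouped distribution lies."""
    n = sum(freqs)
    target = p * n                    # position, e.g. n/4 for Q1
    cum = 0                           # cumulative frequency C of preceding classes
    for (low, high), f in zip(boundaries, freqs):
        if cum + f >= target:         # first class whose c.f reaches the target
            return low + (high - low) / f * (target - cum)
        cum += f
    return boundaries[-1][1]          # only reached through rounding when p == 1


boundaries = [(69.5, 74.5), (74.5, 79.5), (79.5, 84.5), (84.5, 89.5), (89.5, 94.5)]
freqs = [2, 5, 12, 18, 7]

q1 = grouped_quantile(boundaries, freqs, 0.25)
q3 = grouped_quantile(boundaries, freqs, 0.75)
quartile_deviation = (q3 - q1) / 2
print(round(q1, 2), round(q3, 2), round(quartile_deviation, 2))
```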
Example 4: The ungrouped data are given below:
2, 5, 6, 6, 8, 9, 12, 13, 16, 23
Calculate the average deviation from
i) Mean ii) Median.
Solution:
Y          $|Y - \bar{Y}| = |Y - 10|$   $|Y - \tilde{Y}| = |Y - 8.5|$
2          8                            6.5
5          5                            3.5
6          4                            2.5
6          4                            2.5
8          2                            0.5
9          1                            0.5
12         2                            3.5
13         3                            4.5
16         6                            7.5
23         13                           14.5
ΣY = 100   $\sum |Y - \bar{Y}| = 48$    $\sum |Y - \tilde{Y}| = 46$
i) Average deviation (Mean) = $\frac{\sum |Y - \bar{Y}|}{n}$
Mean = $\bar{Y} = \frac{\sum Y}{n} = \frac{100}{10} = 10$
Average deviation (Mean) = $\frac{\sum |Y - \bar{Y}|}{n} = \frac{48}{10} = 4.8$
ii) Average deviation (Median) = $\frac{\sum |Y - \tilde{Y}|}{n}$
$\tilde{Y}$ = $\left(\frac{n+1}{2}\right)$th value = $\left(\frac{10+1}{2}\right)$th value
= 5.5th value = 5th value + 0.5(6th value – 5th value)
= 8 + 0.5(9 – 8) = 8.5
Average deviation (Median) = $\frac{\sum |Y - \tilde{Y}|}{n} = \frac{46}{10} = 4.6$
Example 5: Calculate the median and the mean deviation from the median for the following
data (the frequencies f and class boundaries C.B are the first two columns of the table):
Solution:
f     C.B             C.f   $|Y - \tilde{Y}|$   $f|Y - \tilde{Y}|$
2     9.25 – 9.75     2     1.57                3.14
5     9.75 – 10.25    7     1.07                5.35
12    10.25 – 10.75   19    0.57                6.84
17    10.75 – 11.25   36    0.07                1.19
14    11.25 – 11.75   50    0.43                6.02
6     11.75 – 12.25   56    0.93                5.58
3     12.25 – 12.75   59    1.43                4.29
1     12.75 – 13.25   60    1.93                1.93
60    -----           ----- -----               34.34
Here Y is the mid point of each class.
$\text{Median} = l + \frac{h}{f}\left(\frac{n}{2} - C\right)$
$\frac{n}{2}$th value = $\frac{60}{2}$th value = 30th value
∴ Median class is 10.75 – 11.25
$\text{Median} = 10.75 + \frac{0.5}{17}(30 - 19) = 10.75 + \frac{5.5}{17} = 10.75 + 0.32 = 11.07$
M.D from Median:
$M.D_{\tilde{Y}} = \frac{\sum f|Y - \tilde{Y}|}{\sum f} = \frac{34.34}{60} = 0.572$
Example 6: Calculate variance & standard deviation from the following data:
102, 104, 106, 108, 110
Solution:
Y      $Y - \bar{Y} = (Y - 106)$   $(Y - \bar{Y})^2$
102    –4                          16
104    –2                          4
106    0                           0
108    2                           4
110    4                           16
530    0                           40
$\bar{Y} = \frac{\sum Y}{n} = \frac{530}{5} = 106$
Variance = $S^2 = \frac{\sum (Y - \bar{Y})^2}{n} = \frac{40}{5} = 8$
$S.D = S = \sqrt{\frac{\sum (Y - \bar{Y})^2}{n}} = \sqrt{8} = 2.83$
Example 7: Determine mean, S.D and C.V from the given data:
Ages    Frequency
20-24   1
25-29   4
30-34   8
35-39   11
40-44   15
45-49   9
50-54   2
Solution:
Ages    f     Y     fY     fY²
20-24   1     22    22     484
25-29   4     27    108    2916
30-34   8     32    256    8192
35-39   11    37    407    15059
40-44   15    42    630    26460
45-49   9     47    423    19881
50-54   2     52    104    5408
Σ       50    ----  1950   78400
Mean = $\bar{Y} = \frac{\sum fY}{\sum f} = \frac{1950}{50} = 39$
$S.D = \sqrt{\frac{\sum fY^2}{\sum f} - (\bar{Y})^2} = \sqrt{\frac{78400}{50} - (39)^2} = \sqrt{1568 - 1521} = \sqrt{47} = 6.856$
$C.V = \frac{S.D}{\text{Mean}} \times 100 = \frac{6.856}{39} \times 100 = 0.1758 \times 100 = 17.58\%$
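The mean, standard deviation and coefficient of variation of Example 7 can be recomputed at full precision with a short Python sketch (the names `mids`, `freqs`, `sd` and `cv` are illustrative):

```python
import math

# Mid points Y and frequencies f of the age classes in Example 7.
mids = [22, 27, 32, 37, 42, 47, 52]
freqs = [1, 4, 8, 11, 15, 9, 2]

n = sum(freqs)                                               # Σf
mean = sum(f * y for f, y in zip(freqs, mids)) / n           # ΣfY / Σf
variance = sum(f * y * y for f, y in zip(freqs, mids)) / n - mean ** 2
sd = math.sqrt(variance)                                     # standard deviation
cv = sd / mean * 100                                         # coefficient of variation (%)

print(mean, variance, round(sd, 3), round(cv, 2))
```

Carrying the unrounded square root of 47 through to the last step avoids the small differences that appear when S.D is rounded first.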
REGRESSION AND CORRELATION

Scatter Diagram: The graphic representation of a set of “n” pairs of bivariate data is
called scatter diagram or scatter plot.
In a scatter diagram we take the independent variable along the horizontal axis (x-axis) and
the dependent variable along the vertical axis (y-axis), and plot the resulting set of points on
graph paper. If a relationship between the variables exists, then the points in the scatter
diagram will show a tendency to cluster around a straight line or some curve. Such a line or
curve around which the points cluster is called the regression line or regression curve, which
can be used to estimate the expected value of the random variable Y from the values of the
non-random variable X. Scatter diagrams can show the following types of relationship between
two variables.
[Figure: Four scatter diagrams of Y against X: Direct Linear Relationship, Inverse Linear Relationship, Curvilinear Relationship, No Relationship.]
Regression: The dependence of one variable (dependent variable) on one or more other
variables (independent variables) is called regression. When we study the dependence of a
variable on a single independent variable, it is called simple regression or two-variable
regression. When the dependence of a variable on two or more variables is studied,
it is called multiple regression.
Regressand: In the regression process the dependent variable is called the regressand. It is also
called the response variable, the predictand variable or the explained variable.
Regressor: In the regression process the independent variable is called the regressor. It is
also called the predictor variable, the controlled variable or the explanatory variable.
Least Squares Principle: The principle of least squares states that the sum of squares
of the residuals of observed values from their corresponding estimated values should be
least.
Properties of the Least Squares Line: Following are the important properties of
the least squares regression line:
(i) The sum of residuals between the observed and the corresponding estimated values is
always zero, i.e.,
$\sum e = \sum (y - \hat{y}) = 0$
(ii) The sum of squares of the residuals $\sum e^2$ is minimum.
(iii) The least squares regression line always passes through the point $(\bar{x}, \bar{y})$.
(iv) It is the best line because a and b are the unbiased estimates of the parameters α and
β.
Correlation: The degree or strength of relationship (interdependence) between the
variables is called correlation.
Examples of correlation; heights and weights of children, ages of husbands and ages of
wives at the time of their marriages, marks of students in mathematics and in statistics etc.
Product Moment Coefficient of Correlation: A numerical measure of strength in
the linear relationship between any two variables is called the Pearson’s product moment
correlation coefficient or coefficient of simple correlation.
The sample linear correlation for n pairs of observations is defined by
$r = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sqrt{\sum (x - \bar{x})^2 \sum (y - \bar{y})^2}}$
(i) Positive Correlation: If both the variables are moving in same direction (increase or
decrease), then it is said to be positive or direct correlation. For example, ages and heights
of children.
(ii) Negative Correlation: If both the variables are moving in opposite direction it is
called negative or inverse correlation. For example, increase in the supply of a commodity
decreases its price.
(iii) No Correlation: If the change in one variable does not affect the other variable,
then there will be no correlation. For example, the head sizes and I.Q’s of persons.
Properties of Coefficient of Correlation: The important properties of coefficient
of correlation are given as follows:
(i) The coefficient of correlation is symmetrical with respect to x and y, i.e.,
$r_{xy} = r_{yx}$
(ii) The correlation coefficient is a pure number, i.e., it does not depend upon the unit of
measurement.
(iii) The correlation coefficient always lies between –1 and +1.
(iv) The correlation coefficient is the geometric mean between the two regression
coefficients, i.e.,
$r = \sqrt{b_{yx} \times b_{xy}}$
r = +ve, if both byx and bxy are +ve.
r = –ve, if both byx and bxy are –ve.
(v) The correlation coefficient is independent of origin and scale, i.e.,
$r_{xy} = r_{uv}$
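Important Points & Formulae

Regression line of y on x:
$\hat{y} = a + bx$ or $\hat{y} = a + b_{yx}x$ (b = byx)
$b_{yx} = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sum (x - \bar{x})^2} = \frac{n\sum xy - (\sum x)(\sum y)}{n\sum x^2 - (\sum x)^2} = \frac{\sum xy - n\bar{x}\bar{y}}{\sum x^2 - n\bar{x}^2}$
$a = \bar{y} - b\bar{x}$ or $a = \bar{y} - b_{yx}\bar{x}$
Regression line of x on y:
$\hat{x} = c + dy$ or $\hat{x} = c + b_{xy}y$ (d = bxy)
$b_{xy} = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sum (y - \bar{y})^2} = \frac{n\sum xy - (\sum x)(\sum y)}{n\sum y^2 - (\sum y)^2} = \frac{\sum xy - n\bar{x}\bar{y}}{\sum y^2 - n\bar{y}^2}$
$c = \bar{x} - d\bar{y}$ or $c = \bar{x} - b_{xy}\bar{y}$
Coefficient of Correlation
$r = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sqrt{\sum (x - \bar{x})^2 \sum (y - \bar{y})^2}} = \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{[n\sum x^2 - (\sum x)^2][n\sum y^2 - (\sum y)^2]}} = \frac{\sum xy - n\bar{x}\bar{y}}{\sqrt{[\sum x^2 - n\bar{x}^2][\sum y^2 - n\bar{y}^2]}}$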
Example 1: The following table shows the ages x and systolic blood pressures y of 12
women.
Age (years) xi       56    42    72    36    63    47    55    49    38    42    68    60
Blood pressure yi    147   125   160   118   149   128   150   145   115   140   152   155
Fit a regression line of blood pressure on age. Estimate the expected blood pressure of
a woman whose age is 45 years. What is the change in blood pressure for a unit
change in age?
Solution:
x      y      xy      x²
56     147    8232    3136
42     125    5250    1764
72     160    11520   5184
36     118    4248    1296
63     149    9387    3969
47     128    6016    2209
55     150    8250    3025
49     145    7105    2401
38     115    4370    1444
42     140    5880    1764
68     152    10336   4624
60     155    9300    3600
628    1684   89894   34416
The estimated line of y on x is
$\hat{y} = a + bx$
$b = \frac{n\sum xy - (\sum x)(\sum y)}{n\sum x^2 - (\sum x)^2} = \frac{12(89894) - (628)(1684)}{12(34416) - (628)^2} = \frac{21176}{18608} = 1.138$
$\bar{x} = \frac{\sum x}{n} = \frac{628}{12} = 52.333$
$\bar{y} = \frac{\sum y}{n} = \frac{1684}{12} = 140.333$
$a = \bar{y} - b\bar{x} = 140.333 - 1.138(52.333) = 80.778$
Hence $\hat{y} = 80.778 + 1.138x$
For x = 45: $\hat{y} = 80.778 + 1.138(45) = 131.988 \cong 132$
The change in blood pressure for a unit change in age is the slope, b = 1.138.
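The least squares fit of Example 1 can be reproduced from the raw data with the computational formula for b. The Python sketch below uses illustrative names (`ages`, `pressure`, `predicted`) and keeps full precision, so the last digits may differ slightly from the hand calculation.

```python
# Ages x and systolic blood pressures y of the 12 women (data from Example 1).
ages = [56, 42, 72, 36, 63, 47, 55, 49, 38, 42, 68, 60]
pressure = [147, 125, 160, 118, 149, 128, 150, 145, 115, 140, 152, 155]

n = len(ages)
sum_x = sum(ages)
sum_y = sum(pressure)
sum_xy = sum(x * y for x, y in zip(ages, pressure))
sum_x2 = sum(x * x for x in ages)

# Slope and intercept of the regression line of y on x.
b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
a = sum_y / n - b * sum_x / n

predicted = a + b * 45     # expected blood pressure at age 45
print(round(a, 3), round(b, 3), round(predicted, 1))
```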

Example 2: The following table gives the number of persons employed and cloth
manufactured in a textile mill.
Persons employed xi      137   209   113   189   176   200   219
Cloth manufactured yi    23    47    22    40    39    51    49
Calculate the coefficient of correlation by using the above formula.
Solution:
x      y     xy      x²       y²
137    23    3151    18769    529
209    47    9823    43681    2209
113    22    2486    12769    484
189    40    7560    35721    1600
176    39    6864    30976    1521
200    51    10200   40000    2601
219    49    10731   47961    2401
1243   271   50815   229877   11345
The correlation co-efficient is
$r = \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{[n\sum x^2 - (\sum x)^2][n\sum y^2 - (\sum y)^2]}}$
$= \frac{7(50815) - (1243)(271)}{\sqrt{[7(229877) - (1243)^2][7(11345) - (271)^2]}}$
$= \frac{18852}{\sqrt{(64090)(5974)}} = \frac{18852}{19567.16} = 0.963$
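The correlation coefficient of Example 2 can be checked with the same computational formula. In this Python sketch the names `persons` and `cloth` are illustrative.

```python
import math

# Persons employed (x) and cloth manufactured (y) from Example 2.
persons = [137, 209, 113, 189, 176, 200, 219]
cloth = [23, 47, 22, 40, 39, 51, 49]

n = len(persons)
sum_x, sum_y = sum(persons), sum(cloth)
sum_xy = sum(x * y for x, y in zip(persons, cloth))
sum_x2 = sum(x * x for x in persons)
sum_y2 = sum(y * y for y in cloth)

numerator = n * sum_xy - sum_x * sum_y
denominator = math.sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))

r = numerator / denominator
print(round(r, 3))
```

A value this close to +1 indicates a strong direct linear relationship between the two variables.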
Example 3. A random sample of 20 pairs of observations (xi, yi) gave the following:
\[ \bar{x} = 2,\quad \bar{y} = 8,\quad \Sigma x^2 = 180,\quad \Sigma y^2 = 1424,\quad \Sigma xy = 404 \]
Estimate the linear regression equation taking (i) X as independent variable (ii) Y as
independent variable.
Solution:
\[ n = 20,\quad \bar{x} = 2,\quad \bar{y} = 8,\quad \Sigma x^2 = 180,\quad \Sigma y^2 = 1424,\quad \Sigma xy = 404 \]
(i) Regression function taking x as independent variable is
ŷ = a + bx
\[ b = \frac{\Sigma xy - n\bar{x}\bar{y}}{\Sigma x^2 - n\bar{x}^2} = \frac{404 - 20(2)(8)}{180 - 20(2)^2} = \frac{84}{100} = 0.84 \]
\[ a = \bar{y} - b\bar{x} = 8 - 0.84(2) = 6.32 \]
Hence ŷ = 6.32 + 0.84x
(ii) Regression function taking y as independent variable is
x̂ = c + dy
\[ d = \frac{\Sigma xy - n\bar{x}\bar{y}}{\Sigma y^2 - n\bar{y}^2} = \frac{404 - 20(2)(8)}{1424 - 20(8)^2} = \frac{84}{144} = 0.583 \]
\[ c = \bar{x} - d\bar{y} = 2 - 0.583(8) = -2.67 \]
Hence x̂ = -2.67 + 0.583y
ANALYSIS OF TIME SERIES
Time Series: A time series consists of numerical data collected, observed or recorded at
successive time periods.
Examples of time series are the hourly temperature recorded by a weather bureau, the
total monthly sales of pens in a book shop, the annual rainfall at Murree, etc.
Analysis of Time Series: Analysis of time series is decomposition of a time series into
its different components for separate study. The basic purpose of analysis of time series is to
use it for forecasting.
Signal: The systematic component of variation in time series is called signal.
Noise: An irregular or random component of variation in time series is called noise.
Historigram: The graph of a time series is called a historigram. It is constructed by taking
time along the x-axis and the time series values along the y-axis. Using an appropriate scale,
the points are plotted and then joined by line segments to get the required historigram.
Components of Time Series: Following are the main components of time series:
(i) Secular trend (T)
(ii) Seasonal variations (S)
(iii) Cyclical movements (C)
(iv) Irregular movements (I)
(i) Secular Trend: A secular trend is a long term movement that indicates the general
direction of the variation in a time series. It represents smooth, steady and gradual
movement in a time series in the same direction.
Examples of secular trend are a decline in death rate due to advances in science, a
continually increasing demand for smaller automobiles, etc.
(ii) Seasonal Variations: Seasonal variations are short term movements that
indicate the identical changes in a time series during the corresponding seasons. The main
causes of these variations are seasons, religious affairs and social customs. Examples of
seasonal variations are the increased sales of cotton cloth in summer, an after-Eid sale in a
departmental store, an increase in employment during summer, etc.
(iii) Cyclical Movements: Cyclical movements refer to the long term oscillations or
swings about the trend line or curve. Since the movements take the form of upward and
downward swings, they are also called "cycles". The four phases of a business cycle
(prosperity, recession, depression and revival) provide an important example of cyclical
movements.
(iv) Irregular Movements: Irregular movements are unsystematic in nature. They
occur in a completely unpredictable manner, caused by chance events such as wars, floods,
earthquakes, strikes, fires, etc. These variations are also called accidental, residual or
random variations. Examples of irregular movements are a fire in a factory delaying
production for 3 weeks, a rise in prices due to floods, etc.
Methods of measuring secular trend in a time series:
(i) Free hand curve method
(ii) Method of semi averages
(iii) Method of moving averages
(iv) Method of least squares
Important Points & Formulae

Coding of x
Origin at          Origin at middle
beginning (x)      Odd numbers     Even numbers
                                   Half unit     One unit
0                  ...             ...           ...
1                  -3              -7            -3.5
2                  -2              -5            -2.5
3                  -1              -3            -1.5
...                0               -1            -0.5
...                1               1             0.5
...                2               3             1.5
...                3               5             2.5
...                ...             7             3.5
...                ...             ...           ...
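* The equation of semi averages is
ŷ = a + bx
\[ \text{where } b = \frac{y_2 - y_1}{x_2 - x_1} \text{ and } a = y_1 - bx_1 \text{ or } a = y_2 - bx_2 \]
* The equation of linear trend is ŷ = a + bx
Normal Equations are:
\[ \Sigma y = na + b\Sigma x \]
\[ \Sigma xy = a\Sigma x + b\Sigma x^2 \]
\[ \text{If } \Sigma x = 0 \Rightarrow a = \frac{\Sigma y}{n},\quad b = \frac{\Sigma xy}{\Sigma x^2} \]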
Examples
Example 1. Make a historigram from the following data:
Year                 1962  1963  1964  1965  1966  1967
Production (tons)      20    28    50    15    18    27
Solution:
[Figure: Historigram. Production (tons, scale 0 to 60) plotted against Year (1961 to 1968).]
Example 2. The following table shows the property damaged by road accidents in Punjab
for the years 1973-79:
Year                1973  1974  1975  1976  1977  1978  1979
Property damaged     201   238   392   507   484   649   742
Find trend values by free hand curve method.
Solution:
Year    Property damaged    Trend value (from graph)
1973    201                 187
1974    238                 278
1975    392                 369
1976    507                 460
1977    484                 551
1978    649                 642
1979    742                 733
[Figure: Free hand trend curve. Property damaged (scale 0 to 800) plotted against Year (1972 to 1980).]
Example 3. From the data given below:
Year     1960  1961  1962  1963  1964  1965  1966  1967  1968  1969
Value     318   326   337   340   359   365   372   381   402   410
Obtain trend values using method of semi averages.
Solution:
Year    y      Semi total    Semi average    x = t - 1960    Trend value ŷ = 316 + 10x
1960    318                                  0               316
1961    326                                  1               326
1962    337    1680          336 = y1        2 = x1          336
1963    340                                  3               346
1964    359                                  4               356
1965    365                                  5               366
1966    372                                  6               376
1967    381    1930          386 = y2        7 = x2          386
1968    402                                  8               396
1969    410                                  9               406
The estimated equation of semi averages is
ŷ = a + bx
\[ b = \frac{y_2 - y_1}{x_2 - x_1} = \frac{386 - 336}{7 - 2} = \frac{50}{5} = 10 \]
\[ a = y_1 - bx_1 = 336 - 10(2) = 336 - 20 = 316 \]
Hence ŷ = 316 + 10x
Example 4. Use the method of semi averages to find trend values for the following data
showing net profit (in lacs of rupees) of SNGPL for the years 1964-72.
Year      1964  1965  1966  1967  1968  1969  1970  1971  1972
Profit      33    86   116    95   101   128   146   110    32
Find the estimated profit in 1994.
Solution:
Since the number of years is odd, the middle year (1968) is left out when forming the two halves.
Year    y      Semi total    Semi average    x      ŷ = 76.05 + 4.3x
1964    33                                   0      76.05
1965    86     330           82.5 = y1       1      80.35
1966    116                  (x1 = 1.5)      2      84.65
1967    95                                   3      88.95
1968    101                                  4      93.25
1969    128                                  5      97.55
1970    146    416           104 = y2        6      101.85
1971    110                  (x2 = 6.5)      7      106.15
1972    32                                   8      110.45
The estimated equation of semi averages is
ŷ = a + bx
\[ b = \frac{y_2 - y_1}{x_2 - x_1} = \frac{104 - 82.5}{6.5 - 1.5} = \frac{21.5}{5} = 4.3 \]
\[ a = y_1 - bx_1 = 82.5 - 4.3(1.5) = 82.5 - 6.45 = 76.05 \]
Hence ŷ = 76.05 + 4.3x
Estimated profit for the year 1994:
x = 1994 - 1964 = 30
For x = 30: ŷ = 76.05 + 4.3(30) = 205.05
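The method of semi averages used in Examples 3 and 4 can be sketched as follows. The function name semi_average_trend is an assumption for illustration; x is coded 0, 1, 2, ... from the first year, and for an odd number of years the middle value is left out, as in Example 4.

```python
def semi_average_trend(values):
    """Return (a, b) of the semi-average line y-hat = a + b*x.

    Hypothetical helper: x = 0, 1, 2, ... counted from the first period.
    With an odd number of values the middle one is omitted from both halves.
    """
    n = len(values)
    half = n // 2
    first, second = values[:half], values[n - half:]
    y1 = sum(first) / half                 # average of the first half
    y2 = sum(second) / half                # average of the second half
    x1 = (half - 1) / 2                    # centre of the first half
    x2 = (n - half) + (half - 1) / 2       # centre of the second half
    b = (y2 - y1) / (x2 - x1)
    a = y1 - b * x1
    return a, b


# Example 3 (10 years, 1960-69) and Example 4 (9 years, 1964-72).
values_ex3 = [318, 326, 337, 340, 359, 365, 372, 381, 402, 410]
profits_ex4 = [33, 86, 116, 95, 101, 128, 146, 110, 32]

a3, b3 = semi_average_trend(values_ex3)
a4, b4 = semi_average_trend(profits_ex4)
trend_ex3 = [a3 + b3 * x for x in range(len(values_ex3))]
print(a3, b3, a4, b4)
```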
Example 5. Find
(i) 3-year
(ii) 5-year moving averages for the following time series.
Year    Value    Year    Value
1948    20       1954    31
1949    23       1955    27
1950    26       1956    38
1951    29       1957    34
1952    23       1958    33
1953    29       1959    35
Solution:
                 3-year moving         5-year moving
Year    Value    total    average      total    average
1948    20
1949    23       69       23
1950    26       78       26           121      24.2
1951    29       78       26           130      26.0
1952    23       81       27           138      27.6
1953    29       83       27.67        139      27.8
1954    31       87       29           148      29.6
1955    27       96       32           159      31.8
1956    38       99       33           163      32.6
1957    34       105      35           167      33.4
1958    33       102      34
1959    35
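A moving average of odd period k is just the mean of each run of k consecutive values, placed against the middle year of the run. A minimal sketch, assuming an illustrative helper named moving_averages:

```python
def moving_averages(values, k):
    """Return the k-period moving averages of values (hypothetical helper).

    For odd k, entry i of the result belongs to the period at index i + k // 2.
    """
    return [sum(values[i:i + k]) / k for i in range(len(values) - k + 1)]


# Data from Example 5 (1948 to 1959).
series = [20, 23, 26, 29, 23, 29, 31, 27, 38, 34, 33, 35]

three_year = moving_averages(series, 3)   # first entry belongs to 1949
five_year = moving_averages(series, 5)    # first entry belongs to 1950
print([round(v, 2) for v in three_year])
print([round(v, 2) for v in five_year])
```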
Example 6. Find out 4year moving average (centred) for the given data.
Year Productio Year Productio
n (tons) n (tons)
1948 50.0 1953 38.1
1949 36.5 1954 32.6
1950 43.0 1955 38.7
1951 44.5 1956 41.7
1952 38.9 1957 41.1
Page # 31
Solution:
Year Productio 4year moving
n total averag average (centred)
e
1948 50
1949 36.5
174.0 43.50
= 42.12
1950 43.0
162.9 40.73
= 40.93
1951 44.5
164.5 41.13 39.83
1952 38.9
154.1 38.53 37.81
1953 38.1
148.3 37.08 37.43
1954 32.6
151.1 37.78 38.16
1955 38.7
154.1 38.53
1956 41.7
1957 41.1
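With an even period the moving averages fall between two years, so each adjacent pair is averaged again to centre it on a year. The sketch below uses an assumed helper name, centred_moving_averages. It works with unrounded intermediate averages, so its results can differ from the table above by about 0.01, because the table rounds the 4-year averages to two decimals before centring.

```python
def centred_moving_averages(values, k):
    """Centred moving averages for an even period k (hypothetical helper)."""
    averages = [sum(values[i:i + k]) / k for i in range(len(values) - k + 1)]
    # Average each adjacent pair so the result lines up with an actual period.
    return [(averages[i] + averages[i + 1]) / 2 for i in range(len(averages) - 1)]


# Data from Example 6 (1948 to 1957); the first centred value belongs to 1950.
production = [50.0, 36.5, 43.0, 44.5, 38.9, 38.1, 32.6, 38.7, 41.7, 41.1]
centred = centred_moving_averages(production, 4)
print([round(v, 3) for v in centred])
```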
Example 7. The following data show the production of steel in a mill for the years
1956-1964.
Year                     1956  1957  1958  1959  1960  1961  1962  1963  1964
Production (000 tons)      60    65    80    73    97   105    93   111   117
(i) Fit the linear trend by the method of least squares by taking the origin at the middle.
Also calculate the trend values.
(ii) Predict the production of steel for the year 1965.
Solution:
Year    y      x     xy      x2    Trend value ŷ = 89 + 7.1x
1956    60     -4    -240    16    60.6
1957    65     -3    -195    9     67.7
1958    80     -2    -160    4     74.8
1959    73     -1    -73     1     81.9
1960    97     0     0       0     89.0
1961    105    1     105     1     96.1
1962    93     2     186     4     103.2
1963    111    3     333     9     110.3
1964    117    4     468     16    117.4
Σ       801    0     424     60
(i) The least squares trend line is
ŷ = a + bx
\[ a = \frac{\Sigma y}{n} = \frac{801}{9} = 89, \qquad b = \frac{\Sigma xy}{\Sigma x^2} = \frac{424}{60} = 7.1 \]
Hence ŷ = 89 + 7.1x
(ii) Prediction of the production of steel for the year 1965 is
For x = 5: ŷ = 89 + 7.1(5) = 124.5
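Taking the origin at the middle makes Σx = 0, so the normal equations reduce to a = Σy/n and b = Σxy/Σx². The sketch below assumes a helper named middle_origin_trend (an illustrative name) that codes x in whole units for an odd number of years and in half units for an even number. It keeps b unrounded (7.067 rather than 7.1), so its 1965 prediction is slightly below the 124.5 obtained above.

```python
def middle_origin_trend(values):
    """Return (a, b, x) for y-hat = a + b*x with the origin at the middle.

    Hypothetical helper: for an odd count x = ..., -1, 0, 1, ... and for an
    even count x = ..., -3, -1, 1, 3, ... (half units), so that sum(x) == 0.
    """
    n = len(values)
    if n % 2 == 1:
        x = [i - n // 2 for i in range(n)]
    else:
        x = [2 * i - (n - 1) for i in range(n)]
    a = sum(values) / n
    b = sum(xi * yi for xi, yi in zip(x, values)) / sum(xi * xi for xi in x)
    return a, b, x


steel = [60, 65, 80, 73, 97, 105, 93, 111, 117]   # Example 7, 1956 to 1964
a, b, x = middle_origin_trend(steel)
prediction_1965 = a + b * 5                        # 1965 corresponds to x = 5
print(f"a = {a:.1f}, b = {b:.3f}, 1965 estimate = {prediction_1965:.1f}")
```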
Example 8. Fit a linear trend to the following data (take origin at the middle and half year
unit).
Year      1991  1992  1993  1994  1995  1996
Value        5     8    12    15    20    24
Also show that sum of residuals is equal to zero.
Solution:
Year    y     x = (t - 1993.5)/0.5    xy     x2    ŷ = 14 + 1.91x    e = y - ŷ
1991    5     -5                      -25    25    4.45              0.55
1992    8     -3                      -24    9     8.27              -0.27
1993    12    -1                      -12    1     12.09             -0.09
1994    15    1                       15     1     15.91             -0.91
1995    20    3                       60     9     19.73             0.27
1996    24    5                       120    25    23.55             0.45
Σ       84    0                       134    70                      0
The least squares trend line is
ŷ = a + bx
Since Σx = 0, the normal equations reduce to:
\[ a = \frac{\Sigma y}{n} = \frac{84}{6} = 14 \]
\[ b = \frac{\Sigma xy}{\Sigma x^2} = \frac{134}{70} = 1.91 \]
Hence ŷ = 14 + 1.91x
Since Σe = 0, the sum of residuals is zero.
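The zero sum of residuals in Example 8 can be confirmed numerically. This is a small sketch using the example's own data and half-year coding; the variable names are illustrative, and b is kept unrounded, so the individual residuals differ slightly from the table while their sum is zero up to floating-point error.

```python
# Data and half-year coding from Example 8 (1991 to 1996).
values = [5, 8, 12, 15, 20, 24]
x = [-5, -3, -1, 1, 3, 5]

# With sum(x) == 0 the normal equations give a and b directly.
a = sum(values) / len(values)
b = sum(xi * yi for xi, yi in zip(x, values)) / sum(xi * xi for xi in x)

# Residual e = y - y-hat for every year, and their total.
residuals = [yi - (a + b * xi) for xi, yi in zip(x, values)]
residual_sum = sum(residuals)
print(f"a = {a:.0f}, b = {b:.2f}, sum of residuals = {residual_sum:.6f}")
```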
INDEX NUMBERS
Q.1. What is index number?
Ans. An index number is a device which measures the changes in a variable or group of
related variables with respect to time or space.
Q.2. What is simple index number?
Ans. An index number is called simple if it measures a relative change in a single variable
with respect to base.
Q.3. Give some examples of simple index number.
Ans. Index number for wages of employees, index number of cotton prices in Sahiwal etc.
Q.4. What is composite index number?
Ans. An index number is called composite index number if it measures a relative change in
a group of related variables with respect to base.
Q.5. What are the types of index number as regard to base?
Ans. (i) Fixed base index
(ii) Chain base index.
Q.6. Define price relative.
Ans. Price relative is the percentage ratio of the price in current year and the price in a
base year.
Q.7. Define link relative.
Ans. Link relative is the percentage ratio of the price in current year and the price in the
preceding year.
Q.8. What is price index number?
Ans. A price index number measures the changes in the whole-sale or retail prices of a
particular commodity or a number of commodities with respect to base.
Q.9. What is quantity index number?
Ans. A quantity index number measures the changes in the quantity or volume of goods
produced or consumed.
Q.10. Define C.P.I.
Ans. A consumer price index number measures the changes in prices of a specified basket
of goods and services consumed in the given period relative to the base period.
Q.12. What do you mean by “basket” of goods?
Ans. The basket of goods and services will contain items like
(i) Food (ii) House rent (iii) Education (iv) Clothing (v) Misc.
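Q.13. Write down the formula of C.P.I.
Ans. (i) \( P_{0n} = \frac{\Sigma p_n q_0}{\Sigma p_0 q_0} \times 100 \) (Aggregate Expenditure Method)
(ii) \( P_{0n} = \frac{\Sigma WI}{\Sigma W} \), where \( I = \frac{p_n}{p_0} \times 100 \) and \( W = p_0 q_0 \) [Weighted Average of Relatives]
Q.14. Write the formula of price relative.
Ans. \( I = \frac{p_n}{p_0} \times 100 \)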
Q.15. What are the other names of cost of living index numbers?
Ans. Consumer price index number or retail price index number.
Q.16. What is whole-sale price index?
Ans. An index number considering the price quotations of whole-sale markets is called
whole-sale price index.
Q.17. What is un-weighted index number?
Ans. An index number that measures the change in the price (or quantity) of a group of
commodities when the relative importance of commodities is not taken into account
is called un-weighted index number.
Q.18. What is weighted index number?
Ans. An index number that measures the change in the prices (or quantities) of a group of
commodities when the relative importance of commodities has been taken into
account is called weighted index number.
Q.19. Name the ideal index number?
Ans. Fisher’s index number is called ideal index number.
Q.20. What is base year weighted index number?
Ans. Laspeyre’s index number is called base-year weighted index number.
Q.21. What is the other name of Paasche’s index?
Ans. Paasche’s index number is also called current year weighted index number.
Q.22. Give two uses and two limitations of index number.
Ans. Uses of index Numbers:
(i) Index numbers are of great help in forecasting business conditions.
(ii) Index numbers are useful in education for I.Q. comparison and effectiveness of
teaching systems.
Limitations of Index Numbers:
(i) All index numbers are not suitable for all purposes.
(ii) Different methods of construction yield different results.
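Important Points and Formulae
Price Relatives: \( P.R = \frac{p_n}{p_0} \times 100 \)
Link Relatives: \( L.R = \frac{p_n}{p_{n-1}} \times 100 \)
Simple Aggregative Index: \( P_{0n} = \frac{\Sigma p_n}{\Sigma p_0} \times 100 \)
Laspeyre's (Base year weighted) Index: \( P_{0n} = \frac{\Sigma p_n q_0}{\Sigma p_0 q_0} \times 100 \)
Paasche's (Current year weighted) Index: \( P_{0n} = \frac{\Sigma p_n q_n}{\Sigma p_0 q_n} \times 100 \)
Fisher's Ideal Index: \( P_{0n} = \sqrt{\frac{\Sigma p_n q_0}{\Sigma p_0 q_0} \times \frac{\Sigma p_n q_n}{\Sigma p_0 q_n}} \times 100 \)
Consumer Price Index / Cost of Living Index
(i) Aggregative Expenditure Method:
\( P_{0n} = \frac{\Sigma p_n q_0}{\Sigma p_0 q_0} \times 100 \)
(ii) Weighted Average of Relatives:
\( P_{0n} = \frac{\Sigma WI}{\Sigma W} \), \( I = \frac{p_n}{p_0} \times 100 \), \( W = p_0 q_0 \)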
Q.1. For the following data construct index numbers by
(i) fixed base and
(ii) chain base method taking 1960 as base:
Year:   1960  1961  1962  1963  1964  1965  1966  1967  1968  1969  1970
Price:    40    47    52    50    54    30    39    45    50    60    55
Solution:
                 (i) Fixed Base           (ii) Chain Base
Year    Price    P.R = (pn/po) × 100      L.R = (pn/pn-1) × 100    Chain indices
1960    40       100                      100                      100
1961    47       117.5                    117.5                    117.5
1962    52       130                      110.64                   130
1963    50       125                      96.15                    125
1964    54       135                      108                      135
1965    30       75                       55.56                    75
1966    39       97.5                     130                      97.5
1967    45       112.5                    115.38                   112.5
1968    50       125                      111.11                   125
1969    60       150                      120                      150
1970    55       137.5                    91.67                    137.5
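The fixed base and chain base constructions can be sketched in code as below. The function names are assumptions made for this illustration. Each chain index is the previous chain index multiplied by the current link relative and divided by 100, which is why, for a single commodity, the chain indices reproduce the fixed base indices.

```python
def price_relatives(prices, base_price):
    """Fixed base index: each price as a percentage of the base price."""
    return [p / base_price * 100 for p in prices]


def link_relatives(prices):
    """Each price as a percentage of the preceding price (first taken as 100)."""
    return [100.0] + [prices[i] / prices[i - 1] * 100 for i in range(1, len(prices))]


def chain_indices(links):
    """Chain index = previous chain index * current link relative / 100."""
    chain = [links[0]]
    for link in links[1:]:
        chain.append(chain[-1] * link / 100)
    return chain


# Prices for 1960 to 1970 from Q.1, with 1960 as base.
prices = [40, 47, 52, 50, 54, 30, 39, 45, 50, 60, 55]
fixed = price_relatives(prices, prices[0])
links = link_relatives(prices)
chain = chain_indices(links)
print([round(v, 2) for v in links])
```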
Q.2. Construct chain indices for the prices of sugar (per kg) for the years 1962-1970.
Year    Price (Rs)    Year    Price (Rs)
1962    0.80          1967    1.42
1963    1.00          1968    1.50
1964    1.20          1969    1.62
1965    1.25          1970    1.75
1966    1.25
Solution:
Year    Price (Rs)    L.R = (pn/pn-1) × 100    Chain Indices
1962    0.80          -                        -
1963    1.00          125                      125
1964    1.20          120                      150
1965    1.25          104.17                   156.26
1966    1.25          100                      156.26
1967    1.42          113.60                   177.51
1968    1.50          105.63                   187.50
1969    1.62          108                      202.5
1970    1.75          108.02                   218.74
Q.3. Construct with the help of the following data:
(i) Laspeyre's (ii) Paasche's Index
        Base Year            Current Year
Item    Price    Quantity    Price    Quantity
A       3        71          3        26
B       2        107         3        88
C       2        62          2        70
Solution:
        Base year     Current year
Item    po     qo     p1     q1     poqo    p1qo    p1q1    poq1
A       3      71     3      26     213     213     78      78
B       2      107    3      88     214     321     264     176
C       2      62     2      70     124     124     140     140
Σ                                   551     658     482     394
(i) Laspeyre's Index:
\[ P_{01} = \frac{\Sigma p_1 q_0}{\Sigma p_0 q_0} \times 100 = \frac{658}{551} \times 100 = 119.42 \]
(ii) Paasche's Index:
\[ P_{01} = \frac{\Sigma p_1 q_1}{\Sigma p_0 q_1} \times 100 = \frac{482}{394} \times 100 = 122.34 \]
Q.4. Construct index number for the year 1992 on the basis of the year 1987 of
the following by using:
(i) Laspeyre's (ii) Paasche's
(iii) Fisher's Ideal Formula
        A                    B                    C
Year    Price    Quantity    Price    Quantity    Price    Quantity
1987    5        10          8        26          6        13
1992    4        12          7        27          5        14
Solution:
        1987          1992
Item    po     qo     p1     q1     p1qo    poqo    p1q1    poq1
A       5      10     4      12     40      50      48      60
B       8      26     7      27     182     208     189     216
C       6      13     5      14     65      78      70      84
Σ                                   287     336     307     360
(i) Laspeyre's Index:
\[ P_{01} = \frac{\Sigma p_1 q_0}{\Sigma p_0 q_0} \times 100 = \frac{287}{336} \times 100 = 85.42 \]
(ii) Paasche's Index:
\[ P_{01} = \frac{\Sigma p_1 q_1}{\Sigma p_0 q_1} \times 100 = \frac{307}{360} \times 100 = 85.28 \]
(iii) Fisher's Ideal Index:
\[ P_{01} = \sqrt{\frac{\Sigma p_1 q_0}{\Sigma p_0 q_0} \times \frac{\Sigma p_1 q_1}{\Sigma p_0 q_1}} \times 100 = \sqrt{\frac{287}{336} \times \frac{307}{360}} \times 100 = 0.85347 \times 100 = 85.35 \]
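The three weighted indices can be sketched as small functions. The function names are illustrative assumptions; Fisher's index is computed as the geometric mean of Laspeyre's and Paasche's indices, which matches the square-root formula used in Q.4.

```python
from math import sqrt


def laspeyres(p0, q0, p1):
    """Base-year weighted price index: sum(p1*q0) / sum(p0*q0) * 100."""
    return sum(p * q for p, q in zip(p1, q0)) / sum(p * q for p, q in zip(p0, q0)) * 100


def paasche(p0, p1, q1):
    """Current-year weighted price index: sum(p1*q1) / sum(p0*q1) * 100."""
    return sum(p * q for p, q in zip(p1, q1)) / sum(p * q for p, q in zip(p0, q1)) * 100


def fisher(p0, q0, p1, q1):
    """Fisher's ideal index: geometric mean of the two indices above."""
    return sqrt(laspeyres(p0, q0, p1) * paasche(p0, p1, q1))


# Q.4 data: items A, B, C in 1987 (base) and 1992 (current).
p0, q0 = [5, 8, 6], [10, 26, 13]
p1, q1 = [4, 7, 5], [12, 27, 14]
print(round(laspeyres(p0, q0, p1), 2), round(paasche(p0, p1, q1), 2),
      round(fisher(p0, q0, p1, q1), 2))
```

Applied to the Q.3 data (p0 = 3, 2, 2; q0 = 71, 107, 62; p1 = 3, 3, 2; q1 = 26, 88, 70), the same functions give the Laspeyre's and Paasche's values worked out there.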
Q.5. Find index numbers
(i) taking the year 1980 as base
(ii) taking the average of 1st three years as base
(iii) taking the average of all the years as base
Year    Price in Rs
1980    22.5
1981    24.0
1982    28.5
1983    30.0
1984    35.0
1985    32.5
1986    37.5
1987    46.5
1988    48.5
Solution:
                  (i)                       (ii)                    (iii)
Year    Prices    P.R = (pn/22.5) × 100     P.R = (pn/25) × 100     P.R = (pn/33.89) × 100
1980    22.5      100                       90                      66.39
1981    24.0      106.67                    96                      70.82
1982    28.5      126.67                    114                     84.10
1983    30.0      133.33                    120                     88.52
1984    35.0      155.56                    140                     103.28
1985    32.5      144.44                    130                     95.90
1986    37.5      166.67                    150                     110.65
1987    46.5      206.67                    186                     137.21
1988    48.5      215.56                    194                     143.11
Σ       305
\[ \text{Average of first three years} = \frac{22.5 + 24.0 + 28.5}{3} = \frac{75}{3} = 25 \]
\[ \text{Average of all the years} = \frac{305}{9} = 33.89 \]
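The three choices of base in Q.5 differ only in the denominator. A minimal sketch follows, assuming an illustrative helper named index_numbers; it uses the unrounded average 305/9 rather than 33.89, which does not change the results at two decimal places.

```python
def index_numbers(prices, base):
    """Each price as a percentage of the chosen base value (hypothetical helper)."""
    return [p / base * 100 for p in prices]


# Prices for 1980 to 1988 from Q.5.
prices = [22.5, 24.0, 28.5, 30.0, 35.0, 32.5, 37.5, 46.5, 48.5]

base_1980 = prices[0]                       # (i) the year 1980 as base
base_first_three = sum(prices[:3]) / 3      # (ii) average of the first three years
base_all_years = sum(prices) / len(prices)  # (iii) average of all the years

for base in (base_1980, base_first_three, base_all_years):
    print([round(v, 2) for v in index_numbers(prices, base)])
```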
Q.6. Find chain indices from the following price relatives
        Price Relatives
Year    A      B      C
1970    100    100    100
1971    103    95     110
1972    112    90     115
1973    115    100    120
1974    120    96     125
1975    125    102    128
Solution:
        Link Relatives = L.R = (pn/pn-1) × 100
Year    A         B         C         Mean      Chain Indices
1970    100       100       100       100       100
1971    103       95        110       102.67    102.67
1972    108.74    94.74     104.55    102.68    105.42
1973    102.68    111.11    104.35    106.05    111.80
1974    104.35    96        104.17    101.51    113.49
1975    104.17    106.25    102.4     104.27    118.34
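The steps of Q.6 can be sketched as follows: link relatives are formed for each commodity, averaged across commodities for each year, and then chained. The function name chain_from_relatives is an assumption for illustration. The code keeps full precision throughout, so its chain indices can differ from the table by a few hundredths, because the table rounds the link relatives and means before chaining.

```python
def chain_from_relatives(series):
    """Chain indices from several commodities' price relatives (hypothetical helper).

    series is a list of equal-length lists, one per commodity.
    """
    n = len(series[0])
    chain = [100.0]  # the first year is the starting point
    for t in range(1, n):
        # Link relative of each commodity for year t, then their simple mean.
        links = [s[t] / s[t - 1] * 100 for s in series]
        mean_link = sum(links) / len(links)
        chain.append(chain[-1] * mean_link / 100)
    return chain


# Price relatives of commodities A, B and C for 1970 to 1975 from Q.6.
relatives = [
    [100, 103, 112, 115, 120, 125],
    [100, 95, 90, 100, 96, 102],
    [100, 110, 115, 120, 125, 128],
]
chain = chain_from_relatives(relatives)
print([round(v, 2) for v in chain])
```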
