You are on page 1of 5

Chapter : 6 REGRESSION ANALYSIS

Regression analysis is a method of predicting or estimating one variable knowing


the value of the other variable. Estimation is required in different fields in every day
life. A businessman wants to know the effect of increase in advertising expenditure
on sales or a doctor wishes to observe the effect of a new drug on patients. An
economist is interested in finding the effect of change in demand pattern of some
commodities on prices.
We observe, different pairs of variables related to each other like saving depends
upon income, cost of production depends upon the number of units produced, the
production depends on the number of workers present on a particular day etc. The
relationship between two variables can be established with the help of any measure
of correlation, as we have seen in the previous chapter. When it is observed that
two variables are highly correlated, it leads to interdependence of the variables. We
can study the cause and effect relationship between them and then we can apply
the regression analysis. The analysis helps in finding a mathematical model of the
relationship. In this chapter we will discuss linear regression only.
There are two types of regression lines viz., regression equation of x on y and
regression equation of y on x. The first one is used to estimate the value of x when
the corresponding value of y is given and the second used to find the value of y
when the corresponding value of x is given. They are given by
Regression equation of x on y is

x x bxy y y

Regression equation of y on x is

y y byx x x

and
where bxy and byx are known as

regression coefficients.
The regression coefficients are given by

bxy r x
y

byx r y
x

bxy byx r x r y

y x

i.e., bxy byx r 2


i.e., r bxy byx

Thus, when the regression coefficients are known to us we can determine the coefficient of correlation.
The regression coefficients can be found independently as

bxy

xy

x y

n
y
2
y

and byx

xy

x y

n
x
2
x

Problems:

1.

From the following data find the two regression equations


estimate y when x = 13 and estimate x when y = 10.
x
:
y
:

1
4

1
0

1
5

1
1

1
2

and hence

[ Answer: 5.2858 & 8.1428 ]


2.

Find the two regression equations and also estimate y when x = 13 and
estimate x when y = 10
x
:
y
:

1
1
1
6

1
4

1
2

1
1

1
5

1
4

1
0
1
7

[ Answer: 17.5359 & 5.0693 ]


3.

The following data represents the marks in Algebra (x) and Geometry (y) of a
group of 10 students. Find both regression equations and hence estimate
y if x = 78 and x if y = 94.
x
:
y
:

7
5
8
2

8
0
7
8

9
3
8
6

6
5
7
2

8
7
9
1

7
1
8
0

9
8
9
5

6
8
7
2

8
9
8
9

7
7
7
4

[ Answer: 80.394 ~ 80 and 94.9337 ~ 95 ]


4.

Find the regression equations for the following data and hence estimate y
when x = 15 and x when y = 18.
x
:
y
:

1
0
2
0

1
2
2
4

1
4
2
5

1
9
2
1

1
1
2
2

8
1
6

1
7
2
0

[ Answer: 21.64 & 11.54 ]


5.

From the following data, find the regression equations and further estimate y
if x = 16 and x if y = 18.
x
:
y
:

1
2

1
1

1
5

1
0
1
6

1
2
1
9

1
3
1
7

[ Answer: 20.32 & 11.8 ]

6.

The heights in cms of a group of mothers and daughters are given below.
Find the regression equations and hence find the most likely height of a
mother when the daughters height is 164 cms. Also obtain the estimate of
the height of a daughter when mothers height is 162 cms.
Height of mother
(x):
Height of
daughter (y):

15
7
16
2

16
0
15
9

16
3
16
5

16
5
16
7

16
7
17
2

16
8
17
0

17
0
16
8

16
4
16
6

[ Answer: 164.2942 & 162.4556 ]


7.

The following data gives the marks obtained at the preliminary examination
(x) and the final examination (y) for a group of 10 students. Obtain the
regression equation of y on x. Hence find the most probable marks at the
final examination of a student who has scored 70 marks at the preliminary
examination.
x
:
y
:

5
4
5
8

6
5
6
7

7
5
7
6

8
2
8
0

5
7
6
0

5
9
6
4

6
0
6
5

6
4
6
5

5
8
6
0

6
2
7
0

[ Answer: 71.21 ]
8.

The following data gives the expenditure on advertising, expressed in


hundreds of Rs. (x) and the expenditure on office staff, expressed in
thousands of Rs. (y) for 7 different companies. Find the two regression
equations and hence estimate the expenditure on advertising when
expenditure on office staff is Rs.97,000.
Also obtain the most likely
expenditure on office staff when the advertising expenditure is Rs.13,200.
x
:
y
:

12
9

13
7

13
8

13
5

13
9

13
4

98

94

93

91

96

95

14
5
10
0

[ Answer: Rs.13746 and Rs.94780 ]


9.

The following data represents the figures in kgs. for demands of two
commodities A and B. Find (i) the most probable demand for A when demand
for B is 107 kgs. (ii) the most probable demand for B when demand for A is
115 kgs.
x
:
y
:

10
7
10
5

11
3
11
0

10
9
10
3

10
3
10
0

11
0
11
0

11
7
11
1

11
4
10
8

[ Answer: 110.7069 & 110.0996 ]


10.

From the following data, estimate y when x = 142 and x when y = 126.
x
:
y
:

14
0
12
2

15
5
13
5

16
3
14
0

16
7
13
9

14
5
12
5

15
0
13
0

14
8
12
5

[ Answer: 123.25 & 146.3 ]


11.

For a bivariate distribution, the following results are obtained.


Mean value of x = 65
Mean value of y = 53
Standard deviation = 4.7 Standard deviation = 5.2
Coefficient of correlation = 0.78
Find the two regression equations and hence obtain
i.

The most probable value of y when x = 63

ii.

The most probable value of x when y = 50


[ Answer: 51.274 & 62.885 ]

12.

Given that for 10 pair of observations.

x 88.1, y 89.4, x x

y y 326.6, x x

414.9

Find the regression equation of y on x and then estimate y when x = 78.


[ Answer: 81.4495 ]
13.

The averages for rainfall and yield of a crop are 42.7 cms and 850 kgs
respectively. The corresponding standard deviations are 3.2 cms and 14.1
kgs. The coefficient of correlation is 0.65. Estimate the yield when the
rainfall is 39.2 cms.
[ Estimated yield is 839.99 kgs. ]

14.

The regression equation of supply in thousands of Rs.(y) on price in


thousands of Rs.(x) is 2 x 5 y 60 0 . The average supply is Rs.18,000. The
ratio of standard deviation of supply and price is

2
. Find the average price
3

and the coefficient of correlation between supply and price.


is Rs.15,000 & r = 0.6 ]
15.

Find the regression equation and hence estimate y when x = 56 and x


when y = 45.

x 54, x 5.3, y 40, y 2.8, r 0.85


16.

[ Average price

[ 40.9 & 62.04 ]

Given the two regression equations, find


(i)
mean values of x and y
(ii)
coefficient of correlation
where the equation are 2 x 3 y 5 0 and 5 x 8 y 13 0
[ 1, 1 and 0.9682 ]

17.

Given the regression equations


2 x 3 y 5 0 and x y 2 0 . Find the means of x and y and the coefficient
of correlation.
[1, 1 and -0.8165 ]

18.

Find the mean values of x and y and correlation coefficient, if the regression
equations are 2 x y 3 0 and 2 x 3 y 5 0 [ 1, 1 and -0.5774 ]

19.

Given the regression equations 3 x y 25 0 and 2 x 3 y 30 0 , find


i.
Mean values of x and y
ii.
Coefficient of correlation

iii.
iv.

Estimate of y when x = 17
Estimate of x when y = 25
[ 15, 20, 0.4714, 21.33 and 16.67 ]

20.

Find the two regression equations for the following:

x 30, y 50, x 4.1, y 5.2, r 0.8


[ y 1.0146 x 19.562, x 0.6308 y 1.54 ]
21.

The regression equation of income (x) on expenditure (y) is 3 x 2 y 3900 0 .


The ratio of the standard deviation of income and expenditure is 4 : 3. Find
the coefficient of correlation between income and expenditure. Also find the
average income if the average expenditure is Rs.1800. [2500, 0.5 ]

22.

From
i.
ii.
iii.
iv.

the following regression equations 2 x y 17 0 and 4 x 3 y 1 0 , find


Mean values of x and y
Coefficient of correlation
Most probable value of y when x = 28
Most probable value of x when y = 35.
[ 25, 33, 0.8164, 37,
26 ]

You might also like