Professional Documents
Culture Documents
Regression Analysis
Firm ABC wants to determine the relationship between its advertisement expenses and sales. The manger of
marketing department is asking to increase the budget of advertisement He sad that higher advertisement
expense bring increase in sales and in this way firm can earn more profit. Now management wants to test
this hypothesis that “ the higher advertisement expenditures lead to higher sales for firm ” to ensure that
their investment will be beneficial for organization. Management is also interested to know how much sales
increase for each Rupees increase in advertisement Manger collects the data for last 10 years. In this case,
advertisement (X) is independent variable and sales (Y) are dependent variable that manager seeks to
explain.
Sales and Advertisement Expenditures for firm ABC
(In Millions of Rupees)
1 20 88
2 18 80
3 22 84
4 24 92
5 22 96
6 24 104
7 26 108
8 26 116
9 28 112
10 30 120
We can draw the following graph with the help of given data. In this graph the independent variable
is measured along with the horizontal axis and sales are measured along the vertical axis. Regression line is
/opt/scribd/conversion/tmp/scratch0/24151088.doc -1-
Managerial Economics in Global Economy (Regression Analysis)
showing the positive relation between these two variables, now we can also prove this relationship with the
help of statistical formulas.
Graph 0.1
0.1
Different methods can be applied to prove this hypothesis. In this case we will use these methods.
⇒ The Ordinary Least-Squares Methods
⇒ Test of Significance or Parameter Estimates
⇒ T-Test
⇒ Coefficient of determination
⇒ Test of significance
⇒ The Multiple Regression Model
/opt/scribd/conversion/tmp/scratch0/24151088.doc -2-
Managerial Economics in Global Economy (Regression Analysis)
This is the most popular method of parameter estimation for coefficients of regression models. It has
well known probability distributions and gives unbiased estimators of regression parameters with the
smallest variance. The objective of regression line is to obtain the estimates of “a” (vertical intercept) and
“b” (the slope) of regression line:
Ŷt = + Xt
Where Ŷt is the estimation of firm’s sales and in the year “t” and Xt is the indicates the level of
advertisement and and are estimation of parameters.
2 18 80 -6 -20 120 36
3 22 84 -2 -16 32 4
4 24 92 0 -8 0 0
5 22 96 -2 -4 8 4
6 24 104 0 4 0 0
7 26 108 2 8 16 4
8 26 116 2 16 32 4
9 28 112 4 12 48 16
10 30 120 6 20 120 36
Estimation Procedure n
∑(X t − X )(Yt − Y )
b= t =1
n =
∑ t
( X −
t =1
X ) 2
a= Y − bX
• = 100 - 3.533(24)
/opt/scribd/conversion/tmp/scratch0/24151088.doc -3-
Managerial Economics in Global Economy (Regression Analysis)
• = 15.20
Thus the equation of regression is….
Ŷt = + Xt
Ŷ= 15.20-3.533(X)
This regression indicates that having the zero value of “X” (advertisement expense) the
level of sales is 15.20 million. If the value of “X” is 20 million as shown in the first year
the level of sales will increase from 15.20 to 85.86 million.
So, we can calculate the increase in sales with each Rupees increase in advertisement
expense….
Ŷ= 15.20-3.533(X)
After putting the values of “X” in different years….
Ŷ= 15.20-3.533(20) = 85.86
Ŷ= 15.20-3.533(18) = 78.79
Ŷ= 15.20-3.533(22) = 92.92
Ŷ= 15.20-3.533(24) = 99.99
Ŷ= 15.20-3.533(22) = 92.92
Ŷ= 15.20-3.533(24) = 99.99
Ŷ= 15.20-3.533(26) = 107.05
Ŷ= 15.20-3.533(26) = 107.05
Ŷ= 15.20-3.533(28) = 114.12
Ŷ= 15.20-3.533(30) = 121.19
Test of Significance Standard Error
/opt/scribd/conversion/tmp/scratch0/24151088.doc -4-
Managerial Economics in Global Economy (Regression Analysis)
To test the hypothesis that b is significant (advertisement positively affects the sales) we
need to calculate the standard error (deviation) of . The standard error of can be
calculated as…..
Where Yt and Xt are the sample observation of dependent and independent variables and “t”
is the number of years. The “e” is the error term or “n” is the number of observation in
estimation of regression line; “k” is the number of estimated coefficients in the regression.
The value of “n-k” is called degree of freedom (df).
T-Test
After calculation of the ratio of we have to calculate the ratio of . This is called the
/opt/scribd/conversion/tmp/scratch0/24151088.doc -5-
Managerial Economics in Global Economy (Regression Analysis)
t-statistic or t-ratio. We use t-statistic to check the relationship between two variables.
Two types of hypothesis are used in T-Test that are…
H 1 = There is a relationship between two variables \
From statistical table of the book Managerial Economics In Global World at page no 610 we
get the value of t using 8 as a degree of freedom, which is 2.306. The level of significance is
0.05. In this equation the value of “t” that we have calculated lies in the acceptance area. So,
we accept the H 1 which indicates that there is relationship between advertisement and sales.
-2.306
2.306
0 t
-t
Between these negative “-t ” and “ t ” the area is rejection, means that if the value of “t”
falls in this area its mean there is no relationship between the advertisement and sales. The
value that we have calculated lies in out side of the rejection area which reflects that there is
relationship between sales and advertisement expenditures.
Coefficient of Determination
/opt/scribd/conversion/tmp/scratch0/24151088.doc -6-
Managerial Economics in Global Economy (Regression Analysis)
Years X Y Ŷ (Ŷ-Y) 2 (Y-Y) 2
Table 0.4
1 20 88 85.86 199.9396 144
4 24 92 99.99 1.04 64
5 22 96 92.92 50.1264 16
R =
2
=
Total Variation ∑ (Yt − Y )2
1500.08
= = 0.8523
1760
This means that 85% total variation in sales is due to advertisement expenditures. We can graphically
represent this affect with the help of SPSS, which is presented in graph no 0.2.
/opt/scribd/conversion/tmp/scratch0/24151088.doc -7-
Managerial Economics in Global Economy (Regression Analysis)
Graph 0.2
The above graph is showing the best fit for a straight linear regression line. The
line shows that there is perfect positive relationship between sales and
advertisement expenditures. The value of R2 linear (0.85) is equal to the value of
That we have calculated above with the help of this formula.
R =
2 Explained Variation
=
∑ (Y− Y ) 2
Total Variation ∑ (Y − Y )
t
2
/opt/scribd/conversion/tmp/scratch0/24151088.doc -8-
Managerial Economics in Global Economy (Regression Analysis)
Test of significance
∑ t
(Y − Y ) 2
= ∑ (Y− Y ) 2
+ ∑ t t
(Y − Y) 2
r= 0.8523
r= 0.9232
Model Summary
Model R R Adjusted R Std. Error of the
Square Square Estimate
a
1 .923 .853 .810 6.08824
a. Predictors: (Constant), promotions, advertisement
a
Model Summary show the value of R square is 923 reflect the multiple correlation coefficients. And the
value of Adjusted R Square is. 81. It indicates that 81% of variance in the dependent variable (sales) can
be predicted from independent variable.
/opt/scribd/conversion/tmp/scratch0/24151088.doc -9-
Managerial Economics in Global Economy (Regression Analysis)
ANOVAb
Model Sum of df Mean F Sig.
Squares Square
1 Regression 1500.533 2 750.267 20.241 .001a
Residual 259.467 7 37.067
Total 1760.000 9
a. Predictors: (Constant), promotions, advertisement
b. Dependent Variable: sales
The ANOVAb table shows that the F = 20.24 which is statistically significant where the Sig. Level is
. 001 which is less than 0.05 so we will accept H1 which mean that predictors are significantly combine
together to predict dependent variable (sales).
Coefficient
Coefficient table shows the standardized beta coefficient which are interpreted like correlation. The t
value and sig. values indicates whether that variable is significantly contributing to the equation for
predicting dependent variable (sales) in this case the advertisement and promotion are significantly adding
predictions.
So, we can obtain multiple regression equations from this table of coefficients.
/opt/scribd/conversion/tmp/scratch0/24151088.doc - 10 -
Managerial Economics in Global Economy (Regression Analysis)
As we know that the regression equation is
Multicollinearity: It occurs when there are high intercorrelations among some set of predictor variables.
In other words Multicollinearity happens when two or more predictors contain the overlapping information.
/opt/scribd/conversion/tmp/scratch0/24151088.doc - 11 -
Managerial Economics in Global Economy (Regression Analysis)
Most commonly heteroskedasticity takes the form of changes in variance with the magnitude of X. That is,
in
y = Xb + e
that the e's vary in magnitude with the X's. (An example is that variance of income across individuals is
systematically higher for higher income individuals.)
http://economics.about.com/cs/economicsglossary/g/heteroskedastic.htm
Autocorrelation: Usually autocorrelation means correlation among the data from different time periods.
Spatial autocorrelation means correlation among the data from locations. There could be many dimensions
of spatial autocorrelation, unlike autocorrelation between periods.
http://economics.about.com/b/2007/01/25/economics-term-of-the-afternoon-spatial-autocorrelation.htm
/opt/scribd/conversion/tmp/scratch0/24151088.doc - 12 -