Professional Documents
Culture Documents
Contents
1. Introduction
2. Answer to Question 1
3. Answer to Question 2
4. Answer to Question 3
5. Answer to Question 4
6. Answer to Question 5
10
7. Answer to Question 6
11
8. Answer to Question 7
12
9. Answer to Question 8
14
16
19
Page 1 of 20
MCP 1607
1. Introduction
Quantitative Analysis is a vital part of any problem solving, be it from Engineering, Economics,
Logistics or Business Analysis. It enables us to glean information from statistical and other
analytical methods by which we can deduce much information that are otherwise not easily seen.
In business analysis, it is vital to extract information about correlation of data, i.e, whether the
sales quantities are related to weather, marketing poll results, changes in the economic landscape
or even changes in the political arena. To this end, it is necessary to perform regression analysis
before we can proceed any further. This assignment is focused on this area of data analysis/
The following sections comprise the answers to this assignment. This is a part of assignments for
Cohort No. 10.
Page 2 of 20
MCP 1607
The following is the data table that is referred to in the ensuing answers.
Machine
Age
Number
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
(Years)
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
Fuel
Consumption
(Ltr)
22
61
42
222
110
51
153
93
288
30
26
81
143
187
37
1. Plot the data on graph paper, consider age of machine as X and fuel consumption as Y
Page 3 of 20
MCP 1607
Page 4 of 20
MCP 1607
n. xy x y
(1)
[n x 2 ( x )2 ].[n y 2 ( y ) 2 ]
Machine
Age-x
Fuel Consumptiony
Number
(Years)
(Ltr)
xy
x2
y2
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
22
61
42
222
110
51
153
93
288
30
26
81
143
187
37
44
451.4
222.6
3996
1265
326.4
2187.9
948.6
6048
102
75.4
753.3
1959.1
3029.4
170.2
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
22
61
42
222
110
51
153
93
288
30
26
81
143
187
37
4
54.76
28.09
324
132.25
40.96
204.49
104.04
441
11.56
8.41
86.49
187.69
262.44
21.16
484
3721
1764
49284
12100
2601
23409
8649
82944
900
676
6561
20449
34969
1369
xy
x2
y2
1911.34
249880
21579.3
Page 5 of 20
146.2
1546
MCP 1607
Where
15
(15*21579.3-146.2*1546)/SQRT((15*1911.34-146.2*146.2)*(15*249880-1546*1546))
r=
0.981160898
0.9812
y = a + bx
+ .
..(2)
used to derive the line (in a linear sense) of best fit, the coefficients are derived as
=
.( . ) .
.
( )
..(3)
..(4)
Page 6 of 20
MCP 1607
Machine
Age-x
Number (Years)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
Fuel
Cons.-y
(Ltr)
xy
x2
y2
22
61
42
222
110
51
153
93
288
30
26
81
143
187
37
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
_
22
61
42
222
110
51
153
93
288
30
26
81
143
187
37
_
44
451.4
222.6
3996
1265
326.4
2187.9
948.6
6048
102
75.4
753.3
1959.1
3029.4
170.2
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
22
61
42
222
110
51
153
93
288
30
26
81
143
187
37
4
54.76
28.09
324
132.25
40.96
204.49
104.04
441
11.56
8.41
86.49
187.69
262.44
21.16
484
3721
1764
49284
12100
2601
23409
8649
82944
900
676
6561
20449
34969
1369
xy
x2
y2
1911.34
249880
x
y
9.746667 103.0667
21579.3
146.2
1546
Page 7 of 20
MCP 1607
4.
Page 8 of 20
MCP 1607
Page 9 of 20
MCP 1607
=1
..(5)
Where
sse
3379.211028
and
sst
=
=
90538.933333
Therefore, from eq. (5), the coefficient of determination is (rounded to 4 decimal places)
= 0.9627
Page 10 of 20
MCP 1607
Machine
Age-x
Fuel Cons.y
Number
(Years)
(Ltr)
1
2
2
7.4
22
61
3
4
5.3
18
Residuals
x
xy
x2
2
7.4
22
61
44
451.4
4
54.76
42
222
5.3 42
18 222
222.6
3996
28.09
324
11.5
110
11.5 110
6.4
51
7
8
9
10
11
14.3
10.2
21
3.4
2.9
12
9.3
81
13
13.7
143
14
15
16.2
4.6
6.4
51
1265 132.25
326.4
40.96
81
753.3
86.49
Page 11 of 20
y2
MCP 1607
)
15
1.56319401867222E-14
b) Draw histogram of the residuals and comment on the results (use three class intervals)
34.2891253155987
Minimum
-16.5378910201407
Span
50.8270163357394
-16.5378910201407
0.40444775843908
Range-2
0.40444775843908
17.3467865370189
Range-3
17.3467865370189
34.2891253155987
Frequency
9
4
2
0
Page 12 of 20
MCP 1607
Page 13 of 20
MCP 1607
Remarks:
1. The residuals do not show a normal distribution ( This violates the 1st assumption of regression
analysis).
2. Therefore, the linear regression model we selected is incorrect (not suitable) to represent and
analyze this data set.
08. Draw another graph with x as machine number and y as residual and comment on the
result.
Page 14 of 20
MCP 1607
Page 15 of 20
MCP 1607
Remarks:
The residuals do not seem to follow any pattern and seem to be independent of the
Machine Numbers.
09. Calculate the correlation coefficient between x (value of age) and residual and comment
on the result.
Machine
Age-x
Residualy
Number (Years)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
22.63509
-10.6527
-1.54079
8.449015
-16.5379
-7.26608
-11.0205
-16.1353
34.28913
11.89381
14.58713
-16.0873
-12.9885
-2.45505
2.829855
xy
x2
y2
45.27
-78.83
-8.166
152.08
-190.2
-46.5
-157.6
-164.6
720.07
40.439
42.303
-149.6
-177.9
-39.77
13.017
2
7.4
5.3
18
11.5
6.4
14.3
10.2
21
3.4
2.9
9.3
13.7
16.2
4.6
22.635
-10.65
-1.541
8.449
-16.54
-7.266
-11.02
-16.14
34.289
11.894
14.587
-16.09
-12.99
-2.455
2.8299
4
54.76
28.09
324
132.25
40.96
204.49
104.04
441
11.56
8.41
86.49
187.69
262.44
21.16
512.35
113.48
2.374
71.386
273.5
52.796
121.45
260.35
1175.7
141.46
212.78
258.8
168.7
6.0273
8.0081
xy
x2
y2
4.64E-12
Page 16 of 20
146.2
2.34E-13
1911.34 3379.211
MCP 1607
r=
r=
1.83514E-15
Note: Please refer next page for the plot of the residuals vs. machine age.
Remarks:
1. The Correlation coefficient of zero means that the residuals are not linearly correlated with
machine age. Therefore, the residuals and the independent variables are independent of each
other.
2. Also, they are randomly distributed about their mean, 0. This means that it obeys the 2nd
assumption of regression analysis.
3. Therefore, our model of regression is correct.
4. However, from the plot given in the next page, it is obvious that they are non-linearly
correlated
5. Therefore, our linear regression model is incorrect
6. We should apply a non-linear regression model to analyze this set of data
Page 17 of 20
MCP 1607
Page 18 of 20
MCP 1607
10.
Learning
Outcome/ Question
Maximum Weightage
Question 1
Question 2
Question 3
Question 4
Question 5
Question 6
Question 7
Question 8
Question 9
Total Marks
100%
Page 19 of 20
First Marker
Second Marker
MCP 1607
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
_________________________________________________________________________________
Page 20 of 20