Professional Documents
Culture Documents
1 of 5 27/02/13 MEI
Section 2: Regression
Solutions to Exercise
1. (i)
(ii)
30
5
6
x ,
60
10
6
y
so the line of best fit passes through (5, 10).
Line of best fit shown above has gradient 2 and passes through the
origin, so the equation of the line is 2 y x .
(iii)
30 x ,
60 y ,
371 xy ,
2
192 x
30
5
6
x ,
60
10
6
y
30 60
371 71
6
xy
x y
S xy
n
2
2
2
30
192 42
6
xx
x
S x
n
The regression line is y a bx ,
where
71
1.69
42
xy
xx
S
b
S
and
71
10 5 1.55
42
a y bx
so the regression line is 1.55 1.69 y x
(iv) The two equations are reasonably close.
2.
75 x ,
90 y ,
1238 xy ,
2
1219 x
75
15
5
x ,
90
18
5
y
75 90
1238 112
5
xy
x y
S xy
n
2 4 6 8 10
4
8
12
16
Any reasonable line of
best fit passing through
(5, 10).
Answers to
this may vary
according to
the line drawn.
AQA S1 Correlation & regression 2 Exercise solutions
2 of 5 27/02/13 MEI
2
2
2
75
1219 94
5
xx
x
S x
n
The regression line is y a bx ,
where
112
1.19
94
xy
xx
S
b
S
and
112
18 15 35.9
94
a y bx
so the regression line is 35.9 1.19 y x
When x = 21, 35.9 1.19 21 10.9 y (3 s.f.)
3. (i)
36
36 4.5
8
x x
64
64 8
8
y y ,
(ii)
321.7 xy ,
2
204 x
36 64
321.7 33.7
8
xy
x y
S xy
n
2
2
2
36
204 42
8
xx
x
S x
n
The regression line is y a bx ,
where
33.7
0.802
42
xy
xx
S
b
S
and
33.7
8 4.5 4.39
42
a y bx
so the regression line is 4.39 0.802 y x
(iii) (a) When x = 3.5, 4.39 0.802 3.5 7.20 y
(b) When x = 11, 4.39 0.802 11 13.21 y
The prediction for x = 3.5 is probably reasonably accurate. However the
prediction for x = 11 should be treated with caution as it is outside the
range of the data.
4.
99
16.5
6
x ,
1239
206.5
6
y
99 1239
22532 2088.5
6
xy
x y
S xy
n
AQA S1 Correlation & regression 2 Exercise solutions
3 of 5 27/02/13 MEI
2
2
2
99
1833 199.5
6
xx
x
S x
n
The regression line is y a bx ,
where
2088.5
10.5
199.5
xy
xx
S
b
S
and
2088.5
206.5 16.5 33.8
199.5
a y bx
so the regression line is 33.8 10.5 y x
5. (i)
660
66
10
x ,
1270
127
10
y
660 1270
84123 303
10
xy
x y
S xy
n
2
2
2
660
43678 118
10
xx
x
S x
n
The regression line is y a bx ,
where
303
2.57
118
xy
xx
S
b
S
and
303
127 66 42.5
118
a y bx
so the regression line is 42.5 2.57 y x
(ii) When x = 64, 42.5 2.57 64 122 y
so the estimated weight is 122 pounds
6. (i)
500 x ,
1350 y ,
69092 xy
2
26888 x ,
2
184088 y
500 1350
69092 1592
10
xy
x y
S xy
n
2
2
2
500
26888 1888
10
xx
x
S x
n
2
2
2
1350
184088 1838
10
yy
y
S y
n
1592
0.855
1888 1838
xy
xx yy
S
r
S S
(3 s.f.)
(ii)
500
50
10
x ,
1350
135
10
y
The regression line is y a bx ,
AQA S1 Correlation & regression 2 Exercise solutions
4 of 5 27/02/13 MEI
where
1592
0.843
1888
xy
xx
S
b
S
and
1592
135 50 92.8
1888
a y bx
so the regression line is 92.8 0.843 y x
(iii) When x = 43, 92.8 0.843 43 129 y (3 s.f.)
7. (i)
(ii)
480 x
480
60
8
x
510
510 63.75
8
y y
34524 xy ,
2
33176 x
480 510
34524 3924
8
xy
x y
S xy
n
2
2
2
480
33176 4376
8
xx
x
S x
n
The regression line is y a bx ,
where
3924
0.897
4376
xy
xx
S
b
S
and
3924
63.75 60 9.95
4376
a y bx
so the regression line is 9.95 0.897 y x
(iii) Substitute the value x = 73 into the regression line equation and
calculate the value of y.
9.95 0.897 73 75.4 y
8. (i) The residual of a point on a scatter diagram is the vertical distance of the
point from the regression line.
AQA S1 Correlation & regression 2 Exercise solutions
5 of 5 27/02/13 MEI
(ii)
x 0.728x + 9.567 y Residual Square of residual
65 56.887 61 4.113 16.917
72 61.983 64 2.017 4.068
52 47.423 42 -5.423 29.409
67 58.343 64 5.657 32.002
55 49.607 53 3.393 11.512
59 52.519 50 -2.519 6.345
75 64.167 58 -6.167 38.031
54 48.879 46 -2.879 8.289
58 51.791 50 -1.791 3.208
49 45.239 49 3.761 14.145
(A) Sum of residuals = 0.162
(B) Sum of squares of residuals = 163.93
(iii) The sum of the residuals should be zero. The answer above is
different from this because of rounding errors from the equation of
the regression line.