Professional Documents
Culture Documents
&
REGRESSION
ANALYSIS
Correlation
Wt. 67 69 85 83 74 81 97 92 114 85
(kg)
SBP 120 125 140 160 130 180 150 140 200 130
mmHg)
Wt. 67 69 85 83 74 81 97 92 114 85
SBP(mmHg) (kg)
SBP 120 125 140 160 130 180 150 140 200 130
(mmHg)
220
200
180
160
140
120
100
80 wt (kg)
60 70 80 90 100 110 120
200
180
160
140
120
100
80
Wt (kg)
60 70 80 90 100 110 120
negative relationship
no relationship
Positive Relationship
18
16
14
12
Height in CM
10
0
0 10 20 30 40 50 60 70 80 90
Age in Weeks
Negative Relationship
Reliability
Age of Car
No Relation
Correlation Coefficient
If r = l = perfect correlation.
How to compute the Simple Correlation
Coefficient (r)
xy x y
r n
( x) 2
( y)
2
x
2
. y
2
n n
Example I:
A sample of 6 children was selected, data about their
age in years and weight in kilograms was recorded as
shown in the following table . It is required to find the
correlation between age and weight.
Age Weight
Serial No (years) (Kg)
1 7 12
2 6 8
3 8 12
4 5 10
5 6 11
6 9 13
These 2 variables are of the quantitative type, one
variable (Age) is called the independent and
denoted as (X) variable and the other (weight)
is called the dependent and denoted as (Y)
variables to find the relation between age and
weight compute the simple correlation coefficient
using the following formula:
xy x y
r n
x
2
( x) 2
. y
2
( y) 2
n n
Age Weight
Serial
(years) (Kg) xy X2 Y2
No.
(x) (y)
1 7 12 84 49 144
2 6 8 48 36 64
3 8 12 96 64 144
4 5 10 50 25 100
5 6 11 66 36 121
6 9 13 117 81 169
Total ∑x= ∑y= ∑xy= ∑x2= ∑y2=
41 66 461 291 742
41 66
461
r 6
(41)
2
(66)
2
291 .742
6 6
r = 0.759 (positive)
It indicates a strong direct correlation
EXAMPLE II: Relationship between Anxiety and
Test Scores
Anxiety Test X2 Y2 XY
(X) score (Y)
10 2 100 4 20
8 3 64 9 24
2 9 4 81 18
1 7 1 49 7
5 6 25 36 30
6 5 36 25 30
∑X = 32 ∑Y = 32 ∑X2 = 230 ∑Y2 = 204 ∑XY=129
Calculating Correlation Coefficient
r = - 0.94 (negative)
It indicates an Indirect strong correlation
Calculating Coefficient of
Determination
0.576 = 0.759 2= r2
6 (di) 2
rs 1
n(n 1)
2
∑ di2=64
6 64
rs 1 0.1
7(48)
Comment:
There is an indirect weak correlation
between level of education and income.
Regression Analyses
200
180
160
140
120
100
80
Wt (kg)
60 70 80 90 100 110 120
By using the least squares method (a procedure that
minimizes the vertical deviations of plotted points
surrounding a straight line) we are
able to construct a best fitting straight line to the scatter
diagram points and then formulate a regression
equation in the form of:
ŷ a bX
x y
ŷ y b(x x) xy
n
bb1
( x) 2
x n
2
Regression Equation
Slope
SBP(mmHg)
220
200
180
160
140
120
100
80
Wt (kg)
60 70 80 90 100 110 120
Linear Equations
Y
ŷY = bX
a +bX
a
Change
b = Slope in Y
Change in X
a = Y-intercept
X
Hours Studying and Grades
Regressing grades on hours
Linear Regression
90.00 Final grade in course = 59.95 + 3.17 * study
R-Square = 0.88
Final grade in course
80.00
70.00
41 66
461
b 6 0.92
2
(41)
291
6
Regression equation
x n
2 41678
20
ŷ =112.13 + 0.4547 x
For age 25
B.P = 112.13 + 0.4547 * 25=123.49 = 123.5 mm hg
For age 87
B.P = 112.13 + 0.4547 * 87= 151.69 mm hg
Comment on the reliability of the
estimates (See previous example!)
Comment on coefficient of
determination (see previous
example!)