Professional Documents
Culture Documents
1. Concept of Correlation
Perfect
positive
Weak
positive
No
corr
Strong
negative
Weak
negative
No
corr
Perfect
negative
Source: Wikipedia
(xi,yi) ; i = 1, 2, , n
where xi = measured value of variable 1 for individual i
yi = measured value of variable 2 for individual i
BUT
Zero correlation does not necessarily imply independence (as X
and Y could have a non-linear relationship ... see previous slide.)
Independence
Zero
correlation
[it is like an unscaled correlation] ... not really a useful measure (yet).
S xy
r=
r is sometimes referred
as the Pearson sample
correlation coefficient
S xx S yy
10
(weight / age)
n
n
n
2
2
SS for X S xx = ( xi x ) = xi xi
i =1
i =1
i =1
i =1
i =1
sy
nx 2 = ( n 1) S x2
2
i
or
ny 2 = ( n 1) S y2
i =1
S xy = ( xi x )( yi y ) = xi yi xi yi n
i =1 i =1
i =1
i =1
n
SCP
i =1
2
i
i =1
SS for Y S yy = ( yi y )2 = yi2 yi n
n
or
or
x y nxy
i
i =1
1.
2.
3.
4.
5.
6.
1
6
3
2
4
5
n=6
6
18
13
9
12
14
x = 3.5
sx 1.871
S xx = 17.5
y = 12
s y 4.147
S yy = 86
S xy = xi yi nxy
= 289 6(3.5)(12) = 37
xx
= (n 1) * s
2
x
S yy = (n 1) * s
S xy = xi yi nxy
r=
2
y
11
S xy
S xx S yy
37.0
0.9537
17.5 86.0
12
H0: = 0
H1: 0
tobs =
r n2
1 r2
r
est.se ( r )
Problem:
r is an index between -1 and 1, randomly distributed about , so
H0: = 0 vs H1: 0
tobs =
If = 0,
0 the distribution of r (the random variable) will be
approximately Normal (but truncated at -1 and 1)
0.9537 6 2
1 0.9537 2
df = 6-2 = 4
The further is from 0, the more skewed the distribution of r will
be, so we can only test specifically for = 0 (and no other values).
13
at = 0.05
6.342
14
1 r2 )
WHAT?????
16
http://io9.com/on-correlation-causation-and-the-real-cause-of-auti-1494972271
19
18