Professional Documents
Culture Documents
Introduction to Econometrics
Professor Woodbury
Fall Semester 2015
Simple Regression (Regression with One Regressor)
1. Introduction to linear regression
2. Defining the linear regression model
3. Estimating the linear regression model method of moments
4. Algebraic Properties and measures of fit of OLS
5. Sampling distribution of the OLS estimator
6. Hypothesis testing and confidence intervals for 0 and 1
Like
o
that is, E( 1 ) = 1
10
= 698.9 2.28STR
11
One of the districts in the data set is Antelope, CA, for which STR
= 19.33 and Test Score = 657.8
13
14
Number of obs =
F(
1,
420
418) =
19.26
Prob > F
0.0000
R-squared
0.0512
Root MSE
18.581
------------------------------------------------------------------------|
testscr |
Robust
Coef.
Std. Err.
P>|t|
--------+---------------------------------------------------------------str |
-2.279808
.5194892
-4.39
0.000
-3.300945
-1.258671
_cons |
698.933
10.36436
67.44
0.000
678.5602
719.3057
-------------------------------------------------------------------------
16
17
Explains variable
in terms of variable
Intercept
Dependent variable,
explained variable,
response variable,
Slope parameter
Independent variable,
explanatory variable,
regressor,
Error term,
disturbance,
unobservables,
engage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole o
18
Dependent variable
Independent variable
Explained variable
Explanatory variable
Response variable
Control variable
Predicted variable
Predictor variable
Regressand
Regressor
20
E(Y|X) = 0 + 1X
(or Yi = 0 + 1Xi )
21
22
23
24
Regression Model
Omitted
variables
create
for regression
!
Interpretation
of the
simpleproblems
linear regression
model
analysis
Studies how
as long as
The simple linear regression model is rarely applicable in practice but its discussion is useful for pedagogical reasons
2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
25
!
!
Rainfall,
land quality,
Rainfall,
presence
land
quality,of parasites,
presence of parasites,
2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in who
2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or
26
27
Interpretation of 1
!
1 = Y / X
if u = 0
E(u | X) = 0
28
Meaning of E(u | X) = 0
29
Again
E(ability | education = 16 years) >
E(ability | education = 12 years)
means E(u | X) is violated
1 will estimate the additional earnings resulting from the
combination of (a) an increase schooling and (b) the added
ability that goes with the additional schooling
If will not identify a clean or pure return to one more year
of education
Footnote: corr(u,X) = 0 is not enough (although it is a good start)
because correlation is measure only of linear association
30
e.g. intelligence
e.g. intelligence
For
example, with a wage equation, the independence assumption
The conditional mean independence assumption is unlikely to hold because
e conditional
mean independence
assumption
is unlikely
to hold
because
more
education
will
also be
more
intelligent
on average.
is individuals
unlikely
towith
hold
because
individuals
with
more
education
will
ividuals with more education will also be more intelligent on average.
also be more intelligent (or more able) on average
2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in
ning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
31
32
33
u i = Yi Yi
u i = Yi ( 0 + 1X i )
because the fitted or predicted value of Yi is:
Yi = 0 + 1X i
Here is a picture
34
given by (2.17) and (2.19). The name ordinary least squares comes from the fact that
these estimates minimize the sum of squared residuals.
F I G U R E 2 . 4 Fitted values and residuals.
y
yi
y 5 0 1 1x
yi 5 fitted value
y 1
y1
x1
xi
i 5 residual
35
36
38
39
Y = b0 + b1X
so
b0 = Y b1X
so we have our estimator of 0
40
b =
1
n
i=1
(X i X )(Yi Y )
2
(X
X
)
i=1 i
n
and
So what?
This matters for a couple of reasons
First, it gives you (or Stata) a way to actually compute b0 and b1
Second, the formula for b1 says that
b1 = cov(Y, X) / var(X)
!
= sXY / s2X
42
43
E(salary|roe) 5
roe
roe
963.191
44
45
46
u i = Yi Yi = Yi ( 0 + 1X i ) (this is a residual)
The first line (the fitted value) can also be written:
47