Professional Documents
Culture Documents
University of Rochester
Fall 2008
Homework: Chapter 10
Text Problems: 10.1, 10.5, 10.8, 10.19, 10.24
10.1.
on a
cardinal scale.
They usually denote the possession or nonpossession of an attribute,
such as nationality, religion, sex, color, etc.
(c) Regression models in which explanatory variables are qualitative
are known as ANOVA models.
(d) Regression models in which one or more explanatory variables
are quantitative, although others may be qualitative, are known as
ANCOVA models.
(e) In a regression model with an intercept, if a qualitative variable
has m categories, one must introduce only (m 1) dummy variables.
If we introduce m dummies in such a model, we fall into the dummy
variable trap, that is, we cannot estimate the parameters of such
models because of perfect (multi)collinearity.
(f) They tell whether the average value of the dependent variable
varies from group to group.
(g) If the rate of change of the mean value of the dependent variable
varies between categories, the differential slope dummies will point
that out.
10.5.
(a) False. Letting D take the values of (0, 2) will halve both the
estimated B2 and its standard error, leaving the t ratio unchanged.
(b) False. Since the dummy variables do not violate any of the
assumptions of OLS, the estimators obtained by OLS are unbiased
in small as well as large samples.
10.8.
D1
and
D2
are individually
statistically significant.
(k) That seems to be the case in quarters one and two. Among other
things, coffee prices and weather may have something to do with the
observed seasonal pattern in these two quarters.
B1 B2 ln P
B6 D1
B3 ln I
B4 ln P B5 T
B7 D2
B8 D3
B9 ( D1 ln P ) B10 ( D2 ln P ) B11 ( D3 ln P )
Other Problems
1.
Suppose you have been hired by a union that wants to convince workers in local dry-cleaning
establishments that joining the union will improve their well-being. As your first assignment, your
boss asks you to build a model of wages for dry-cleaning workers that measures the impact of union
membership on those wages. Your first equation (standard errors in parentheses) is:
Wi = -11.40 + 0.30Agei 0.003Agei2 + 1.00Edui + 1.20Di
se
(0.10)
(0.002)
(0.20)
(1.00)
n = 34 R2 = .14, F= 24.2
Interpret the regression results. How do the signs compare with your expectations?
b.
Using a two-sided t-test at the 5% significance level determine if the coefficients on the independent
variables are statistically significant?
tk
bk Bk
with n k degrees of freedom
se(bk )
What relationship between A and W does the above result imply? Why doesnt the inclusion of A and
A2 violate the classical assumption of no perfect multicollinearity?
Wage
Age
This does not violate the assumption of no perfect multicollinearity because Age and Age 2 represents a
nonlinear relationship.
d.
On the basis of the regression results, should the workers be convinced that joining the union will
improve their well-being? Why of why not?
No, because the coefficient on the dummy variable is not statistically significant.
e.
Test the hypothesis that the coefficients are jointly significant at the 5 percent level.
H0: B2 = B3 = B4 = B5 = 0 and H1: B2 B3 B4 B5 0
Or
H0: R2 = 0 and H1: R2 0
The calculated F-stat is 24.2 which is greater than the F-critical value with 4 d.f. in the numerator and 29
d.f. in the denominator and = 0.05 of 2.69 (I used 30 d.f. in the denominator), therefore reject H0 and
conclude that the variables are jointly statistically significantly different from zero.
2. Suppose you want to estimate the effect of gender and education on wages using the following
regression:
ln (wage) = B1 + B2D1i + B3(educationi) + B4(educationi*D1i) + ui
Where
ln(wage) = the natural log of the hourly wage of the ith individual
D1 = 1 if female, 0 otherwise
Education = years of education
You have a sample of 274 men and 252 women which is 526 observations, the average wage for men in the
sample is $7.10/hr and for women the average wage is $4.59/hr.
The results of the regression are:
ln(wage) = .389 - .227D1i + .082(educationi) -.0056(educationi*D1i)
(se)
(0.023)
(0.008)
(0.0014)
n = 526
a.
Adjusted R2 = .441
Using a two-sided t-test (at the 1% significance level) determine if the coefficients on the
independent variables are statistically significant?
tk
bk Bk
with n k degrees of freedom
se(bk )
Therefore reject H0 and conclude that all of the coefficients are statistically significantly different from
zero.
b.
Do the results indicate a difference in average wages between males and females holding
education constant? How do you know?
Yes, because the coefficient on the dummy variable is statistically significantly different from zero.
c.
What is the return to an additional year of education for males? What is the return to an additional
year of education for females? Is the difference statistically significant? How do you know?
For males the return to education is the slope coefficient on education 0.082.
For females the return to education is the slope coefficient in education minus the differential slope
coefficient = 0.082 0.0056 = 0.0764.
Yes, the difference is statistically significant because the differential slope coefficient is statistically
significantly different from zero.
d.
Wage
Male
Female
Edu
3. Use the Crime Data from my website, which has arrest information on 500 men in California in 1986,
to estimate the following LPM of arrests
Arrestedi = b1 + b2PPCi + b3Empi + ei
Where
Arrestedi = 1 if the ith man was arrested in 1986, 0 otherwise
PPCi = the proportion of prior arrests that led to conviction for the ith man
Empi = the number of quarters the ith man was employed in 1986
a.
What are your expectations for the sings on PPC and EMP? Why?
If incarceration works than PPC should be negative, but if incarceration does not work and we have a
class of career criminals than PPC will be positive.
We would assume that if the individual were employed they are less likely to be arrested.
b.
Regression Statistics
R Square
Adjusted R Square
Standard Error
Observations
0.021
0.017
0.448
500
ANOVA
df
Regression
Residual
Total
Intercept
Proportion of Prior Convictions
Number of Quarters Employed
c.
2
497
499
SS
2.149
99.953
102.102
Coefficients
0.398
-0.110
-0.031
Standard
Error
0.040
0.059
0.012
MS
1.074
0.201
t Stat
9.934
-1.870
-2.590
F
5.343
Pvalue
0.000
0.062
0.010
Test whether the coefficients on PPC and EMP are statistically significantly different from zero at
the 10 percent level (two-tailed).
Both calculated t-statistics above are greater than the t-critical value of 1.645, therefore reject the null
and conclude that they are both statistically significantly different from zero.
d.
These results indicate that an increase in the proportion of prior convictions by one percent reduces the
probability of being arrested by 11 percent.
If we increase the number of employed quarters by 1, the probability of arrest falls by 3 percent.
Significance
F
0.005