You are on page 1of 8

Professor Schneider PLS 802

324B S. Kedzie Hall Spring 2009


sks@msu.edu
517/355-7682

REGRESSION ANALYSIS
This course provides an introduction to the theory, methods, and practice of regression analysis.
The goals are to provide students with the skills that are necessary to: (1) read, understand, and
evaluate the professional literature that uses regression analysis; (2) design and carry out studies
that employ regression techniques for testing substantive theories; and (3) prepare to learn about
more advanced statistical procedures.
The course will not dwell on statistical theory, but it will also not take a superficial approach.
Instead, it will focus on: The utility of regression analysis for evaluating empirical relationships
between variables as a critical component of the theory-testing process. We will thoroughly
cover the basic elements of the regression model and the development of the regression
estimators. We will see that this model depends very heavily on several assumptions. Therefore,
we will examine these assumptions in detail, considering why they are necessary, whether they
are valid in practical research situations, and the consequences of violating them in particular
applications of the regression techniques. These formal, analytic treatments will be
counterbalanced by the use of frequent substantive examples and class exercises. Again, the
overall course objective is not to turn you into a statistician– instead, the aim is to maximize
your research skills as a political scientist.
Course Prerequisites: Any course of this type must assume a working knowledge of
elementary statistical concepts and techniques. We will conduct a brief review at the beginning
of the course, but students must be familiar with such ideas as descriptive statistics, sampling
distributions, statistical inference, confidence intervals, and hypothesis testing, before moving on
to the more complicated matters that will comprise the majority of the course material. You must
have completed at least one prior course in introductory statistics course– i.e., PLS 801 or the
equivalent.
Course Requirements: Formal course requirements are as follows: (1) Class attendance and
active participation. This is mandatory. Statistical knowledge is cumulative, and gaps in the
material will have detrimental consequences. (2) Completion of homework assignments. Most of
these are computer-based data analysis exercises, designed to familiarize you with the
application of various concepts and techniques. Each of these assignments will focus on a
specific set of topics. However, the latter assignments are cumulative in the sense that they build
upon earlier material in the class. Homework assignments will be given frequently (about once a
week or so). They will not be assigned grades, but they will be checked for completion and
comments will be provided to make sure that you fully understand the material. (3) Two
examinations. A midterm exam will be given in class on Tuesday, March 3; the final will be a
take-home exam, due on Tuesday, May 5, 2009, at 5:00 p.m. (4) Regression Analysis Paper. For
this paper, you are expected to use regression analysis to examine a political science research
question. You should identify a suitable topic and focus for the paper by Friday, March 14, 2009.
Before you select the topic, I strongly encourage each of you to discuss your paper ideas with
me. Final papers are due on or before 5:00 p.m. on Friday, May 1, 2009. I reserve the right to
penalize papers received after the May 1, 2009 deadline.
PLS 802, Spring 2009
Page 2

Assignment of Final Grades:


Homework Assignments & Class Participation 20 %
Midterm Examination 25 %
Regression Analysis Paper 25 %
Final Examination 30 %

Textbooks:
The following are the required texts for the course:
Berry, William D., and Stanley Feldman. 1985. Multiple Regression in Practice. Beverly
Hills, CA: Sage Publications.
McClendon, McKee. 1994 (reissued 2002). Multiple Regression and Causal Analysis.
Prospect Heights, IL: Waveland Press.

The following books are useful recommended books for more detailed, comprehensive coverage
of the material along with explicit derivations of statistical concepts:
Gujarati, Damodar N. 2003. Basic Econometrics (Fourth Edition). Boston, MA: McGraw-
Hill.
Fox, John. Regression Diagnostics.1991. Beverly Hills, CA: Sage Publications
Kennedy, Peter. 2008. A Guide to Econometrics (Sixth Edition), Malden, MA: Black
Publishing, Inc.
Wooldridge, Jeffrey M. 2006. Introductory Econometrics: A Modern Approach (Third
Edition). Mason, OH: Thomson South-Western.

The following books are useful supplemental books for more basic explanations of key terms
and concepts:
Berry, William D. 1993. Understanding Regression Assumptions. Beverly Hills, CA: Sage
Publications.
Lewis-Beck, Michael. 1980. Applied Regression. Beverly Hills, Sage Publications.
Schroeder, Larry D., David L. Sjoquist, and Paula E. Stephan. 1986. Understanding
Regression Analysis: An Introductory Guide. Newbury Park: Sage Publications.

You should read all the material assigned in the required texts. Most of the recommended and
supplemental books are either too advanced or elementary to be used as central texts in this
course. However, several of them are very good and would be extremely useful books for you to
rely upon for greater detail or additional explanations at various points in the course.
PLS 802, Spring 2009
Page 3

Computing and Software: Computers and statistical software are absolutely necessary for
employing modern statistical techniques in an effective manner. Therefore, they will be closely
integrated into the course material. We will use STATA for most of the class examples,
assignments, and examinations. But, you can also use other statistical software in this course
(e.g., R, SAS, SPSS, SYSTAT, etc.), as long as it has the analytical routines and capacities that
are required to complete the assignments and examinations.

Topics and Reading Assignments

I. Introduction to Regression Analysis

Reading: McClendon, pp. 1-19

Gujarati, pp. 15-32


Kennedy, pp. 1-10
Wooldridge, pp. 1-19

II. Preliminary Material and Statistical Review

A. Frequency Distributions, Univariate Summary Statistics, Probability


Distributions

Reading: McClendon, pp. 20-25

B. Statistical Inference and the Properties of Statistical Estimators

1. Confidence Intervals & Hypothesis Tests

2. Differences Between Two Means, Two Variances, Etc.

III. Basic Concepts for Understanding Regression Analysis: Functional


Dependence, Linear Transformations, and Linear Combinations

Reading: McClendon, pp. 25-28

Wooldridge, pp 707-802
PLS 802, Spring 2009
Page 4

IV. The Bivariate Regression Model

A. Introduction: Basic Ideas and Concepts

Reading: McClendon, pp. 28-30

Berry, pp. 1-22


Gujarati, pp. 37-57

B. The Least Squares Criterion and Estimation in the Bivariate Regression


Model

Reading: McClendon, pp.31-41


Berry and Feldman, pp. 9-12

Gujarati, pp. 58-80


Kennedy, pp. 11-59
Wooldridge, pp. 50-66, 89-95, 106-109, 123-126, 176-181, 187-190

C. Goodness of fit, the Correlation Coefficient and R2


Reading: McClendon, pp. 42-49

Gujarati, pp. 79-94


Schroeder, Sjoquist, and Stephan, pp. 23-29
D. Assumptions Underlying the Bivariate Linear Regression Model

Reading: McClendon, pp. 133-146


Berry and Feldman, pp. 9-12

Gujarati, pp. 65-74; 107-110


Kennedy, pp. 11-59
Wooldridge, pp. 50-66, 89-95, 106-109, 123-126, 176-181, 187-190

E. Statistical Inference, Confidence Intervals, and Hypothesis Tests

Reading: McClendon, pp. 147-154

Lewis-Beck, pp. 26-47


Schroeder, Sjoquist, and Stephan, pp. 36-53
Gujarati, pp. 119-163
Kennedy, pp. 51-90
Wooldridge, pp. 126-147
PLS 802, Spring 2009
Page 5

F. Summary, Extensions, and a Preliminary Look at Residuals, Outliers,


and Influential Cases

Reading: McClendon, pp. 49-59

Berry, pp. 22-88


Gujarati, pp. 164-193

V. The Multiple Regression Model

A. Introduction: Notation, Assumptions, and Interpretation

Reading: McClendon, pp. 60-80


Berry and Feldman, pp. 9-18

Gujarati, pp. 202-211; 230-233


Wooldridge, pp. 73-88

B. Measures of Goodness of Fit

Reading: McClendon, pp. 80-83

Gujarati, pp. 211-225


Schroeder, Sjoquist, and Stephan, pp. 32-36

C. Statistical Inference and the Role of Hypothesis Testing

Reading: McClendon, pp. 133-174


Berry and Feldman, pp. 12-18

Gujarati, pp. 248-273


Kennedy, pp. 60-80
Wooldridge, pp. 147-167, 214-218

D. Models of Substantive Phenomena; The Importance of Model


Assumptions

Reading: McClendon, pp. 83-93, 154-157

Berry, pp. 1-24


Lewis-Beck, pp. 63-66
PLS 802, Spring 2009
Page 6

E. Summary and a Brief Look at Extensions

Reading: McClendon, pp. 93-118


(McClendon, pp. 119-132)

Gujarati, pp. 273-297

VI. Model Building in Multiple Regression Analysis

A. Model Specification

Reading: McClendon, pp. 288-321


Berry and Feldman, pp. 18-26

Berry, pp. 30-45


Gujarati, pp. 506-560
Kennedy, pp. 71-92
Lewis-Beck, pp. 30-45
Schroeder, Sjoquist, and Stephan, pp. 67-70

B. Functional Forms, Nonlinearity and Transformations

Reading: McClendon, pp. 230-270


Berry and Feldman, pp. 51-72

Berry, pp. 60-66


Gujarati, pp. 561-577
Kennedy, pp. 93-111
Schroeder, Sjoguist, and Stephan, pp. 58-61
Wooldridge, pp. 304-310

C. Nominal Independent Variables

Reading: McClendon, pp. 198-229; 271-287

Gujarati, pp. 297-334


Kennedy, pp. 248-258
Schroeder, Sjoquist, and Stephan, pp. 56-58
Wooldridge, pp. 230-252
PLS 802, Spring 2009
Page 7

VII. Potential Problems in Multiple Regression Analysis

A. Interpretation of Results

Reading: Fox, pp.3-5

B. Multicollinearity and Its Effects

Reading: McClendon, pp. 161-163


Berry and Feldman, pp. 37-50

Berry, pp. 24-27


Fox, pp. 10-21
Gujarati, pp. 341-370, 506-560
Kennedy, pp. 192-202
Lewis-Beck, pp. 58-63
Schroeder, Sjoquist, and Stephan, pp. 71-72
Wooldridge, pp. 101-105

C. Nonnormal and Nonconstant (Heteroscedastic) Errors

Reading: McClendon, pp. 174-197


Berry and Feldman, pp. 73-88

Berry, pp. 67, 72-81


Fox, pp. 40-53
Gujarati, pp. 387-428
Kennedy, pp. 133-139
Wooldridge, pp. 181-185

D. Measurement Error

Reading: Berry and Feldman, pp. 26-37

Berry, pp. 45-60


Gujarati, pp. 524-528
Kennedy, pp. 157-163
Schroeder, Sjoquist, and Stephan, pp. 70-71
Wooldridge, pp. 318-325
PLS 802, Spring 2009
Page 8

E. Residual Analysis, Outliers, and Influential Observations

Reading: Berry, pp. 27-29


Fox, pp. 21-40
Gujarati, pp. 518-524
Kennedy, pp. 372-388

VIII. Additional Topics

A. Dichotomous Dependent Variables

Reading: Gujarati, pp. 580-636


Schroeder, Sjoquist, and Stephan, pp. 79-80
Wooldridge, pp. 252-258

B. Simultaneous Equation Models

Reading: McClendon, pp. 288-347


Berry, pp. 1-54
Gujarati, pp. 715-791
Schroeder, Sjoquist, and Stephan, pp. 77-79

C. Nonindependent Disturbances and Time Series Models

Reading: Berry, pp. 67-72


Gujarati, pp. 792-865
Kennedy, pp. 139-156, 163-179
Schroeder, Sjoquist, and Stephan, pp. 72-75

You might also like