You are on page 1of 3

Interpreting results of regression with interaction terms: Example

Table 12 shows that adding interaction terms, and thus letting the model take account of the differences between the countries with respect to birth year effects on education length, increases the R2 value somewhat, and that the increase in the models fit is statistically significant. Correspondingly, the model 2 part of table 13 shows that both the Polish and the British associations between birth year and education length are significantly different from the Norwegian one at the 5% level. The estimate of the Polish regression line slope indicates that it is a notch steeper than the Norwegian (0.097 + 0.02 = 0.117 as against 0.097), while the British line seems to be less steep (0.097 0.037 = 0.06), which can be seen from the negative sign of the estimate of the interaction terms coefficient. However, only the slope of the British regression line is significantly different from the Norwegian slope at the 1% level. Table 13 also shows that the mean Polish education length starts at a lower level than the Norwegian one in the older cohorts. (The country dummy is negative and statistically different from 0, which indicates that the Polish regression line cuts across the dependent variable axis at a smaller education length value than the Norwegian line does.) Table 12. SPSS output: Regression with interaction goodness of fit statistics

Table 13. SPSS output: Regression with interaction coefficients

Note that the model 1 estimate of the birth years coefficient (0.087) is a non-weighted mean of the three countries coefficients. It is necessary to weight the cases with the combined population size / design weight to obtain an unbiased estimate of the mean coefficient. (An estimate that takes account of the countries population size differences.) Using ordinary case weighting and regression analysis may produce better slope estimates (in the proximity of 0.078) but the statistical tests cannot be trusted. If correct statistical tests are an issue, you could use this Complex samples procedure. (Weighted least squares regression with the population size / design weight, as the weighting variable would also be better than OLS regression on weighted cases.) Complex samples procedure

Regression lines based on weighted model 1 estimates are presented in Figure 15, whereas Figure 16 demonstrates the more nuanced story told by the estimates obtained by using model 2, which includes interaction terms. Ultimately, it is up to you to decide whether the improved detail of the results presented in Figure 16 compared with those presented in Figure 15, is worth the effort and complexity of interpretation required if model 1 is replaced with model 2. Figure 15. Estimated regression lines for three countries with common (weighted mean) slope

Figure 16. Estimated regression lines for three countries with separate slope estimates.

Note also that the use of interaction terms is not limited to cases in which the associations between dependent and independent variables vary between countries. In principle, it can be used in all cases where one variables association with the dependent variable varies with the value of another variable. The association between education and subsequent occupational career may, for instance, depend on peoples gender and social or ethnic background. In such cases, you can create interaction terms by multiplying education variables by gender or ethnicity variables etc.

You might also like