2.3. Extending the Model

The model of equation (2.2) is somewhat restrictive because it assumes that the regression slopes are invariant across time. This assumption can be tested and relaxed. Consider the model


which is identical to equation (2.2) except that the β coefficient is allowed to differ for times 1 and 2. The difference score equation then becomes


which, with a little algebra, can also be written as


This equation says that both (x2x1) and x1 should appear as independent variables in the difference equation. If the coefficient for x1 is significantly different from 0, that's evidence that β1 and β2 are not equal.

Let's try it for the NLSY data. Here is the program:

PROC REG DATA=diff;
   MODEL antidiff=selfdiff povdiff self90 pov90;
RUN;

Results in Output 2.6 provide no evidence that the effects of poverty and self-esteem on antisocial behavior are different in 1990 and 1994. Both coefficients for the time 1 variables are far from statistically significant.

Table 2.6. Output 2.6 Difference Regression with Time 1 Variables
VariableDFParameter EstimateStandard ErrortValuePr>|t|
Intercept10.648370.521131.240.2139
selfdiff1−0.066850.01960−3.410.0007
povdiff1−0.014100.15110−0.090.9257
self901−0.022260.02511−0.890.3757
pov9010.041260.156260.260.7919
      

Another way to extend the model is to allow the coefficients for the time-invariant variables to change with time, as in


Here, the γ coefficient is allowed to differ at the two time points, leading to the difference equation


In this case z does not drop out of the equation and must be included in the regression model. This result teaches us that fixed effects regression only controls for those time-invariant variables whose effects on the dependent variable are also time invariant. If a variable does not have time-invariant effects, it must be explicitly included in the model.

The NLSY data set has several time-invariant variables that are worth examining as possible predictors:


BLACK

1 if child is black, otherwise 0


HISPANIC

1 if child is Hispanic, otherwise 0


CHILDAGE

child's age in 1990


MARRIED

1 if mother was currently married in 1990, otherwise 0


GENDER

1 if female, 0 if male


MOMAGE

mother's age at birth of child


MOMWORK

1 if mother was employed in 1990, otherwise 0

These variables are now included in the difference score regression:

PROC REG DATA=diff;
   MODEL antidiff=selfdiff povdiff black hispanic childage
         married gender momage momwork;
RUN;

Results in Output 2.7 show that only one of the time-invariant variables, CHILDAGE, has a coefficient that even approaches statistical significance. And the inclusion of the time-invariant variables has very little impact on the coefficient estimates for SELFDIFF and POVDIFF. We conclude that there is some evidence that the effect of CHILDAGE is different in 1990 and 1994, but there is little or no evidence for a change over time in the effects of the other time-invariant variables.

Table 2.7. Output 2.7 Difference Regression with Time Invariant Variables
VariableDFParameter EstimateStandard ErrortValuePr>|t|
Intercept1−0.820021.27227−0.640.5195
selfdiff1−0.052750.01554−3.390.0007
povdiff1−0.026700.12996−0.210.8373
black1−0.061790.14583−0.420.6720
hispanic10.105180.162570.650.5179
childage10.219530.107352.050.0413
married1−0.174870.14921−1.170.2417
gender10.110650.125840.880.3796
momage1−0.044150.02972−1.490.1379
momwork1−0.124450.13270−0.940.3487

Results using the difference score method can also be replicated on the person-year data set, using PROC GLM with the ABSORB statement. To test for changes in the coefficients for SELF and POV, we include interactions between these variables and TIME:

PROC GLM DATA=persyr2;
   ABSORB id;
   MODEL anti=self pov time self*time pov*time;
RUN;

Results in Output 2.8 are equivalent to those in Output 2.6.

Table 2.8. Output 2.8 GLM with Interactions between TIME and Time-Varying Covariates
ParameterEstimateStandard ErrortValuePr>|t|
self−.04458330750.02002789−2.230.0264
pov−.05535622910.14951835−0.370.7113
time0.64837097420.521130181.240.2139
self*time−.02226262580.02511098−0.890.3757
pov*time0.04125721820.156259470.260.7919

To replicate the results in Output 2.7, we can use PROC GLM with interactions between TIME and the time-invariant predictors:

PROC GLM DATA=persyr2;
   ABSORB id;
   MODEL anti=self pov time black*time hispanic*time
         childage*time married*time gender*time
         momage*time momwork*time;
RUN;

Results in Output 2.9 are, in fact, equivalent to those in Output 2.7. Note that although the model includes interactions between TIME and the time-invariant predictors, it does not include the "main effects" of those covariates. While this may seem contrary to conventional practice, it is not a problem for this kind of analysis. In fact, if you tried to include the main effects, GLM would report coefficients of 0 with 0 degrees of freedom. That's because the time-invariant variables are perfectly collinear with the αi parameters that have been "conditioned" out of the regression equation.

Table 2.9. Output 2.9 GLM with Interactions between TIME and Time-Invariant Covariates
ParameterEstimateStandard ErrortValuePr>|t|
self−.05275401810.01554026−3.390.0007
pov−.02670382940.12995869−0.210.8373
time−.82001956711.27227104−0.640.5195
time*black−.06178700140.14583481−0.420.6720
time*hispanic0.10518146040.162566790.650.5179
time*childage0.21953069570.107346732.050.0413
time*married−.17487345070.14921056−1.170.2417
time*gender0.11065050350.125839600.880.3796
time*momage−.04414937540.02971813−1.490.1379
time*momwork−.12445052380.13269724−0.940.3487

It's also worth noting that, like interactions in general, the interactions between time and the time-invariant covariates have a dual interpretation:

  • The effect of the covariate varies with time.

  • The effect of time varies with the level of the covariate.

Often the latter interpretation will be more compelling. For example, the coefficient of .219 for the CHILDAGE*TIME interaction says that with each one-year increase in age at time 1, the rate of change in antisocial behavior from time 1 to time 2 goes up by .219. That is, older children have a more rapid rate of increase.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset