If we can make certain assumptions about the errors in a regression model, we can perform statistical tests to determine if the model is useful. The following assumptions are made about the errors:
The errors are independent.
The errors are normally distributed.
The errors have a mean of zero.
The errors have a constant variance (regardless of the value of X).
It is possible to check the data to see if these assumptions are met. Often a plot of the residuals will highlight any glaring violations of the assumptions. When the errors (residuals) are plotted against the independent variable, the pattern should appear random.
Figure 4.4 presents some typical error patterns, with Figure 4.4A displaying a pattern that is expected when the assumptions are met and the model is appropriate. The errors are random and no discernible pattern is present. Figure 4.4B demonstrates an error pattern in which the errors increase as X increases, violating the constant variance assumption. Figure 4.4C shows errors consistently increasing at first and then consistently decreasing. A pattern such as this would indicate that the model is not linear and some other form (perhaps quadratic) should be used. In general, patterns in the plot of the errors indicate problems with the assumptions or the model specification.
While the errors are assumed to have constant variance
where
In the Triple A Construction example,
From this, we can estimate the standard deviation as
This is called the standard error of the estimate or the standard deviation of the regression. In this example,
This is used in many of the statistical tests about the model. It is also used to find interval estimates for both Y and regression coefficients.2