Multiple Regression Analysis:Further Issues
This chapter brings together several issues in multiple regression analysis that we
could not conveniently cover in earlier chapters. These topics are not as fundamental
as the material in Chapters 3 and 4, but they are important for applying
multiple regression to a broad range of empirical problems.
6.1 EFFECTS OF DATA SCALING ON OLS STATISTICS
In Chapter 2 on bivariate regression, we briefly discussed the effects of changing the
units of measurement on the OLS intercept and slope estimates. We also showed that
changing the units of measurement did not affect R-squared. We now return to the issue
of data scaling and examine the effects of rescaling the dependent or independent variables
on standard errors, t statistics, F statistics, and confidence intervals.
We will discover that everything we expect to happen, does happen. When variables
are rescaled, the coefficients, standard errors, confidence intervals, t statistics, and F
statistics change in ways that preserve all measured effects and testing outcomes. While
this is no great surprise—in fact, we would be very worried if it were not the case—it
is useful to see what occurs explicitly. Often, data scaling is used for cosmetic purposes,
such as to reduce the number of zeros after a decimal point in an estimated coefficient.
By judiciously choosing units of measurement, we can improve the appearance of an
estimated equation while changing nothing that is essential.
We could treat this problem in a general way, but it is much better illustrated with
examples. Likewise, there is little value here in introducing an abstract notation.
We begin with an equation relating infant birth weight to cigarette smoking and
family income: