Abstract

We consider the situation in which there are multiple proxies for one unobserved explanatory variable in a linear regression and provide a procedure by which the coefficient of interest can be extracted from a multiple regression in which all the proxies are used simultaneously. This post hoc estimator is strictly superior in large samples to coefficients derived using any index or linear combination of the proxies that is created prior to the regression. To use an index created from the proxies that extracts the largest possible signal from them requires knowledge of information that is not available to the researcher. Using the proxies simultaneously in a multiple regression delivers this information, and the researcher then simply combines the coefficients in a known way to obtain the estimate of the effect of the unobserved factor. This procedure is also much more robust than ad hoc index construction to departures from the assumption of an underlying common factor. We provide some Monte Carlo simulations and applications to existing empirical problems to show that the reduction in attenuation bias can be non-negligible, even in finite samples.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call