The ubiquitous presence of endogenous regressors presents a significant challenge when drawing causal inference using observational data. The classical econometric method used to handle regressor endogeneity requires IVs that must satisfy the stringent condition of exclusion restriction, rendering it unfeasible in many settings. Herein, we propose a new IV-free method that uses copulas to address the endogeneity problem. Existing copula correction methods require nonnormal endogenous regressors: normally or nearly normally distributed endogenous regressors cause model non-identification or significant finite-sample bias. Furthermore, existing copula control function methods presume the independence of exogenous regressors and the copula control function. While maintaining the Gaussian copula regressor-error dependence structure, our generalized two-stage copula endogeneity correction (2sCOPE) method simultaneously relaxes the two key identification requirements. Under the Gaussian copula dependence structure, we prove that 2sCOPE yields consistent causal-effect estimates with correlated endogenous and exogenous regressors as well as normally distributed endogenous regressors. In addition to relaxing the identification requirements, 2sCOPE has superior finite-sample performance and addresses the significant finite-sample bias problem due to insufficient regressor nonnormality. Moreover, 2sCOPE employs generated regressors derived from existing regressors to control for endogeneity, and can thus considerably increase the ease and broaden the applicability of IV-free methods for handling regressor endogeneity. We further demonstrate 2sCOPE’s performance using simulation studies and illustrate its use in an empirical application.
Read full abstract