Abstract

Probit models are used extensively for inferential purposes in the social sciences as discrete data are prevalent in a vast body of social studies. Among many accompanying model inference problems, a critical question remains unsettled: how to develop a goodness-of-fit measure that resembles the ordinary least square (OLS) R2 used for linear models. Such a measure has long been sought to achieve 'comparability' of different empirical models across multiple samples addressing similar social questions. To this end, we propose a novel R2 measure for probit models using the notion of surrogacy - simulating a continuous variable as a surrogate of the original discrete response (Liu & Zhang, Journal of the American Statistical Association, 113, 845 and 2018). The proposed R2 is the proportion of the variance of the surrogate response explained by explanatory variables through a linear model, and we call it a surrogate R2 . This paper shows both theoretically and numerically that the surrogate R2 approximates the OLS R2 based on the latent continuous variable, preserves the interpretation of explained variation, and maintains monotonicity between nested models. As no other pseudo R2 , McKelvey and Zavoina's and McFadden's included, can meet all the three criteria simultaneously, our measure fills this crucial void in probit model inference.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call