Abstract
In this paper, we show that the recent notion of regression depth can be used as a data-analytic tool to measure the amount of separation between successes and failures in the binary response framework. Extending the algorithm for computing regression depth allows us to compute the overlap in data sets that are commonly fitted by logistic or probit regression models. The overlap is the number of observations that would need to be removed to obtain complete or quasi-complete separation, i.e., the situation where the regression parameters are no longer identifiable and the maximum likelihood estimate does not exist. It turns out that the overlap is often quite small. The results are equally useful in linear discriminant analysis.
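To make the definition of overlap concrete, the following is a minimal brute-force sketch for the special case of a single predictor plus an intercept. It is not the regression depth algorithm referred to above; the function name `overlap_1d` and the toy data are hypothetical and serve only to illustrate counting the observations that must be removed before the remaining successes and failures are (quasi-)completely separated.

```python
def overlap_1d(x, y):
    """Smallest number of observations to remove so that the rest is
    completely or quasi-completely separated (one predictor plus intercept).

    Illustrative brute force only; y must contain 0 (failure) or 1 (success).
    """
    pts = list(zip(x, y))
    best = len(pts)
    # It suffices to try thresholds located at the observed x values.
    for t in set(x):
        # Positive slope: successes should lie at or above t, failures at or below t.
        up = sum(1 for xi, yi in pts
                 if (yi == 1 and xi < t) or (yi == 0 and xi > t))
        # Negative slope: the mirror image.
        down = sum(1 for xi, yi in pts
                   if (yi == 1 and xi > t) or (yi == 0 and xi < t))
        best = min(best, up, down)
    return best


# Hypothetical toy data: one failure (x = 4) sits among the successes.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [0, 0, 1, 0, 1, 1]
print(overlap_1d(x, y))  # prints 1
```

In this toy data set, removing the single failure at x = 4 leaves all failures below all successes, so the overlap is 1. With several predictors the candidate separating hyperplanes can no longer be enumerated this simply, which is where a regression depth algorithm becomes relevant.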