ABSTRACT This study reviews the methodologies used in the literature to predict failure in small and medium-sized enterprises (SMEs). We identified 145 SMEs’ default prediction studies from 1972 to early 2023. We summarized the methods used in each study. The focus points are estimation methods, sample re-balancing methods, variable selection techniques, validation methods, and variables included in the literature. More than 1,200 factors used in failure prediction models have been identified, along with 54 unique feature selection techniques and 80 unique estimation methods. Over one-third of the studies do not use any feature selection method, and more than one-quarter use only in-sample validation. Our main recommendation for researchers is to use feature selection and validate results using hold-out samples or cross-validation. As an avenue for further research, we suggest in-depth empirical comparisons of estimation methods, feature selection techniques, and sample re-balancing methods based on some large and commonly used datasets.
Read full abstract