Abstract

In this paper, we study the effectiveness of several simple machine learning methods in predicting bank failures. Using a raw dataset of 10,938 US banks over the period 2000–2020, we find that machine learning approaches do not clearly outperform the benchmark conventional statistical method, logistic regression. However, using principal component analysis (PCA) to retain the relevant variance in the variables significantly improves the performance of the machine learning methods and raises their out-of-sample accuracy to a range from over 70% to over 80%. Of all the machine learning methods used in this paper, the simple k-nearest neighbors (KNN) model appears to be the best at forecasting bank failure in the United States.
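The PCA-then-KNN idea described above can be illustrated with a minimal, dependency-free sketch. Everything in it is an assumption made for illustration: the data are synthetic (not the paper's bank dataset), the six features are hypothetical ratios, the helper names (`make_bank`, `top_components`, `knn_predict`) are invented here, and the choices of two components and five neighbors are arbitrary rather than the authors' settings. PCA is approximated by power iteration with deflation on the covariance matrix of the standardized training data.

```python
import math
import random
from collections import Counter

rng = random.Random(0)  # fixed seed so the sketch is reproducible

def make_bank(failed):
    """One synthetic bank: three correlated ratios carry the signal, three are noise."""
    signal = rng.gauss(3.0 if failed else 0.0, 1.0)
    return [signal + rng.gauss(0, 0.3) for _ in range(3)] + [rng.gauss(0, 1) for _ in range(3)]

def standardize(train, test):
    """Scale every column with the training mean and standard deviation only."""
    cols = list(zip(*train))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) for c, m in zip(cols, means)]
    def scale(rows):
        return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]
    return scale(train), scale(test)

def covariance(rows):
    """Covariance matrix of rows whose columns are already centered."""
    n, d = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / n for j in range(d)] for i in range(d)]

def top_components(cov, k, iters=500):
    """Leading k eigenvectors of a symmetric matrix (power iteration with deflation)."""
    d = len(cov)
    cov = [row[:] for row in cov]  # work on a copy
    comps = []
    for _ in range(k):
        v = [rng.gauss(0, 1) for _ in range(d)]
        for _ in range(iters):
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue; subtract its contribution (deflation).
        lam = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d)) for i in range(d))
        comps.append(v)
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)] for i in range(d)]
    return comps

def project(rows, comps):
    """Coordinates of each row along the retained principal components."""
    return [[sum(a * b for a, b in zip(r, c)) for c in comps] for r in rows]

def knn_predict(train_x, train_y, x, k=5):
    """Majority label among the k nearest training points (Euclidean distance)."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: math.dist(p[0], x))[:k]
    return Counter(y for _, y in nearest).most_common(1)[0][0]

labels = [i % 2 for i in range(300)]  # 1 = failed, 0 = survived (balanced, illustrative)
banks = [make_bank(y) for y in labels]
train_x, test_x = banks[:200], banks[200:]
train_y, test_y = labels[:200], labels[200:]

train_s, test_s = standardize(train_x, test_x)
components = top_components(covariance(train_s), k=2)
train_p, test_p = project(train_s, components), project(test_s, components)

predictions = [knn_predict(train_p, train_y, x) for x in test_p]
accuracy = sum(p == y for p, y in zip(predictions, test_y)) / len(test_y)
print(f"out-of-sample accuracy: {accuracy:.2f}")
```

The scaling statistics and the components are estimated on the training split only, so the held-out banks play no part in fitting; this is what makes the reported figure an out-of-sample accuracy. The number printed here reflects the synthetic data and says nothing about the paper's results.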

