Abstract

The issues we study are risk identification, credit granting, pre-loan and post-loan lending risks in consumer lending and trust loans. How to effectively evaluate and identify the potential default risk of borrowers and calculate the probability of default of borrowers before granting loans is a fundamental part of credit risk management in modern financial institutions. This paper focuses on the statistical analysis of historical loan data from banks and other financial institutions with the help of the idea of non-equilibrium data classification, and the use of the random forest algorithm to establish a loan default prediction model. Then a fixed threshold screening in the form of a black and white list is used. The limitation of the current method is that it cannot meet the lending needs of large-scale transaction classes and cannot be judged quickly. When the number of decision trees in a random forest is large, the space and time required for training are large. In addition, it is insensitive to data and has low accuracy. The experiments found that the machine learning-based financial lending risk prediction method can solve the problem better, and the experimental results phenotyped that the neural network and random forest algorithm outperformed the decision tree and logistic regression classification algorithm in prediction performance. In addition, by using the random forest algorithm to rank the importance of features, the features that have a greater impact on whether the final default can be obtained, so that the lending risk judgment in the financial sector can be more effective.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call