Consumer credit risk: Individual probability estimates using machine learning

Jochen Kruppa, Alexandra Schwarz, Gerhard Arminger, Andreas Ziegler

doi:10.1016/j.eswa.2013.03.019

Abstract

Consumer credit scoring is often considered a classification task where clients receive either a good or a bad credit status. Default probabilities provide more detailed information about the creditworthiness of consumers, and they are usually estimated by logistic regression. Here, we present a general framework for estimating individual consumer credit risks using machine learning methods. Since a probability is an expected value, all nonparametric regression approaches that are consistent for the mean are also consistent for the probability estimation problem. Among others, random forests (RF), k-nearest neighbors (kNN), and bagged k-nearest neighbors (bNN) belong to this class of consistent nonparametric regression approaches. We apply the machine learning methods and an optimized logistic regression to a large dataset of complete payment histories of short-term installment credits. We demonstrate probability estimation in Random Jungle, an RF package written in C++ with a generalized framework for fast tree growing, probability estimation, and classification. We also describe an algorithm for tuning the terminal node size for probability estimation. We demonstrate that regression RF outperforms the optimized logistic regression model, kNN, and bNN on the test data of the short-term installment credits.
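The idea that a default probability is the expected value of a 0/1 response, so that a regression estimate of the mean doubles as a probability estimate, can be illustrated with a minimal sketch. The code below is not the authors' implementation and does not use Random Jungle; it is a pure-Python toy, assuming a single simulated feature, a hypothetical neighborhood size `k=25`, and a simple bootstrap version of bagged kNN. All function names (`knn_probability`, `bagged_knn_probability`, `brier_score`, `simulate`) are illustrative. The Brier score is used here only as one common way to compare probability estimates.

```python
import random


def knn_probability(train_x, train_y, query, k):
    """Estimate P(y = 1 | query) as the mean 0/1 label of the k nearest points."""
    # Squared Euclidean distance from the query to every training point.
    dists = [
        (sum((a - b) ** 2 for a, b in zip(x, query)), y)
        for x, y in zip(train_x, train_y)
    ]
    dists.sort(key=lambda pair: pair[0])
    nearest = dists[:k]
    # Averaging 0/1 labels is regression on the mean, i.e. a probability estimate.
    return sum(y for _, y in nearest) / len(nearest)


def bagged_knn_probability(train_x, train_y, query, k, n_bags, rng):
    """Average kNN estimates over bootstrap resamples (a simple bagging sketch)."""
    n = len(train_x)
    estimates = []
    for _ in range(n_bags):
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        bag_x = [train_x[i] for i in idx]
        bag_y = [train_y[i] for i in idx]
        estimates.append(knn_probability(bag_x, bag_y, query, k))
    return sum(estimates) / n_bags


def brier_score(probs, labels):
    """Mean squared difference between predicted probabilities and 0/1 outcomes."""
    return sum((p - y) ** 2 for p, y in zip(probs, labels)) / len(labels)


def simulate(n, rng):
    """Toy data (an assumption): one feature x in [0, 1), with P(default | x) = x."""
    xs, ys = [], []
    for _ in range(n):
        x = rng.random()
        xs.append((x,))
        ys.append(1 if rng.random() < x else 0)
    return xs, ys


rng = random.Random(0)
train_x, train_y = simulate(300, rng)
test_x, test_y = simulate(100, rng)

knn_probs = [knn_probability(train_x, train_y, q, k=25) for q in test_x]
bnn_probs = [
    bagged_knn_probability(train_x, train_y, q, k=25, n_bags=10, rng=rng)
    for q in test_x
]

print("kNN Brier score:", brier_score(knn_probs, test_y))
print("bNN Brier score:", brier_score(bnn_probs, test_y))
```

Each client receives an individual probability in [0, 1] rather than a hard good/bad label. In practice the neighborhood size (or, for a random forest, the terminal node size) would be tuned on held-out data rather than fixed as it is here.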

Journal: Expert Systems with Applications
Publication Date: Mar 21, 2013
Citations: 147

Similar Papers

Machine learning in pain research.
Jörn Lötsch ... Alfred Ultsch
Pain | VOL. 159
24 Nov 2017

Deposit type discrimination based on trace elements in sphalerite
Yu-Miao Meng ... Songning Meng
Ore Geology Reviews | VOL. 165
13 Jan 2024

Machine Learning Techniques in Enhanced Oil Recovery Screening Using Semisupervised Label Propagation
Pouya Vaziri ... Hamzeh Alimohammadi
SPE Journal | VOL. 29
19 Jun 2024

Sensors support machine learning
Food Science and Technology | VOL. 33
01 Dec 2019
