Combining multiple probability predictions in the presence of class imbalance to discriminate between potential bad and good borrowers in the peer-to-peer lending market

Luca Zanin

doi:10.1016/j.jbef.2020.100272

Abstract

Credit risk scoring predictions represent an effective guide for lenders to discriminate between potential good (who will repay the loan) and bad (who will default) borrowers in the online social lending market. A common characteristic of such a market is a lower percentage of defaulted borrowers than non-defaulted borrowers; thus, the sample is class imbalanced. Class imbalance may affect the accuracy of default predictions, as classifiers tend to be biased towards the majority class (good borrowers). We analyse the default prediction performance when combining class rebalancing methods with different regression and machine learning techniques. We also propose to combine multiple probability predictions to improve the predictive performance. The analysis is based on a book of loans (with a three-year term) funded in the 2010–2015 period though the online platform of Lending Club. The results show that some measures of predictive accuracy tend to improve when the scoring models are trained using a rebalanced, rather than an imbalanced sample, except when the extreme gradient boosting approach is applied. Finally, we find that combining multiple probability predictions via regularised logistic regression may help to improve the predictive accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Combining multiple probability predictions in the presence of class imbalance to discriminate between potential bad and good borrowers in the peer-to-peer lending market

Abstract

Talk to us

Similar Papers

More From: Journal of Behavioral and Experimental Finance

Lead the way for us

Journal: Journal of Behavioral and Experimental Finance	Publication Date: Jan 14, 2020
Citations: 24

Similar Papers

Estimating Downhole Vibration via Machine Learning Techniques Using Only Surface Drilling Parameters
Prince Okoli ... Roman Shor
-
Prince Okoli, et. al.Prince Okoli ... Roman Shor
22 Apr 2019
22 Apr 2019

The Impact of Undersampling on the Predictive Performance of Logistic Regression and Machine Learning Algorithms: A Simulation Study.
Abigail R Cartus ... Lisa M Bodnar
Epidemiology | VOL. 31
Abigail R Cartus, et. al.Abigail R Cartus ... Lisa M Bodnar
31 Mar 2020
Epidemiology | VOL. 31

An empirical study on predictability of software maintainability using imbalanced data
Ruchika Malhotra ... Kusum Lata
Software Quality Journal | VOL. 28
Ruchika Malhotra, et. al.Ruchika Malhotra ... Kusum Lata
05 Aug 2020
Software Quality Journal | VOL. 28

A high-performance approach for predicting donor splice sites based on short window size and imbalanced large samples
Ying Zeng ... Hongjie Yuan
Biology Direct | VOL. 14
Ying Zeng, et. al.Ying Zeng ... Hongjie Yuan
11 Apr 2019
Biology Direct | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combining multiple probability predictions in the presence of class imbalance to discriminate between potential bad and good borrowers in the peer-to-peer lending market

Abstract

Talk to us

Similar Papers

More From: Journal of Behavioral and Experimental Finance