On Lasso and adaptive Lasso for non-random sample in credit scoring

Emmanuel O Ogundimu

doi:10.1177/1471082x221092181

Abstract

Prediction models in credit scoring are often formulated using available data on accepted applicants at the loan application stage. The use of this data to estimate probability of default (PD) may lead to bias due to non-random selection from the population of applicants. That is, the PD in the general population of applicants may not be the same with the PD in the subpopulation of the accepted applicants. A prominent model for the reduction of bias in this framework is the sample selection model, but there is no consensus on its utility yet. It is unclear if the bias-variance trade- off of regularization techniques can improve the predictions of PD in non-random sample selection setting. To address this, we propose the use of Lasso and adaptive Lasso for variable selection and optimal predictive accuracy. By appealing to the least square approximation of the likelihood function of sample selection model, we optimize the resulting function subject to L1 and adaptively weighted L1 penalties using an efficient algorithm. We evaluate the performance of the proposed approach and competing alternatives in a simulation study and applied it to the well-known American Express credit card dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Statistical Modelling	Publication Date: May 9, 2022
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

On Lasso and adaptive Lasso for non-random sample in credit scoring

Abstract

Talk to us

Similar Papers

More From: Statistical Modelling

Lead the way for us

Similar Papers

Bibliography
-
-
--
23 Dec 2016
23 Dec 2016

A penalized likelihood estimation approach to semiparametric sample selection binary response modeling
Giampiero Marra ... Rosalba Radice
Electronic Journal of Statistics | VOL. 7
Giampiero Marra, et. al.Giampiero Marra ... Rosalba Radice
01 Jan 2013
Electronic Journal of Statistics | VOL. 7

Research on credit scoring method matching the probability of default: evidence from Lending Club
Hongdong Ma ... Zijian Wang
Applied Economics | VOL. 55
Hongdong Ma, et. al.Hongdong Ma ... Zijian Wang
07 Nov 2022
Applied Economics | VOL. 55

An artificial intelligence system for predicting customer default in e-commerce
Leonardo Vanneschi ... Aleš Popovič
Expert Systems with Applications | VOL. 104
Leonardo Vanneschi, et. al.Leonardo Vanneschi ... Aleš Popovič
14 Mar 2018
Expert Systems with Applications | VOL. 104

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On Lasso and adaptive Lasso for non-random sample in credit scoring

Abstract

Talk to us

Similar Papers

More From: Statistical Modelling