Inferring the Outcomes of Rejected Loans: An Application of Semisupervised Clustering

Zhiyong Li,Fanyin Zhou,Feng Shen,Xinyi Hu,Ke Li

doi:10.1111/rssa.12534

Abstract

SummaryRejection inference aims to reduce sample bias and to improve model performance in credit scoring. We propose a semisupervised clustering approach as a new rejection inference technique. K-prototype clustering can deal with mixed types of numeric and categorical characteristics, which are common in consumer credit data. We identify homogeneous acceptances and rejections and assign labels to part of the rejections according to the label of acceptances. We test the performance of various rejection inference methods in logit, support vector machine and random-forests models based on data sets of real consumer loans. The predictions of clustering rejection inference show advantages over other traditional rejection inference methods. Inferring the label of the rejection from semisupervised clustering is found to help to mitigate the sample bias problem and to improve the predictive accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Inferring the Outcomes of Rejected Loans: An Application of Semisupervised Clustering

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society Series A: Statistics in Society

Lead the way for us

Journal: Journal of the Royal Statistical Society Series A: Statistics in Society	Publication Date: Nov 14, 2019
Citations: 6

Similar Papers

Traffic Volume Forecasting Model of Freeway Toll Stations During Holidays – An SVM Model
Xiaowei Hu ... Tianlin Wang
Promet - Traffic&Transportation | VOL. 34
Xiaowei Hu, et. al.Xiaowei Hu ... Tianlin Wang
15 Jun 2022
Promet - Traffic&Transportation | VOL. 34

Identification of the geographic origin of peaches by VIS-NIR spectroscopy, fluorescence spectroscopy and image processing technology
Qinyi Yang ... Huirong Xu
Journal of Food Composition and Analysis | VOL. 114
Qinyi Yang, et. al.Qinyi Yang ... Huirong Xu
23 Aug 2022
Journal of Food Composition and Analysis | VOL. 114

Comparison of Accuracy of Support Vector Machine Model and Logistic Regression Model in Predicting Individual Loan Defaults
...
American Journal of Applied Mathematics and Statistics | VOL. 6
, et. al. ...
14 Dec 2018
American Journal of Applied Mathematics and Statistics | VOL. 6

مدل سازی پایداری خاکدانهها با استفاده از ماشینهای بردار پشتیبان و رگرسیون خطی چند متغیره
...
-
, et. al. ...
25 Apr 2015
25 Apr 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Inferring the Outcomes of Rejected Loans: An Application of Semisupervised Clustering

Abstract

Talk to us

Similar Papers

More From: Journal of the Royal Statistical Society Series A: Statistics in Society