Multiple imputation method of missing credit risk assessment data based on generative adversarial networks

Feng Zhao, Yan Lu, Xinning Li, Lina Wang, Yingjie Song, Deming Fan, Caiming Zhang, Xiaobo Chen

doi:10.1016/j.asoc.2022.109273

Abstract

Credit risk assessment is critical to banks' loan approval and risk management. However, missing credit risk data may greatly reduce the effectiveness of the assessment model, so an imputation method that accurately predicts the missing values is beneficial. Building an effective imputation model is challenging because credit risk datasets typically have a high missing rate and complex, arbitrary missing patterns. In this paper, a novel imputation method named Multiple Generative Adversarial Imputation Networks (MGAIN) is proposed. Specifically, we first randomly select multiple attribute subsets instead of using the whole attribute set, so that more complete samples can be obtained. Then, the missing data in each attribute are imputed using generative adversarial imputation networks (GAIN), which fully consider the relationships among missing values by combining neural networks and adversarial learning. The proposed subset selection and multiple imputation strategy not only simplifies the network structure of GAIN but also reduces the demand for data. Finally, a weighted average method is presented to synthesize the multiple results for each missing attribute value and further improve accuracy. Experimental results on real-world data demonstrate that the proposed method is superior to other popular imputation methods.
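
The three steps described in the abstract (random attribute subsets, per-subset imputation, weighted averaging) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `impute_subset` is a hypothetical column-mean placeholder standing in for a trained GAIN model, and the weighting rule (fraction of fully observed rows in each subset) is an assumption, since the abstract does not say how the weights are computed. All function names and parameters are illustrative.

```python
import numpy as np


def impute_subset(X_sub):
    """Hypothetical placeholder imputer: fills NaNs with column means.

    In the method described above, a GAIN model trained on the
    attribute subset would take this role.
    """
    col_means = np.nanmean(X_sub, axis=0)
    filled = X_sub.copy()
    rows, cols = np.where(np.isnan(filled))
    filled[rows, cols] = col_means[cols]
    return filled


def mgain_style_impute(X, n_subsets=5, subset_size=3, seed=0):
    """Impute NaNs by averaging imputations from random attribute subsets."""
    rng = np.random.default_rng(seed)
    n_rows, n_cols = X.shape
    weighted_sum = np.zeros((n_rows, n_cols))
    weight_total = np.zeros(n_cols)

    for _ in range(n_subsets):
        # Step 1: randomly select a subset of attributes (columns).
        cols = rng.choice(n_cols, size=subset_size, replace=False)
        X_sub = X[:, cols]

        # Assumed weight: share of rows that are complete within this subset.
        complete_fraction = np.mean(~np.isnan(X_sub).any(axis=1))
        weight = complete_fraction + 1e-8  # avoid a zero weight

        # Step 2: impute within the subset (placeholder for GAIN).
        weighted_sum[:, cols] += weight * impute_subset(X_sub)
        weight_total[cols] += weight

    # Step 3: weighted average of the per-subset results.
    # Columns never sampled fall back to the placeholder imputer on all of X.
    averaged = impute_subset(X)
    covered = weight_total > 0
    averaged[:, covered] = weighted_sum[:, covered] / weight_total[covered]

    # Observed entries are kept; only missing entries are replaced.
    return np.where(np.isnan(X), averaged, X)


if __name__ == "__main__":
    X_demo = np.array([
        [1.0, 2.0, np.nan, 4.0],
        [2.0, np.nan, 6.0, 8.0],
        [3.0, 6.0, 9.0, np.nan],
        [4.0, 8.0, 12.0, 16.0],
    ])
    print(mgain_style_impute(X_demo, n_subsets=6, subset_size=2, seed=1))
```

Because the placeholder imputer ignores the other attributes in a subset, every subset yields the same fill value here; with a learned model such as GAIN, the per-subset results would differ and the weighted average would actually combine them.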

Journal: Applied Soft Computing
Publication Date: Jul 9, 2022
Citations: 21

Similar Papers

Imputation strategies for missing binary outcomes in cluster randomized trials
Jinhui Ma ... Lehana Thabane
BMC Medical Research Methodology | VOL. 11
16 Feb 2011

GAMIN: Generative Adversarial Multiple Imputation Network for Highly Missing Data
Seongwook Yoon ... Sanghoon Sull
01 Jun 2020

Multiple Imputation via Generative Adversarial Network for High-dimensional Blockwise Missing Value Problems
Zongyu Dai ... Zhiqi Bu
Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications | VOL. 2021
01 Dec 2021

Comparing the performance of different multiple imputation strategies for missing binary outcomes in cluster randomized trials: a simulation study
Lehana Thabane ... Jinhui Ma
Open Access Medical Statistics | VOL. 2
01 Dec 2012
