Dual Autoencoders Generative Adversarial Network for Imbalanced Classification Problem

Ensen Wu,Roy E Welsch,Hongyan Cui

doi:10.1109/access.2020.2994327

Abstract

The imbalanced classification problem has become greatest issue in many fields, especially in fraud detection. In fraud detection, the transaction datasets available for training are extremely imbalanced, with fraudulent transaction logs considerably less represented. Meanwhile, the feature information of the fraud samples with better classification capabilities cannot be mined directly by feature learning methods due to too few fraud samples. These significantly reduce the effectiveness of fraud detection models. In this paper, we proposed a Dual Autoencoders Generative Adversarial Network, which can balance the majority and minority classes and learn feature representations of normal and fraudulent transactions to improve the accuracy of the fraud detection. The new model firstly trains a Generative Adversarial Networks to output sufficient mimicked fraudulent transactions for autoencoder training. Then, two autoencoders are trained on the normal and fraud dataset, respectively. The samples are encoded by two autoencoders to obtain two sets of features, which are combined to form the dual autoencoding features. Finally, the model detects fraudulent transactions by a Neural Network trained on the augmented training set. Experimental results show that the model outperforms a set of well-known classification methods in experiments, especially the sensitivity and precision, which are effectively improved.

Highlights

With the continuous increase of online transactions via credit cards, more and more fraudulent transactions are increasingly produced, bringing great losses to banks, merchants, and cardholders
In order to make full use of the information of the samples in the dataset and alleviate the imbalanced-class problem, we proposed Dual Autoencoders Generative Adversarial Network (DAEGAN)
In order to solve the problem that the autoencoder cannot completely fit the fraud samples data, we propose to train the autoencoder AE_f on the augmented fraud training set x_f, which contains the real fraud samples and fake fraud samples generated by the first WGAN: AE _f arg min θθ x_f, gAE_f (fAE_f (x_f ))

Summary

INTRODUCTION

With the continuous increase of online transactions via credit cards, more and more fraudulent transactions are increasingly produced, bringing great losses to banks, merchants, and cardholders. In the actual fraud detection dataset, the positive and negative samples are very imbalanced, and the extremely small number of fraudulent transaction records are available. This extremely imbalanced data may cause the classifier to produce biased results, because classifier may sacrifice the accuracy of the minority samples and treat them as noise [12]. In order to make full use of the information of the samples in the dataset and alleviate the imbalanced-class problem, we proposed Dual Autoencoders Generative Adversarial Network (DAEGAN). DAEGAN mines the feature information of fraud samples based on the augmented fraud dataset

RELATED WORK

EXPERIMENTS

COMPARISON WITH OTHER SEVERAL METHODS Baselines

Findings

CONCLUSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Dual Autoencoders Generative Adversarial Network for Imbalanced Classification Problem

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

A hybrid method with dynamic weighted entropy for handling the problem of class imbalance with overlap in credit card fraud detection
Zhenchuan Li ... Changjun Jiang
Expert Systems with Applications | VOL. 175
Zhenchuan Li, et. al.Zhenchuan Li ... Changjun Jiang
25 Feb 2021
Expert Systems with Applications | VOL. 175

부정 탐지를 위한 이상치 분석 활용방안 연구 : 농수산 상장예외품목 거래를 대상으로
Dongsung Kim ... Kitae Kim
Journal of Intelligence and Information Systems | VOL. 20
Dongsung Kim, et. al.Dongsung Kim ... Kitae Kim
30 Sep 2014
Journal of Intelligence and Information Systems | VOL. 20

Using generative adversarial networks for improving classification effectiveness in credit card fraud detection
Ugo Fiore ... Francesco Palmieri
Information Sciences | VOL. 479
Ugo Fiore, et. al.Ugo Fiore ... Francesco Palmieri
25 Dec 2017
Information Sciences | VOL. 479

Handling Class Imbalance in Online Transaction Fraud Detection
Kanika ... Yunyoung Nam
Computers, Materials & Continua | VOL. 70
Kanika, et. al. Kanika ... Yunyoung Nam
01 Jan 2021
Computers, Materials & Continua | VOL. 70

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dual Autoencoders Generative Adversarial Network for Imbalanced Classification Problem

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access