Abstract

Many industrial applications, such as credit card fraud detection (CCFD) and defective part identification, involve imbalanced classification, since minority samples are substantially less common than majority samples. Classifier performance also tends to suffer from noisy samples in the majority or minority classes. This work proposes a new undersampling scheme, called the clustering-based noisy-sample-removed undersampling scheme (NUS), for imbalanced classification. The majority-class samples are first clustered. For each cluster, a hypersphere is built whose center is the cluster center and whose radius is the distance from that center to the farthest majority-class sample in the cluster. The Euclidean distance between each cluster center and each minority sample is then computed to determine whether the sample lies inside the hypersphere, and the noisy minority samples are excluded. Noisy samples of the majority class are removed by the same procedure. Second, we propose NUS, which combines this noisy-sample removal with undersampling techniques. Finally, to demonstrate the effectiveness of NUS, we integrate it with the basic classifiers random forest (RF), decision tree (DT), and logistic regression (LR), and compare them with seven undersampling, oversampling, and noisy-sample-removal methods. Experiments are performed on 13 public datasets and three real e-commerce transaction datasets. The results show that NUS improves the performance of existing classifiers.
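A minimal sketch of the noisy-minority-removal step is given below. It assumes k-means as the clustering algorithm and assumes that minority samples falling inside any majority-cluster hypersphere are the ones flagged as noisy; the function name remove_noisy_minority, the n_clusters parameter, and the use of scikit-learn are illustrative choices, not the paper's implementation.

import numpy as np
from sklearn.cluster import KMeans

def remove_noisy_minority(X_maj, X_min, n_clusters=5, random_state=0):
    """Illustrative sketch: cluster the majority class, build one hypersphere
    per cluster (radius = distance from the center to its farthest member),
    and drop minority samples that fall inside any hypersphere."""
    km = KMeans(n_clusters=n_clusters, random_state=random_state).fit(X_maj)
    centers = km.cluster_centers_

    # Radius of each cluster: distance from the center to its farthest majority sample.
    radii = np.array([
        np.linalg.norm(X_maj[km.labels_ == c] - centers[c], axis=1).max()
        for c in range(n_clusters)
    ])

    # Euclidean distance from every minority sample to every cluster center.
    dists = np.linalg.norm(X_min[:, None, :] - centers[None, :, :], axis=2)

    # Assumption: a minority sample inside any majority hypersphere is treated as noisy.
    noisy = (dists <= radii).any(axis=1)
    return X_min[~noisy]

# Hypothetical usage: X_majority and X_minority are feature arrays for the two classes.
# X_minority_clean = remove_noisy_minority(X_majority, X_minority, n_clusters=5)

Applying the same routine with the class roles swapped would give the majority-class cleaning step described in the abstract, after which undersampling the remaining majority samples completes the NUS scheme.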
