An AutoEncoder enhanced light gradient boosting machine method for credit card fraud detection

Lianhong Ding,Luqi Liu,Yangchuan Wang,Peng Shi,Jianye Yu

doi:10.7717/peerj-cs.2323

Abstract

Online financial transactions bring convenience to people’s lives, but also present vulnerabilities for criminals to embezzle users’ accounts and trick users into credit card fraud. Although machine learning methods have been adopted to detect anomalous transactions, it’s hard for a single machine learning method to achieve satisfying results with the increasing scale and dimensionality of financial datasets. In addition, for anomaly detection of financial data, there is an obvious imbalance between normal records and abnormal. In this situation, the experimental results cannot be objectively evaluated only by the traditional metrics, such as precision, recall, and accuracy. This paper proposes an AutoEncoder enhanced LightGBM method for credit card detection. The method inherits the advantages of each component, using an AutoEncoder for feature reconstruction on the dataset, and integrating the LightGBM algorithm for improving the GBDT (Gradient Boosting Decison Tree) to detect abnormal data more accurately and efficiently. Besides the traditional evaluation metrics, F-measure, area under curve (AUC), Matthew’s correlation coefficient (MCC), and balanced classification rate (BCR) are also adopted as the evaluation metrics. Two financial datasets were used to validate the performance and robustness of the proposed model. Results obtained from the credit card fraud dataset containing 31 features indicate that our model significantly outperforms other models with a recall of 94.85%, representing a 10.70% improvement compared to the best detection performance model with a recall of only 86%. Additionally, our model’s BCR score is also significantly better than other models, with a BCR score of 97%, as opposed to the best detection performance model’s BCR score of 92%, representing a 5% improvement by our model. Various sampling methods and model combinations were considered in this study. It was found that the SMOTE algorithm combined with the proposed model produced the best results, with an AUC value of 96.83% and an F-measure score of 80.27%. The Santander bank transaction record dataset is a high dimensional large dataset containing 200 features. Experimental results on this dataset reveal that compared to other models, our model significantly improved recall and F-measure results, raising the recall to 94.14% and the F-measure score by 11.51%, surpassing the second-best-performing model. Overall, these findings demonstrate the robustness and superiority of our model in detecting fraudulent transactions and highlight the effectiveness of the SMOTE algorithm in combination with the proposed model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An AutoEncoder enhanced light gradient boosting machine method for credit card fraud detection

Abstract

Talk to us

Similar Papers

More From: PeerJ Computer Science

Lead the way for us

Journal: PeerJ Computer Science	Publication Date: Oct 18, 2024
License type: CC BY 4.0

Similar Papers

Twitter metrics complement traditional conference evaluations to evaluate knowledge translation at a National Emergency Medicine Conference.
Stella Yiu ... Jason R Frank
CJEM | VOL. 22
Stella Yiu, et. al.Stella Yiu ... Jason R Frank
26 Mar 2020
CJEM | VOL. 22

Credit card fraud detection using machine learning techniques: A comparative analysis
John O Awoyemi ... Samuel A Oluwadare
-
John O Awoyemi, et. al.John O Awoyemi ... Samuel A Oluwadare
01 Oct 2017
01 Oct 2017

Untangling Result List Refinement and Ranking Quality
Jiyin He ... Marc Bron
-
Jiyin He, et. al.Jiyin He ... Marc Bron
09 Aug 2015
09 Aug 2015

Re-evaluating Keystroke Dynamics for Continuous Authentication
Dilshan Senarath ... Maduka Vishvajith
-
Dilshan Senarath, et. al.Dilshan Senarath ... Maduka Vishvajith
23 Feb 2023
23 Feb 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An AutoEncoder enhanced light gradient boosting machine method for credit card fraud detection

Abstract

Talk to us

Similar Papers

More From: PeerJ Computer Science