Optimized Stacking Ensemble (OSE) for Credit Card Fraud Detection using Synthetic Minority Oversampling Model

Karen Charly Veigas,Sujatha Arun Kokatnoor,Durga Srilekha Regulagadda

doi:10.17485/ijst/v14i32.807

Karen Charly Veigas, Sujatha Arun Kokatnoor + Show 1 more

Open Access

https://doi.org/10.17485/ijst/v14i32.807

Copy DOI

Journal: Indian Journal of Science and Technology	Publication Date: Aug 27, 2021
Citations: 3	License type: cc-by

Abstract

Objectives: Credit fraud is a global threat to financial institutions due to specific challenges like imbalanced datasets and hidden patterns in real-life scenarios. The objective of this study is to propose a model that effectively identifies fraudulent transactions. Methods: Methods such as Synthetic Minority Oversampling Technique (SMOTE) and Generative Adversarial Networks (GAN) that artificially generate synthetic data are used in this paper to approximate the distribution of data among the two classes in the original dataset. After balancing the dataset, the individual models Multi-Layer Perceptron (MLP), k- Nearest Neighbors algorithm (kNN) and Support Vector Machine (SVM) are trained on the augmented dataset to establish an initial improvement at the data level. These base-classifiers are further incorporated into the Optimized Stacked Ensemble (OSE) learning process to fit the meta-classifier which creates an effective predictive model for fraud detection. All base-classifiers and the final Optimized Stacked Ensemble (OSE) have been implemented to critically assess and evaluate their performances. Findings: Empirical results obtained in this paper show that the quality of the final dataset is considerably improved when Synthetic Minority Oversampling Technique (SMOTE) and Generative Adversarial Networks (GAN) are used as oversampling algorithms. The Multi-Layer Perceptron model showed an increase of 10% in the F1 Score while kNN and SVM showed an increase of 3% each. The optimized model is built using a Stacking Classifier that combines the GAN-improved Multi-Perceptron Model with the other standard classification models such as KNN and SVM. This ensemble outperforms the existing enhanced Multi-Layer Perceptron with near-perfect accuracy (99.86%) and an increase of 16% in F1 Score, resulting in an effective fraud detection mechanism. Novelty: For the current dataset, the Optimized Stacked Ensemble model shows an increase of 16% in F1 Score as compared to the existing Multi-Perceptron model. Keywords: Ensemble; Credit Card; Fraud Detection; GAN; SMOTE; MLP

Highlights

The usage of counterfeit or stolen credit cards is referred to as Credit card fraud and is closely related to the crime of identity theft
Implemented a model based on Multi-Layer Perceptron (MLP) and Generative Adversarial Networks (GAN) to distinguish fraudulent transactions from normal transactions and observed a 10% increase in F1 score when the augmented dataset is tested during experimental study
The study of imbalanced datasets and ensemble learning paradigms is crucial in the field of fraudulent deductions and other similar studies

Summary

Introduction

The usage of counterfeit or stolen credit cards is referred to as Credit card fraud and is closely related to the crime of identity theft Institutions such as banks are responsible for detecting and blocking such kinds of transactions. There are a few stand-alone methods and algorithms, such as anomaly detectors, which show decent accuracy in classifying the non-fraudulent transactions but tend to fail with classifying the fraudulent ones due to the lack of insufficient data [1]. This is tested further in the paper. Since the F1 score takes into account both the recall and precision, it provides the trade-off that is being looked for in this study, and is considered best suited for real-life transactional scenarios

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimized Stacking Ensemble (OSE) for Credit Card Fraud Detection using Synthetic Minority Oversampling Model

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Indian Journal of Science and Technology

Lead the way for us

Similar Papers

A New Integrated Approach for Landslide Data Balancing and Spatial Prediction Based on Generative Adversarial Networks (GAN)
Husam A H Al-Najjar ... Raju Sarkar
Remote Sensing | VOL. 13
Husam A H Al-Najjar, et. al.Husam A H Al-Najjar ... Raju Sarkar
07 Oct 2021
Remote Sensing | VOL. 13

WiP: Generative Adversarial Network for Oversampling Data in Credit Card Fraud Detection
Akhilesh Kumar Gangwar ... Vadlamani Ravi
-
Akhilesh Kumar Gangwar, et. al.Akhilesh Kumar Gangwar ... Vadlamani Ravi
01 Jan 2019
01 Jan 2019

Machine-Learning Approach Using SAR Data for the Classification of Oil Palm Trees That Are Non-Infected and Infected with the Basal Stem Rot Disease
Izrahayu Che Hashim ... Farrah Melissa Muharam
Agronomy | VOL. 11
Izrahayu Che Hashim, et. al.Izrahayu Che Hashim ... Farrah Melissa Muharam
12 Mar 2021
Agronomy | VOL. 11

Prediction of Myocardial Infarction Using a Combined Generative Adversarial Network Model and Feature-Enhanced Loss Function.
Shixiang Yu ... Karsten Suhre
Metabolites | VOL. 14
Shixiang Yu, et. al.Shixiang Yu ... Karsten Suhre
30 Apr 2024
Metabolites | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimized Stacking Ensemble (OSE) for Credit Card Fraud Detection using Synthetic Minority Oversampling Model

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Indian Journal of Science and Technology