Abstract

The rising prevalence of digital transactions in contemporary financial systems has heightened the risk of credit card fraud (CCF), necessitating effective detection techniques. Machine learning (ML) technologies show promise, but dataset imbalance impedes fraud detection accuracy. This study investigates the effectiveness of ensemble learning techniques and data balancing methods for detecting CCF in imbalanced datasets. We compare four ML algorithms—Logistic Regression, Decision Tree, Random Forest, and XGBoost—and evaluate data balancing techniques including SMOTE, ADASYN, Random Oversampling, CNN, NCR, and Random Undersampling, along with hybrid methods like SMOTE with NCR and ADASYN with NCR. Our experiments use a dataset of credit card transactions made by European cardholders, which includes 30 attributes and exhibits severe class imbalance, with fraud events accounting for only 0.172% of total transactions. Applying balancing techniques led to significant improvements: Random Oversampling with XGBoost achieved the highest F1-Score of 92.43%, with Precision of 97.383% and Recall of 88.418%. Random Forest with Random Oversampling followed with an F1-Score of 91.554%, and NCR with Random Forest achieved an F1-Score of 91.508%. These findings highlight the effectiveness of oversampling techniques in enhancing model performance for imbalanced datasets. Precision, Recall, and F1-Score were emphasized, while specificity and sensitivity were not directly used; macro averages ensured balanced evaluation across all classes. This study concludes that combining Random Forest or XGBoost classifiers with oversampling techniques like Random Oversampling or undersampling methods such as NCR provides accurate and unbiased results for detecting CCF in imbalanced datasets, contributing to reliable fraud detection systems for real-world applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.