Bank fraud is an increasingly prevalent issue, causing substantial financial losses for both financial institutions and customers. Also, with the increasing prevalence of bank fraud, especially facilitated by online banking and digital payments, there is a pressing need for potential expertise deployment and advanced fraud detection techniques in real-world banking systems. This study specified the Bayesian Network models for detecting fraudulent bank transactions. Bayesian Networks offer a probabilistic graphical modeling approach that can effectively capture complex relationships and dependencies within financial data. Thus, the research aimed at developing a custom Bayesian network model trained on a large dataset of bank transactions comprising over one million bank transactions to classify instances as fraudulent or non-fraudulent. Python 3.11.10 was used for the graphical representations. Down-sampling was employed to reduce the dataset from its initial size of 1 million observations to 17,650 observations, ensuring a balanced representation of both fraud and non-fraud instances. Various estimation techniques: Maximum Likelihood, Bayesian Networks, and Expectation Maximization were employed and evaluated to learn the model parameters. The model's performance was measured using metrics: accuracy, precision, recall, F1-score, and ROC-AUC. The Bayesian Networks estimator achieved an overall accuracy of 66.18%, precision of 67.73%, F1-score of 64.06%, ROC-AUC of 66.13%, Recall of 60.77%, Sensitivity of 60.77% and Specificity of 71.50%. Also, the Maximum Likelihood achieved an overall Accuracy of 66.77%, Precision of 69.22%, F1-score of 63.96%, ROC-AUC of 66.71%, Recall of 59.45%, Sensitivity of 59.45% and Specificity of 73.97%. Likewise, the Expectation Maximization achieved an overall Accuracy of 66.83%, Precision of 69.31%, F1-score of 64.00%, ROC-AUC of 66.77%, Recall of 59.45%, Sensitivity of 59.45% and Specificity of 74.09%. On the confusion matrix, the model correctly classified 1272 instances as Non-Fraudulent transactions (True Negative). Also, it was observed that the model incorrectly classified 507 instances as fraudulent transactions when they were Non-Fraudulent (False Positive). The model similarly incorrectly classified 687 instances as Non-Fraudulent transactions when they were Fraudulent (False Negative). Finally, it correctly classified 1064 instances as fraudulent transactions (True Positive). These results demonstrated the Bayesian Network's ability to identify fraudulent transactions while minimizing false alarms accurately. The findings highlight the potential of Bayesian Networks as a robust framework for fraud detection in the banking sector, contributing to enhanced security and reduced financial losses. The developed model can be introduced into existing banking systems to strengthen fraud prevention strategies.
Read full abstract