Bayesian network-based over-sampling method (BOSME) with application to indirect cost-sensitive learning

Rosario Delgado,J David Núñez-González

doi:10.1038/s41598-022-12682-8

Abstract

Traditional supervised learning algorithms do not satisfactorily solve the classification problem on imbalanced data sets, since they tend to assign the majority class, to the detriment of the minority class classification. In this paper, we introduce the Bayesian network-based over-sampling method (BOSME), which is a new over-sampling methodology based on Bayesian networks. Over-sampling methods handle imbalanced data by generating synthetic minority instances, with the benefit that classifiers learned from a more balanced data set have a better ability to predict the minority class. What makes BOSME different is that it relies on a new approach, generating artificial instances of the minority class following the probability distribution of a Bayesian network that is learned from the original minority classes by likelihood maximization. We compare BOSME with the benchmark synthetic minority over-sampling technique (SMOTE) through a series of experiments in the context of indirect cost-sensitive learning, with some state-of-the-art classifiers and various data sets, showing statistical evidence in favor of BOSME, with respect to the expected (misclassification) cost.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: May 24, 2022
Citations: 4	License type: open-access

R Discovery Prime

R Discovery Prime

Bayesian network-based over-sampling method (BOSME) with application to indirect cost-sensitive learning

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Soil textural class modeling using digital soil mapping approaches: Effect of resampling strategies on imbalanced dataset predictions
Fereshteh Mirzaei ... Ruth Kerry
Geoderma Regional | VOL. 38
Fereshteh Mirzaei, et. al.Fereshteh Mirzaei ... Ruth Kerry
15 Jun 2024
Geoderma Regional | VOL. 38

Performance Evaluation of Sentiment Analysis on Balanced and Imbalanced Dataset Using Ensemble Approach
Shini George ... V Srividhya
Indian Journal of Science and Technology | VOL. 15
Shini George, et. al.Shini George ... V Srividhya
05 May 2022
Indian Journal of Science and Technology | VOL. 15

Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique
Fatihah Mohd ... Noor Maizura Mohamad Noora
-
Fatihah Mohd, et. al.Fatihah Mohd ... Noor Maizura Mohamad Noora
01 Jan 2019
01 Jan 2019

Recursive Ensemble Approach for Incremental Learning of Non-Stationary Imbalanced Data
P R Deshmukh ... Roshani Raut (Ade)
International Journal of Computer Applications | VOL. 98
P R Deshmukh, et. al.P R Deshmukh ... Roshani Raut (Ade)
18 Jul 2014
International Journal of Computer Applications | VOL. 98

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bayesian network-based over-sampling method (BOSME) with application to indirect cost-sensitive learning

Abstract

Talk to us

Similar Papers

More From: Scientific Reports