A Hybrid Approach Handling Imbalanced Datasets

Paolo Soda

doi:10.1007/978-3-642-04146-4_24

Abstract

Many binary classification problems exhibit an imbalanced class distribution, which affects how the system learns. Traditional machine learning algorithms are biased towards the majority class and therefore produce poor predictive accuracy on the minority class. To overcome this limitation, many approaches have been proposed that build artificially balanced training sets. Beyond their specific drawbacks, these approaches achieve more balanced per-class accuracies at the expense of global accuracy. This paper first reviews the more recent methods for coping with imbalanced datasets and then proposes a strategy that overcomes the main drawbacks of existing approaches. The strategy is based on an ensemble of classifiers trained on balanced subsets of the original imbalanced training set, working in conjunction with a classifier trained on the original imbalanced dataset. The performance of the method has been estimated on six public datasets, showing its effectiveness also in comparison with other approaches. The method also makes it possible to modify the system behaviour according to the operating scenario.

Keywords

Class Distribution; Minority Class; Global Accuracy; Imbalanced Dataset; Minority Class Sample
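The hybrid scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not say how the balanced subsets are drawn, which base learner is used, or how the two sources of predictions are combined. The sketch assumes that the majority class is split into chunks the size of the minority class, that a tiny k-nearest-neighbour classifier serves as the base learner, and that the classifier trained on the full set is trusted only when its neighbours agree to at least a `threshold` fraction, with a majority vote of the balanced classifiers used otherwise. The names `balanced_subsets`, `KNN`, `hybrid_predict`, and `threshold` are all hypothetical.

```python
import random
from collections import Counter


def balanced_subsets(samples, minority_label, seed=0):
    """Pair every minority sample with successive chunks of the majority class.

    Assumption: this partitioning is one plausible way to obtain balanced
    subsets; the abstract does not specify the sampling scheme.
    """
    rng = random.Random(seed)
    minority = [s for s in samples if s[1] == minority_label]
    majority = [s for s in samples if s[1] != minority_label]
    rng.shuffle(majority)
    size = len(minority)
    # Each chunk holds at most `size` majority samples; the last may be smaller.
    return [minority + majority[i:i + size] for i in range(0, len(majority), size)]


class KNN:
    """Tiny k-nearest-neighbour classifier used as a stand-in base learner."""

    def __init__(self, k=3):
        self.k = k

    def fit(self, samples):
        self.samples = list(samples)
        return self

    def vote(self, x):
        """Return (predicted label, fraction of the k neighbours that agree)."""
        def sq_dist(sample):
            return sum((a - b) ** 2 for a, b in zip(x, sample[0]))

        nearest = sorted(self.samples, key=sq_dist)[:self.k]
        label, count = Counter(lab for _, lab in nearest).most_common(1)[0]
        return label, count / len(nearest)


def hybrid_predict(x, full_clf, balanced_clfs, threshold=1.0):
    """Hypothetical combination rule (an assumption, not taken from the source).

    Use the classifier trained on the full imbalanced set when its neighbours
    agree strongly enough; otherwise fall back to a majority vote of the
    classifiers trained on the balanced subsets.
    """
    label, confidence = full_clf.vote(x)
    if confidence >= threshold:
        return label
    votes = Counter(clf.vote(x)[0] for clf in balanced_clfs)
    return votes.most_common(1)[0][0]


# Toy 1-D data: 12 majority samples (label 0) and 3 minority samples (label 1).
train = [((i / 10,), 0) for i in range(12)]
train += [((5.0,), 1), ((5.1,), 1), ((5.2,), 1)]

subsets = balanced_subsets(train, minority_label=1)
full_clf = KNN().fit(train)
balanced_clfs = [KNN().fit(subset) for subset in subsets]

print(len(subsets))                                      # 4 balanced subsets
print(hybrid_predict((0.5,), full_clf, balanced_clfs))   # deep in the majority region
print(hybrid_predict((5.1,), full_clf, balanced_clfs))   # deep in the minority region
```

In this sketch, lowering `threshold` routes more decisions to the classifier trained on the imbalanced set, while raising it shifts decisions towards the balanced ensemble. This is one possible reading of how the system behaviour could be tuned to the operating scenario, not a description of the paper's actual mechanism.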


