Abstract

In data mining, large differences among class distributions in multi-class data, known as the class imbalance problem, are known to hinder classification performance. Unfortunately, existing sampling methods have shown deficiencies: oversampling techniques can cause over-generation and class overlapping, while undersampling techniques can discard significant information. This paper presents three sampling approaches for imbalanced learning: entropy-based oversampling (EOS), entropy-based undersampling (EUS), and entropy-based hybrid sampling (EHS), which combines both oversampling and undersampling. All three approaches are built on a new class imbalance metric, termed the entropy-based imbalance degree (EID), which considers differences in information content between classes rather than the traditional imbalance ratio. Specifically, after evaluating the information influence degree of each instance, EOS balances a data set by generating new instances around difficult-to-learn instances and retaining only the informative ones; EUS removes easy-to-learn instances; and EHS does both simultaneously. Finally, all generated and retained instances are used to train several classifiers. Extensive experiments on synthetic and real-world data sets demonstrate the effectiveness of our approaches.
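To make the idea of an entropy-based imbalance metric concrete, the following sketch contrasts the Shannon entropy of a class-label distribution with perfect balance. Note this is an illustrative, hypothetical stand-in for the paper's EID metric: the abstract does not give EID's actual definition, and the function names here (`class_distribution_entropy`, `entropy_imbalance`) are assumptions for demonstration only.

```python
import math
from collections import Counter

def class_distribution_entropy(labels):
    """Shannon entropy (in bits) of the class-label distribution."""
    counts = Counter(labels)
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

def entropy_imbalance(labels):
    """Illustrative imbalance score in [0, 1]:
    0 = perfectly balanced classes, approaching 1 as one class dominates.
    This is NOT the paper's EID formula, only a plausible entropy-based
    alternative to the plain imbalance ratio."""
    k = len(set(labels))
    if k < 2:
        return 1.0  # a single class is maximally imbalanced
    # Normalize by the maximum possible entropy log2(k)
    return 1.0 - class_distribution_entropy(labels) / math.log2(k)

balanced = [0] * 50 + [1] * 50
skewed = [0] * 95 + [1] * 5
print(entropy_imbalance(balanced))  # 0.0
print(entropy_imbalance(skewed))    # ≈ 0.71
```

Unlike the raw imbalance ratio (majority size divided by minority size), an entropy-based score of this kind reflects the whole multi-class distribution at once, which is closer in spirit to the EID idea described above.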
