Imbalanced Learning with Oversampling based on Classification Contribution Degree

Zhenhao Jiang,Yan Liu,Jie Yang

doi:10.1002/adts.202100031

Abstract

AbstractImbalanced datasets exist commonly in the real world, which leads to poor performance of general machine learning models because of skewed class distribution. To address the data‐imbalance problem, a novel oversampling method based on classification contribution degree, called OS‐CCD is presented. First a new concept, classification contribution degree, is established based on micro and macro information extracted from raw datasets. With the classification contribution degree, OS‐CCD enables positive samples near the class boundary and located in an area with high density of positive samples to generate more synthetic samples than others. Furthermore, the neighbor selection for oversampling is no longer random but in the light of a selected probability. Experimental results on 12 benchmark datasets substantiate that four commonly used classifiers with the oversampling method outperform those with six popular oversampling methods in terms of accuracy, F1‐score and AUC.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Imbalanced Learning with Oversampling based on Classification Contribution Degree

Abstract

Talk to us

Similar Papers

More From: Advanced Theory and Simulations

Lead the way for us

Journal: Advanced Theory and Simulations	Publication Date: Mar 26, 2021
Citations: 8

Similar Papers

DSPOTE: Density-induced Selection Probability-based Oversampling TEchnique for Imbalanced Learning
Zhen Wei ... Li Zhang
-
Zhen Wei, et. al.Zhen Wei ... Li Zhang
21 Aug 2022
21 Aug 2022

A Weakly Supervised Learning-Based Oversampling Framework for Class-Imbalanced Fault Diagnosis
Min Qian ... Yan-Fu Li
IEEE Transactions on Reliability | VOL. 71
Min Qian, et. al.Min Qian ... Yan-Fu Li
01 Mar 2022
IEEE Transactions on Reliability | VOL. 71

A Robust Oversampling Approach for Class Imbalance Problem with Small Disjuncts
Yi Sun ... Junlin Xu
IEEE Transactions on Knowledge and Data Engineering | VOL. -
Yi Sun, et. al.Yi Sun ... Junlin Xu
01 Jan 2021
IEEE Transactions on Knowledge and Data Engineering | VOL. -

A New Oversampling Method Based on the Classification Contribution Degree
Zhenhao Jiang ... Jie Yang
Symmetry | VOL. 13
Zhenhao Jiang, et. al.Zhenhao Jiang ... Jie Yang
26 Jan 2021
Symmetry | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Imbalanced Learning with Oversampling based on Classification Contribution Degree

Abstract

Talk to us

Similar Papers

More From: Advanced Theory and Simulations