Abstract
Traditional classification tasks suffer from the class imbalance problem, in which some classes far outnumber others. To address this issue, existing class-imbalanced learning (CIL) methods either preprocess class-imbalanced datasets or adapt traditional classification algorithms to the imbalanced class distribution. Inspired by the idea of transductive learning, we propose a post-processing framework, called PPF, for CIL. Distinct from existing CIL methods, PPF directly adjusts the predicted labels of test data to fit the imbalanced class distribution. Specifically, we relabel some test data according to their prediction probabilities so that the class proportions of the test data are close to those of the training data. The underlying assumption is that training and test data, drawn independently from the same data space, should obey the same class distribution. Furthermore, we propose a Compact Prototype-based Nearest Neighbor (CPNN) algorithm to assist the original classifier with the adjustment. Instead of training a classifier, CPNN classifies test data according to their distances to a set of prototypes estimated from labeled data. It is therefore computationally simple and relatively robust to class imbalance. As a general framework, PPF can be easily applied to both traditional classification and CIL algorithms. To validate its effectiveness, we conducted extensive experiments on a variety of class-imbalanced datasets, using SVM and C4.5 as the original classifiers. Measured by F-measure, G-mean, and AUC, both PPF-SVM and PPF-C4.5 outperform 10 state-of-the-art CIL algorithms, and PPF further improves the performance of 10 CIL algorithms when applied on top of them.
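The relabeling step described in the abstract can be sketched for the binary case as follows. This is a minimal illustrative sketch, not the paper's exact procedure: the function name `ppf_adjust` and the choice to flip the least-confident predictions of the over-represented class are assumptions made for illustration.

```python
import numpy as np

def ppf_adjust(probs, train_proportions):
    """Illustrative PPF-style post-processing for binary classification.

    probs: (n, 2) array of predicted class probabilities for test data.
    train_proportions: (2,) array of class proportions in the training data.

    Relabels the least-confident predictions of the over-represented class
    so the test label proportions approach the training proportions.
    """
    labels = probs.argmax(axis=1)          # initial hard predictions
    n = len(labels)
    target = np.round(train_proportions * n).astype(int)
    counts = np.bincount(labels, minlength=2)

    # Decide which class is predicted more often than training suggests
    # (assumption: only the majority side of the mismatch is relabeled).
    over = int(counts[1] > target[1])      # 1 if class 1 is over-predicted
    under = 1 - over
    excess = counts[over] - target[over]

    if excess > 0:
        # Test points currently assigned to the over-represented class,
        # ordered from least to most confident in that class.
        idx = np.where(labels == over)[0]
        order = idx[np.argsort(probs[idx, over])]
        labels[order[:excess]] = under     # flip the least-confident ones
    return labels
```

For example, if a classifier assigns class 1 to half of ten test points while the training data contains only 20% class-1 examples, the sketch flips the three least-confident class-1 predictions back to class 0, bringing the test proportions to 8:2.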