Transfer synthetic over-sampling for class-imbalance learning with limited minority class data

Xu-Ying Liu, Min-Ling Zhang, Sheng-Tao Wang

doi:10.1007/s11704-018-7182-1

Abstract

The problem of limited minority class data arises in many class-imbalanced applications but has received little attention. Synthetic over-sampling, a popular family of class-imbalance learning methods, can introduce considerable noise when the minority class has limited data, because the synthetic samples are not i.i.d. samples of the minority class. Most sophisticated synthetic sampling methods tackle this problem by denoising or by generating samples that are more consistent with the ground-truth data distribution, but their assumptions about the true noise or the ground-truth distribution may not hold. To adapt synthetic sampling to limited minority class data, the proposed Traso framework treats synthetic minority class samples as an additional data source and uses transfer learning to transfer knowledge from them to the minority class. As an implementation, the TrasoBoost method first generates synthetic samples to balance the class sizes. Then, in each boosting iteration, the weight of a misclassified synthetic sample decreases and the weight of a misclassified original instance increases; the weights of correctly classified instances remain unchanged. Misclassified synthetic samples are potential noise and thus have less influence in subsequent iterations. In addition, the weights of minority class instances change more than those of majority class instances, which makes the minority instances more influential. Only the original data are used to estimate the error rate, so that the estimate is immune to noise from the synthetic samples. Finally, since the synthetic samples are highly related to the minority class, all of the weak learners are aggregated for prediction. Experimental results show that TrasoBoost outperforms many popular class-imbalance learning methods.
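
The reweighting rules described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual update formulas, so everything numeric here is an assumption: the AdaBoost-style growth factor `(1 - eps) / eps`, the clamp on the error rate, and the hypothetical parameters `synth_decay` and `minority_boost` are chosen only to show the direction of each change, not to reproduce TrasoBoost.

```python
def original_error_rate(weights, is_synthetic, misclassified):
    """Weighted error measured on original (non-synthetic) instances only."""
    total = sum(w for w, s in zip(weights, is_synthetic) if not s)
    wrong = sum(w for w, s, m in zip(weights, is_synthetic, misclassified)
                if m and not s)
    return wrong / total


def update_weights(weights, is_synthetic, is_minority, misclassified,
                   error_rate, synth_decay=0.8, minority_boost=1.5):
    """One illustrative reweighting step (weights are not renormalized here).

    synth_decay and minority_boost are hypothetical knobs for this sketch,
    not values taken from the paper.
    """
    # Assumption: keep the error rate strictly inside (0, 0.5) so that the
    # growth factor for original data is always greater than 1.
    eps = min(max(error_rate, 1e-10), 0.499)
    grow = (1.0 - eps) / eps
    updated = []
    for w, synth, minority, wrong in zip(weights, is_synthetic,
                                         is_minority, misclassified):
        if not wrong:
            updated.append(w)  # correctly classified: weight unchanged
            continue
        # Misclassified synthetic samples shrink; misclassified originals grow.
        factor = synth_decay if synth else grow
        if minority:
            # Minority instances change more, in whichever direction applies.
            factor = factor ** minority_boost
        updated.append(w * factor)
    return updated
```

In a full boosting loop these two functions would be called once per iteration, after a weak learner has been trained on the combined original and synthetic data; the final prediction would then aggregate all weak learners, as the abstract states.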

Journal: Frontiers of Computer Science
Publication Date: Jun 17, 2019
Citations: 11

Similar Papers

MWMOTE--Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning
Sukarna Barua ... Xin Yao
IEEE Transactions on Knowledge and Data Engineering | VOL. 26
01 Feb 2014

Creating synthetic minority class samples based on autoencoder extreme learning machine
Yu-Lin He ... Joshua Zhexue Huang
Pattern Recognition | VOL. 121
20 Jul 2021

Phishing Website Detection Based on Hybrid Resampling KMeansSMOTENCR and Cost-Sensitive Classification
Jaya Srivastava ... Aditi Sharan
01 Jan 2023

An overlapping minimization-based over-sampling algorithm for binary imbalanced classification
Xuan Lu ... Yingchao Cheng
Engineering Applications of Artificial Intelligence | VOL. 133
26 Feb 2024
