Abstract

Learning from imbalanced data is an important and challenging topic in machine learning. Many works have devised methods to cope with imbalanced data, but most consider only the minority or the majority class without modeling the relationship between the two. In addition, many methods based on the synthetic minority oversampling technique (SMOTE) generate synthetic samples in the original feature space and use the Euclidean distance to search for nearest neighbors; however, the Euclidean distance is not a reliable distance metric in high-dimensional spaces. This article proposes a novel method, called deep density hybrid sampling (DDHS), to address imbalanced data problems. The proposed method learns an embedding network that projects data samples into a low-dimensional, separable latent space. The goal is to preserve class proximity during projection, and we devise loss functions based on within-class and between-class concepts. We propose density as the criterion for selecting minority and majority samples. Subsequently, we apply a feature-level approach to the selected minority samples to generate diverse and valid synthetic samples for the minority class. We conduct extensive experiments to assess the proposed method against several existing methods, and the results show that it yields promising and stable performance. Because DDHS is a data-level algorithm, we further combine it with the boosting technique to develop DDHS-boosting; compared with several ensemble methods, DDHS-boosting also shows promising results.
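To make the described pipeline concrete, the following is a minimal Python/PyTorch sketch of the four steps the abstract names: an embedding network, a within-class/between-class loss, density-based sample selection, and feature-level synthesis. All specifics here (layer sizes, latent dimension, the exact loss form, the neighborhood size k, and the keep ratio) are illustrative assumptions, not the paper's actual design.

```python
# Illustrative sketch of a DDHS-style pipeline; hyperparameters and
# architecture are assumptions for demonstration only.
import torch
import torch.nn as nn

class EmbeddingNet(nn.Module):
    """Projects samples into a low-dimensional latent space."""
    def __init__(self, in_dim, latent_dim=2):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 64),
            nn.ReLU(),
            nn.Linear(64, latent_dim),
        )

    def forward(self, x):
        return self.net(x)

def within_between_loss(z, y):
    """Encourage class proximity: small within-class scatter around each
    class centroid, large distances between class centroids."""
    classes = torch.unique(y)
    centroids = torch.stack([z[y == c].mean(dim=0) for c in classes])
    within = torch.stack([
        ((z[y == c] - centroids[i]) ** 2).sum(dim=1).mean()
        for i, c in enumerate(classes)
    ]).mean()
    between = torch.pdist(centroids).mean()
    return within - between  # minimize scatter, maximize separation

def density_select(z, keep_ratio=0.8, k=5):
    """Density-based selection: keep the samples whose mean distance to
    their k nearest latent-space neighbors is smallest."""
    dists = torch.cdist(z, z)
    knn = dists.topk(k + 1, largest=False).values[:, 1:]  # drop self
    n_keep = max(1, int(keep_ratio * len(z)))
    return knn.mean(dim=1).topk(n_keep, largest=False).indices

def synthesize(z_min, n_new):
    """Feature-level generation: mix latent features of random pairs of
    selected minority samples, coordinate by coordinate."""
    i = torch.randint(0, len(z_min), (n_new,))
    j = torch.randint(0, len(z_min), (n_new,))
    mask = torch.rand(n_new, z_min.shape[1])
    return mask * z_min[i] + (1 - mask) * z_min[j]

# Toy usage on synthetic data: train the embedding, select dense minority
# samples, then oversample until the classes are balanced.
X = torch.randn(200, 10)
y = (torch.rand(200) < 0.2).long()  # roughly 20% minority class
net = EmbeddingNet(in_dim=10)
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
for _ in range(200):
    opt.zero_grad()
    within_between_loss(net(X), y).backward()
    opt.step()
with torch.no_grad():
    z = net(X)
z_min = z[y == 1][density_select(z[y == 1])]
z_new = synthesize(z_min, n_new=int((y == 0).sum() - (y == 1).sum()))
```

Under these assumptions, a downstream classifier would be trained in the latent space on the original embeddings plus z_new; a DDHS-boosting variant would repeat this resampling inside each boosting round.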
