A boosting resampling method for regression based on a conditional variational autoencoder

Yang Huang,Duen-Ren Liu,Shin-Jye Lee,Chia-Hao Hsu,Yang-Guang Liu

doi:10.1016/j.ins.2021.12.100

Abstract

Resampling is the most commonly used method for dealing with imbalanced data, in addition to modifying the algorithm mechanism, it can, for example, generate new minority samples or reduce majority samples to adjust the data distribution. However, to date, related research has predominantly focused on solving the classification problem, while the issue of imbalanced regression data has rarely been discussed. In real-world applications, predicting regression data is a common and valuable issue in decision making, especially in regard to those rare samples with extremely high or low values, such as those encountered in the fields of signal processing, finance, or meteorology. This study therefore divided its regression data into rare samples and normal samples, with self-defined relevance functions and, in addition, proposed a boosting resampling method based on a conditional variational autoencoder. The experimental results showed that when using the proposed resampling method was employed, the prediction performance of the whole testing data set was slightly increased, while the performance for the rare samples was significantly improved.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A boosting resampling method for regression based on a conditional variational autoencoder

Abstract

Talk to us

Similar Papers

More From: Information Sciences

Lead the way for us

Journal: Information Sciences	Publication Date: Jan 6, 2022
Citations: 11

Similar Papers

Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks
Yuchong Yao ... Xiaohui Wang
-
Yuchong Yao, et. al.Yuchong Yao ... Xiaohui Wang
13 Oct 2022
13 Oct 2022

An Integrated Resampling Methods for Imbalanced Sporadic Temporal Data in EHRs
Qi Ye ... Tong Ruan
-
Qi Ye, et. al.Qi Ye ... Tong Ruan
09 Dec 2021
09 Dec 2021

Comparison of evaluation metrics of deep learning for imbalanced imaging data in osteoarthritis studies
Shen Liu ... Xiaoxiao Sun
Osteoarthritis and cartilage | VOL. 31
Shen Liu, et. al.Shen Liu ... Xiaoxiao Sun
19 May 2023
Osteoarthritis and cartilage | VOL. 31

Data Level Approach for Multiclass Imbalance Financial Data
Nursel Selver Ruzgar ... Clare Chua
WSEAS TRANSACTIONS ON COMPUTERS | VOL. 19
Nursel Selver Ruzgar, et. al.Nursel Selver Ruzgar ... Clare Chua
27 Oct 2020
WSEAS TRANSACTIONS ON COMPUTERS | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A boosting resampling method for regression based on a conditional variational autoencoder

Abstract

Talk to us

Similar Papers

More From: Information Sciences