Synthetic Sample Generation for Label Distribution Learning

Manuel González,Julián Luengo,José-Ramón Cano,Salvador García

doi:10.1016/j.ins.2020.07.071

Abstract

Label Distribution Learning (LDL) is a general learning framework that assigns an instance to a distribution over a set of labels rather than a single label or multiple labels. Current LDL methods have proven their effectiveness in many machine learning applications. As of the first formulation of the LDL problem, numerous studies have been carried out that apply the LDL methodology to various real-life problem solving. Others have focused more specifically on the proposal of new algorithms. The purpose of this article is to start addressing the LDL problem as of the data pre-processing stage. The baseline hypothesis is that, due to the high dimensionality of existing LDL data sets, it is very likely that this data will be incomplete and/or that poor data quality will lead to poor performance once applied to the learning algorithms. In this paper, we propose an oversampling method, which creates a superset of the original dataset by creating new instances from existing ones. Then, we apply already existing algorithms to the pre-processed training set in order to validate the effcacy of our method. The effectiveness of the proposed SSG-LDL is verified on several LDL datasets, showing significant improvements to the state-of-the-art LDL methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Synthetic Sample Generation for Label Distribution Learning

Abstract

Talk to us

Similar Papers

More From: Information Sciences

Lead the way for us

Journal: Information Sciences	Publication Date: Aug 4, 2020
Citations: 9

Similar Papers

Decomposition-Fusion for Label Distribution Learning
Manuel González ... Salvador García
Information Fusion | VOL. 66
Manuel González, et. al.Manuel González ... Salvador García
04 Sep 2020
Information Fusion | VOL. 66

ProLSFEO-LDL: Prototype Selection and Label- Specific Feature Evolutionary Optimization for Label Distribution Learning
Manuel González ... Salvador García
Applied Sciences | VOL. 10
Manuel González, et. al.Manuel González ... Salvador García
29 Apr 2020
Applied Sciences | VOL. 10

Logistic Boosting Regression for Label Distribution Learning
Chao Xing ... Xin Geng
-
Chao Xing, et. al.Chao Xing ... Xin Geng
01 Jun 2016
01 Jun 2016

Label Distribution Learning
Xin Geng ... Rongzi Ji
-
Xin Geng, et. al.Xin Geng ... Rongzi Ji
01 Dec 2013
01 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Synthetic Sample Generation for Label Distribution Learning

Abstract

Talk to us

Similar Papers

More From: Information Sciences