Multi-class Imbalanced Learning Research Articles

Multi-label learning has garnered much research interest due to its wide range of real-world applications. Many multi-label learning methods have been proposed; however, few have addressed the class imbalance problem existing in multi-label data. Even though some studies have taken this issue into account, most of them have ignored the label correlations or only considered random correlations between them. In this study, we propose a novel partition-based imbalanced multi-label learning algorithm, named Multi-label Learning based on Hierarchical Clustering (MLHC), to tackle this problem. MLHC first carries out hierarchical clustering on the original label space to divide it into several disconnected subspaces, each of which contains several labels that are strongly correlated with each other. Then, for each label subspace, we use the problem transformation strategy to convert it into a multi-class problem by binary coding. Any multi-class imbalance learning algorithm can be applied to the transformed multi-class data. Finally, the classification results will be decoded to retrieve the corresponding label subspace, and all label subspace results are combined to show the predicted label vector in the original label space. We conducted experiments not only on thirteen benchmark multi-label datasets but also carried out them on XJTU-SY which is a multi-label engineering application dataset, and the results indicated that our proposed MLHC learning algorithm outperforms several state-of-the-art class imbalance multi-label learning algorithms, demonstrating the effectiveness and necessity of discovering label correlations and transforming the original imbalanced multi-label learning problem into multiple strongly correlated multi-class imbalanced learning problems.

Read full abstract

Imbalanced distribution of instances across the classes is a challenging issue when the underlying problem is of type classification. The reason is that classifiers will tend to favor the classes with a large number of instances i.e. instances of minority classes may be identified as instances of majority classes by the classifiers. In recent years, plenty of researches have been done to resolve the class imbalance issue in binary classification problems which resulted in many class imbalance learning techniques for binary classification problems. But, the class imbalance in multi-class classification problems did not draw much attention from the research community. Unlike binary class imbalance learning, multi-class imbalance learning techniques experience more than one majority class and more than one minority class. This paper tries to come up with a multi-class imbalanced learning technique that can overcome the effects of multi-class imbalance problem in review rating prediction tasks. The proposed model handles the multi-class imbalance issue by using the combination of hybrid sampling and ensemble learning techniques. Sampling techniques such as Random Under Sampling (RUS) and Synthetic Minority Over-sampling TEchnique(SMOTE) are jointly used in the proposed model to create balanced training sets for base learners. Also, the proposed model creates a powerful ensemble structure by amalgamating a manually created bagging ensemble and AdaBoost boosting ensembles. Experiments are done using the Amazon product dataset in order to investigate the performance of the proposed model. The experimental results show that the proposed Class Imbalance-Aware Review rating prediction(CIAR) model outperforms almost all the baseline models in-terms of G-mean, F-Score, and ROC_AUC_Score.

Read full abstract

Multi-class Imbalanced Learning Research Articles

Related Topics

Articles published on Multi-class Imbalanced Learning

Clustering-Based Oversampling Algorithm for Multi-class Imbalance Learning

To Combat Multiclass Imbalanced Problems by Aggregating Evolutionary Hierarchical Classifiers.

A partition-based problem transformation algorithm for classifying imbalanced multi-label data

Novel hybrid classification model for multi-class imbalanced lithology dataset

Double-kernel based class-specific broad learning system for multiclass imbalance learning

One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning

SA-CGAN: An oversampling method based on single attribute guided conditional GAN for multi-class imbalanced learning

A hybrid multi-class imbalanced learning method for predicting the quality level of diesel engines

Classifier Selection and Ensemble Model for Multi-class Imbalance Learning in Education Grants Prediction

SMOTE-Based Weighted Deep Rotation Forest for the Imbalanced Hyperspectral Data Classification

A class imbalance-aware review rating prediction using hybrid sampling and ensemble learning

Boosting methods for multi-class imbalanced data classification: an experimental review

DyS-IENN: a novel multiclass imbalanced learning method for early warning of tardiness in rocket final assembly process

Data-driven rated power prediction of diesel engines using improved multi-class imbalanced learning method

Multiclass imbalanced learning with one-versus-one decomposition and spectral clustering

Imbalanced Hyperspectral Image Classification With an Adaptive Ensemble Method Based on SMOTE and Rotation Forest With Differentiated Sampling Rates

Dynamic Ensemble Selection and Data Preprocessing for Multi-Class Imbalance Learning

Dynamic Synthetic Minority Over-Sampling Technique-Based Rotation Forest for the Classification of Imbalanced Hyperspectral Data

Evolutionary inversion of class distribution in overlapping areas for multi-class imbalanced learning

Generalized class-specific kernelized extreme learning machine for multiclass imbalanced learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-class Imbalanced Learning Research Articles

Related Topics

Articles published on Multi-class Imbalanced Learning

Clustering-Based Oversampling Algorithm for Multi-class Imbalance Learning

To Combat Multiclass Imbalanced Problems by Aggregating Evolutionary Hierarchical Classifiers.

A partition-based problem transformation algorithm for classifying imbalanced multi-label data

Novel hybrid classification model for multi-class imbalanced lithology dataset

Double-kernel based class-specific broad learning system for multiclass imbalance learning

One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning

SA-CGAN: An oversampling method based on single attribute guided conditional GAN for multi-class imbalanced learning

A hybrid multi-class imbalanced learning method for predicting the quality level of diesel engines

Classifier Selection and Ensemble Model for Multi-class Imbalance Learning in Education Grants Prediction

SMOTE-Based Weighted Deep Rotation Forest for the Imbalanced Hyperspectral Data Classification

A class imbalance-aware review rating prediction using hybrid sampling and ensemble learning

Boosting methods for multi-class imbalanced data classification: an experimental review

DyS-IENN: a novel multiclass imbalanced learning method for early warning of tardiness in rocket final assembly process

Data-driven rated power prediction of diesel engines using improved multi-class imbalanced learning method

Multiclass imbalanced learning with one-versus-one decomposition and spectral clustering

Imbalanced Hyperspectral Image Classification With an Adaptive Ensemble Method Based on SMOTE and Rotation Forest With Differentiated Sampling Rates

Dynamic Ensemble Selection and Data Preprocessing for Multi-Class Imbalance Learning

Dynamic Synthetic Minority Over-Sampling Technique-Based Rotation Forest for the Classification of Imbalanced Hyperspectral Data

Evolutionary inversion of class distribution in overlapping areas for multi-class imbalanced learning

Generalized class-specific kernelized extreme learning machine for multiclass imbalanced learning