Single-Point Crossover and Jellyfish Optimization for Handling Imbalanced Data Classification Problem

Abeer S Desuky,Asmaa Hekal Omar,Yomna M Elbarawy,Sadiq Hussain,Samina Kausar

doi:10.1109/access.2022.3146424

Abeer S Desuky, Asmaa Hekal Omar + Show 3 more

Open Access

https://doi.org/10.1109/access.2022.3146424

Copy DOI

Abstract

The imbalanced datasets and their classification has pulled in as a hot research topic over the years. It is used in different fields, for example, security, finance, health, and many others. The imbalanced datasets are balanced by applying resampling and various solutions are designed to tackle such datasets that mainly focus on class distribution issues. The imbalanced data is rebalanced using these methods. This paper introduces a technique for balancing data through two stages: first, oversampling methods are utilized in the process of rebalancing such imbalanced dataset using the single-point crossover to generate the new data of minority classes, second, it searches for an optimal subset of the imbalanced and balanced datasets by Jellyfish Search (JS) which is an optimization method. Experiments are performed on 18 real imbalanced datasets, and results are compared with famous oversampling methods and the recently published ACOR (Ant Colony Optimization Resampling) method in terms of different appraisal measurements. Higher performance is recorded by the proposed method and comparability with well-known and recent techniques.

Highlights

M ACHINE learning (ML) techniques play a vital role in gaining insights from the data in different repositories that are growing exponentially
Three approaches are suggested for such techniques (1) Data level methods focus on sampling the instances of majority and minority classes for balancing the distribution; (2) Algorithm-level techniques focus on adapting current learners to mitigate their prejudice against the class of majority; (3) Hybrid approaches which consist of the advantages of the two above-mentioned types
THE PROPOSED METHOD The proposed methods based mainly on single-point crossover and jellyfish search to overcome class imbalance classification by resampling the training data. They primarily consist of two stages: first, it rebalances an imbalanced dataset by oversampling algorithm using the single-point crossover to generate the new data of minority classes, second, it finds an optimal subset of the balanced and imbalanced dataset by jellyfish search

Summary

INTRODUCTION

M ACHINE learning (ML) techniques play a vital role in gaining insights from the data in different repositories that are growing exponentially. We introduce in this paper two proposed methods to rebalance an imbalanced dataset by oversampling algorithm using the single-point crossover to generate a new data of minority class and using JS to select the optimal instances from the training set before and after oversampling comparing the results to select the best method to handle the class imbalance problem. THE PROPOSED METHOD The proposed methods based mainly on single-point crossover and jellyfish search to overcome class imbalance classification by resampling the training data They primarily consist of two stages: first, it rebalances an imbalanced dataset by oversampling algorithm using the single-point crossover to generate the new data of minority classes, second, it finds an optimal subset of the balanced and imbalanced dataset by jellyfish search.

EXPRIMENTAL RESULTS

ANALYSIS AND DISCUSSION

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Single-Point Crossover and Jellyfish Optimization for Handling Imbalanced Data Classification Problem

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

A comparative study on noise filtering of imbalanced data sets
Szilvia Szeghalmy ... Attila Fazekas
Knowledge-Based Systems | VOL. 301
Szilvia Szeghalmy, et. al.Szilvia Szeghalmy ... Attila Fazekas
01 Jul 2024
Knowledge-Based Systems | VOL. 301

Soil textural class modeling using digital soil mapping approaches: Effect of resampling strategies on imbalanced dataset predictions
Fereshteh Mirzaei ... Ruth Kerry
Geoderma Regional | VOL. 38
Fereshteh Mirzaei, et. al.Fereshteh Mirzaei ... Ruth Kerry
15 Jun 2024
Geoderma Regional | VOL. 38

Performance Evaluation of Sentiment Analysis on Balanced and Imbalanced Dataset Using Ensemble Approach
Shini George ... V Srividhya
Indian Journal of Science and Technology | VOL. 15
Shini George, et. al.Shini George ... V Srividhya
05 May 2022
Indian Journal of Science and Technology | VOL. 15

Imbalance Learning and Its Application on Medical Datasets
Yachao Shao
-
Yachao ShaoYachao Shao
21 Feb 2022
21 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Single-Point Crossover and Jellyfish Optimization for Handling Imbalanced Data Classification Problem

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access