Sampling strategies for handling data imbalance problem: An Extensive Review

Bhaskar Kumar Veedhi, Debahuti Mishra, Kaberi Das

doi:10.47974/jsms-957

Abstract

Imbalanced data classification is a major issue in data mining. Researchers have proposed many solutions to the imbalanced data problem, which fall broadly into two categories: data-level and algorithm-level methods. Data-level methods adjust the class distribution, while algorithm-level methods create a new algorithm or modify an existing one. The imbalanced data classification problem can be addressed by sampling, random oversampling, random undersampling, resampling, and SMOTE (Synthetic Minority Oversampling Technique). Resampling includes k-means clustering, density-based clustering, neural networks, and ensemble methods. However, no single algorithm or method can remove bias in data classification on its own, so sampling methods need to be integrated with kernel methods or with boosting methods, or kernel-based methods with Support Vector Machines (SVM), to reach the desired accuracy and performance. The main objective of this paper is to review sampling strategies based on sampling and resampling methods and to improve the understanding of learning from class-imbalanced data. It also explains the objectives of the models used by several researchers and highlights their performance and outcomes.
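As a rough illustration of the data-level methods named in the abstract, the following sketch shows random oversampling, random undersampling, and a simplified SMOTE-style interpolation using only the Python standard library. The function names, the toy data, and the parameters (`k`, `seed`, `n_new`) are assumptions made for this example and do not come from the paper; the `smote_like` function is a simplified stand-in for SMOTE, not a reference implementation.

```python
import random
from collections import Counter


def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen rows until every class matches the largest class."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        rows = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(rng.choice(rows))
            y_out.append(label)
    return X_out, y_out


def random_undersample(X, y, seed=0):
    """Keep a random subset of each class, sized to the smallest class."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = min(counts.values())
    X_out, y_out = [], []
    for label in counts:
        rows = [x for x, lab in zip(X, y) if lab == label]
        for x in rng.sample(rows, target):
            X_out.append(x)
            y_out.append(label)
    return X_out, y_out


def smote_like(minority, n_new, k=2, seed=0):
    """Simplified SMOTE-style step (assumes at least two minority points):
    interpolate between a minority point and one of its k nearest
    minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Other minority points, sorted by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, neighbour)))
    return synthetic


# Toy data (hypothetical): six majority points (class 0), two minority points (class 1).
X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (0.3, 0.3),
     (0.2, 0.4), (0.4, 0.2), (2.0, 2.0), (2.2, 2.1)]
y = [0, 0, 0, 0, 0, 0, 1, 1]

X_over, y_over = random_oversample(X, y)
X_under, y_under = random_undersample(X, y)
minority = [x for x, lab in zip(X, y) if lab == 1]
new_points = smote_like(minority, n_new=4, k=1)

print(Counter(y_over), Counter(y_under), len(new_points))
```

Oversampling keeps all original rows but repeats minority examples, undersampling discards majority examples, and the SMOTE-style step generates new minority points on the line segments between existing ones rather than copying them.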


More From: Journal of Statistics & Management Systems

Similar Papers

K-Segments Under Bagging approach: An experimental Study on Extremely Imbalanced Data Classification
Tuan Tran ... Loc Tran
01 Sep 2019

An Imbalanced Data Classification Algorithm of De-noising Auto-Encoder Neural Network Based on SMOTE
Chenggang Zhang ... M.J.E Salami
MATEC Web of Conferences | VOL. 56
01 Jan 2015

Classification of Imbalanced Data Using SMOTE and AutoEncoder Based Deep Convolutional Neural Network
Suja A Alex ... J Jesu Vedha Nayahi
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems | VOL. 31
01 Jun 2023

Exploiting Domain Knowledge to Address Class Imbalance in Meteorological Data Mining
Evangelos Tsagalidis ... Georgios Evangelidis
Applied Sciences | VOL. 12
04 Dec 2022
