The Application of Repeated SMOTE for Multi Class Classification on Imbalanced Data

Muhammad Ibnu Choldun Rachmatullah

doi:10.30812/matrik.v22i1.1803

Abstract

One of the problems that are often faced by classifier algorithms is related to the problem of imbalanced data. One of the recommended improvement methods at the data level is to balance the number of data in different classes by enlarging the sample to the minority class (oversampling), one of which is called The Synthetic Minority Oversampling Technique (SMOTE). SMOTE is commonly used to balance data consisting of two classes. In this research, SMOTE was used to balance multi-class data. The purpose of this research is to balance multi-class data by applying SMOTE repeatedly. This iterative process needs to be applied if the number of unbalanced data classes is more than two classes, because the one-time SMOTE process is only suitable for binary classification or the number of unbalanced data classes is only one class. To see the performance of iterative SMOTE, the SMOTE datasets were classified using a neural network, k-NN, Nave Bayes, and Random Forest and the performance measures were measured in terms of accuracy, sensitivity, and specificity. The experiment in this research used the Glass Identification dataset which had six classes, and the SMOTE process was repeated five times. The best performance was achieved by the Random Forest classifier method with accuracy = 86.27%, sensitivity = 86.18%, and specificity = 95.82%. The result of experiment present that repeated SMOTE results can increase the performance of classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer	Publication Date: Nov 30, 2022
Citations: 1	License type: CC BY-SA 4.0

R Discovery Prime

R Discovery Prime

The Application of Repeated SMOTE for Multi Class Classification on Imbalanced Data

Abstract

Talk to us

Similar Papers

More From: MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer

Lead the way for us

Similar Papers

BES-Optimized SMOTE Variant to Improve Dataset Scaling for Enhanced Privacy-Preserving Classification
Vijayendra S.Gaikwad
Panamerican Mathematical Journal | VOL. 34
Vijayendra S.Gaikwad Vijayendra S.Gaikwad
07 Oct 2024
Panamerican Mathematical Journal | VOL. 34

SMOTE-LOF for noise identification in imbalanced data classification
Asniar ... Kridanto Surendro
Journal of King Saud University - Computer and Information Sciences | VOL. 34
Asniar, et. al. Asniar ... Kridanto Surendro
09 Feb 2021
Journal of King Saud University - Computer and Information Sciences | VOL. 34

SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary
Alberto Fernandez ... Salvador Garcia
Journal of Artificial Intelligence Research | VOL. 61
Alberto Fernandez, et. al.Alberto Fernandez ... Salvador Garcia
20 Apr 2018
Journal of Artificial Intelligence Research | VOL. 61

Prediction of Chronic Kidney Disease - A Machine Learning Perspective
Pankaj Chittora ... Prasun Chakrabarti
IEEE Access | VOL. 9
Pankaj Chittora, et. al.Pankaj Chittora ... Prasun Chakrabarti
01 Jan 2020
IEEE Access | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Application of Repeated SMOTE for Multi Class Classification on Imbalanced Data

Abstract

Talk to us

Similar Papers

More From: MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer