Synthetic Minority Oversampling Technique (SMOTE) for Boosting the Accuracy of C4.5 Algorithm Model

Wiwi Rahayu,Gustientiedina Gustientiedina,Yenny Desnelita,Johan Johan,Alyauma Hajjah,Gusrianty Gusrianty,Deny Jollyta Deny Jollyta,Yulvia Nora Marlim

doi:10.59934/jaiea.v3i3.469

Abstract

The low accuracy of the classification model may be caused by dataset imbalance. In reality, low-accuracy models are unacceptable. The purpose of this research is to address data imbalances in an employee performance dataset identified using the C4.5 method. SMOTE is the approach for addressing data imbalance. SMOTE is utilized to generate a large amount of data in the majority or minority class, which has an initial classification accuracy of just 17%. The C4.5 algorithm classifies the new dataset created by SMOTE, which consists of 11 attributes divided three times between training and testing data. The research found that with a 60:40 data split, the classification model had a 69% accuracy. Model accuracy climbed to 76% at 70:30 data splitting, and 86% at the final splitting, which was 80:20. The model's output matches the evaluation findings obtained using the confusion matrix. The research findings indicate that SMOTE may improve classification model accuracy by boosting data in imbalanced classes.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Synthetic Minority Oversampling Technique (SMOTE) for Boosting the Accuracy of C4.5 Algorithm Model

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence and Engineering Applications (JAIEA)

Lead the way for us

Journal: Journal of Artificial Intelligence and Engineering Applications (JAIEA)	Publication Date: Jun 5, 2024
License type: CC BY-NC-SA 4.0

Similar Papers

A three-step combination strategy for addressing outliers and class imbalance in software defect prediction
Muhammad Rizky Pribadi ... Hindriyanto Dwi Purnomo
IAES International Journal of Artificial Intelligence (IJ-AI) | VOL. 13
Muhammad Rizky Pribadi, et. al.Muhammad Rizky Pribadi ... Hindriyanto Dwi Purnomo
01 Sep 2024
IAES International Journal of Artificial Intelligence (IJ-AI) | VOL. 13

Comparative Multinomial Text Classification Analysis of Naïve Bayes and XGBoost with SMOTE on Imbalanced Dataset
Ashish Chaturvedi ... Mohd Abuzar Mohd Haroon Ansari
-
Ashish Chaturvedi, et. al.Ashish Chaturvedi ... Mohd Abuzar Mohd Haroon Ansari
05 Sep 2021
05 Sep 2021

LoRAS: an oversampling approach for imbalanced datasets
Saptarshi Bej ... Olaf Wolkenhauer
Machine Learning | VOL. 110
Saptarshi Bej, et. al.Saptarshi Bej ... Olaf Wolkenhauer
12 Nov 2020
Machine Learning | VOL. 110

A heuristic‐based hybrid sampling method using a combination of SMOTE and ENN for imbalanced health data
Hatice Nizam‐Ozogur ... Zeynep Orman
Expert Systems | VOL. 41
Hatice Nizam‐Ozogur, et. al.Hatice Nizam‐Ozogur ... Zeynep Orman
29 Mar 2024
Expert Systems | VOL. 41

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Synthetic Minority Oversampling Technique (SMOTE) for Boosting the Accuracy of C4.5 Algorithm Model

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence and Engineering Applications (JAIEA)