Classification of Imbalanced Data Using SMOTE and AutoEncoder Based Deep Convolutional Neural Network

Suja A. Alex,J. Jesu Vedha Nayahi

doi:10.1142/s0218488523500228

Abstract

The imbalanced data classification is a challenging issue in many domains including medical intelligent diagnosis and fraudulent transaction analysis. The performance of the conventional classifier degrades due to the imbalanced class distribution of the training data set. Recently, machine learning and deep learning techniques are used for imbalanced data classification. Data preprocessing approaches are also suitable for handling class imbalance problem. Data augmentation is one of the preprocessing techniques used to handle skewed class distribution. Synthetic Minority Oversampling Technique (SMOTE) is a promising class balancing approach and it generates noise during the process of creation of synthetic samples. In this paper, AutoEncoder is used as a noise reduction technique and it reduces the noise generated by SMOTE. Further, Deep one-dimensional Convolutional Neural Network is used for classification. The performance of the proposed method is evaluated and compared with existing approaches using different metrics such as Precision, Recall, Accuracy, Area Under the Curve and Geometric Mean. Ten data sets with imbalance ratio ranging from 1.17 to 577.87 and data set size ranging from 303 to 284807 instances are used in the experiments. The different imbalanced data sets used are Heart-Disease, Mammography, Pima Indian diabetes, Adult, Oil-Spill, Phoneme, Creditcard, BankNoteAuthentication, Balance scale weight & distance database and Yeast data sets. The proposed method shows an accuracy of 96.1%, 96.5%, 87.7%, 87.3%, 95%, 92.4%, 98.4%, 86.1%, 94% and 95.9% respectively. The results suggest that this method outperforms other deep learning methods and machine learning methods with respect to G-mean and other performance metrics.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Classification of Imbalanced Data Using SMOTE and AutoEncoder Based Deep Convolutional Neural Network

Abstract

Talk to us

Similar Papers

More From: International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems

Lead the way for us

Journal: International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems	Publication Date: Jun 1, 2023
Citations: 3

Similar Papers

SMOTE-LOF for noise identification in imbalanced data classification
Asniar ... Kridanto Surendro
Journal of King Saud University - Computer and Information Sciences | VOL. 34
Asniar, et. al. Asniar ... Kridanto Surendro
09 Feb 2021
Journal of King Saud University - Computer and Information Sciences | VOL. 34

Stroke Prediction with Machine Learning Methods among Older Chinese.
Yafei Wu ... Ya Fang
International journal of environmental research and public health | VOL. 17
Yafei Wu, et. al.Yafei Wu ... Ya Fang
01 Mar 2020
International journal of environmental research and public health | VOL. 17

Automated semiconductor wafer defect classification dealing with imbalanced data
Po-Hsuan Lee ... John C Robinson
-
Po-Hsuan Lee, et. al.Po-Hsuan Lee ... John C Robinson
20 Mar 2020
20 Mar 2020

Deep Learning for Imbalanced Multimedia Data Classification
Yilin Yan ... Min Chen
-
Yilin Yan, et. al.Yilin Yan ... Min Chen
01 Dec 2015
01 Dec 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classification of Imbalanced Data Using SMOTE and AutoEncoder Based Deep Convolutional Neural Network

Abstract

Talk to us

Similar Papers

More From: International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems