Data Pre-Processing Using SMOTE Technique for Gender Classification with Imbalance Hu’s Moments Features

Ahmad Haadzal Kamarulzalis, Balkiah Moktar, Muhamad Hasbullah Mohd Razali

doi:10.1007/978-981-10-8471-3_37

Abstract

Imbalanced data is common in real-world applications such as text categorization, face recognition for gender classification, medical diagnosis, fraud detection, and oil-spill detection in satellite images. Most machine learning algorithms focus on classifying the majority class while ignoring or misclassifying minority samples. Minority samples are those that occur rarely but are very important. It is commonly agreed that standard classifiers such as neural networks, support vector machines, and C4.5 are heavily biased toward recognizing the majority class, since they are built to maximize overall accuracy, to which the minority class contributes very little. In this study, we demonstrate how the synthetic minority over-sampling technique (SMOTE) can significantly mitigate the imbalance problem in gender classification from the data-level perspective. Hu's moments of the face images were generated as numerical descriptors at different imbalance ratios and classified using a supervised decision tree algorithm (J48). The results show that, before the data were preprocessed with SMOTE, the minority group was severely misclassified as the majority group. Applying SMOTE to reduce the imbalance effects before inducing the decision tree confirms our claims.
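The core idea behind SMOTE, as it is generally described, is to create new minority samples by interpolating between an existing minority sample and one of its nearest minority-class neighbours. The following is a minimal sketch of that idea in plain Python, not the implementation used in the study. The function name `smote`, its parameters, and the two-dimensional toy feature vectors are assumptions made for illustration; real Hu's moment descriptors have seven components per image.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (illustrative sketch)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy feature vectors standing in for Hu's moment descriptors.
majority = [(0.1 * i, 0.2 * i) for i in range(20)]
minority = [(5.0, 5.0), (5.5, 5.2), (6.0, 5.1), (5.2, 6.0)]

# Oversample the minority class until both classes have the same size.
new_points = smote(minority, n_synthetic=len(majority) - len(minority), k=3)
balanced_minority = minority + new_points
print(len(majority), len(balanced_minority))  # prints "20 20"
```

Because each synthetic point lies on the line segment between two real minority samples, the oversampled class stays inside the region the minority data already occupies. The balanced set would then be passed to the classifier, which in the study is the J48 decision tree.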


