FCM-CSMOTE: Fuzzy C-Means Center-SMOTE

Roudani Mohammed,El Moutaouakil Karim

doi:10.1016/j.eswa.2024.123406

Abstract

Imbalanced class distributions in machine learning, where the minority class is often under-represented, pose a substantial challenge. Synthetic Minority Over-sampling Technique (SMOTE) has been widely employed to address this issue by generating synthetic minority samples through interpolation. Despite its popularity, SMOTE exhibits certain drawbacks caused by the implementation of random interpolation samples. In this paper, we introduce a new data level technique for oversampling, called Fuzzy C-Means Center-SMOTE (FCM-CSMOTE), which generates synthetic samples in each cluster using its center considered as the memory of the main data components. We demonstrate that the proposed selective strategy has a very low probability to generate noise. The experimental results demonstrate that the proposed method performs better than the state-of-the-art approaches on 21 real unbalanced data sets (regular and large size data set) in terms of several metrics, including Geometric Mean (GM), F-Measure (FM), Area Under the Curve (AUC), and Accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FCM-CSMOTE: Fuzzy C-Means Center-SMOTE

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications

Lead the way for us

Journal: Expert Systems with Applications	Publication Date: Feb 14, 2024
Citations: 3

Similar Papers

Churn prediction in telecommunication using machine learning
Kriti Mishra ... Rinkle Rani
-
Kriti Mishra, et. al.Kriti Mishra ... Rinkle Rani
01 Aug 2017
01 Aug 2017

Enhancing SMOTE for imbalanced data with abnormal minority instances
Surani Matharaarachchi ... Saman Muthukumarana
Machine Learning with Applications | VOL. 18
Surani Matharaarachchi, et. al.Surani Matharaarachchi ... Saman Muthukumarana
29 Oct 2024
Machine Learning with Applications | VOL. 18

Impact of Nature of Medical Data on Machine and Deep Learning for Imbalanced Datasets: Clinical Validity of SMOTE Is Questionable
Seifollah Gholampour
Machine Learning and Knowledge Extraction | VOL. 6
Seifollah GholampourSeifollah Gholampour
15 Apr 2024
Machine Learning and Knowledge Extraction | VOL. 6

LoRAS: an oversampling approach for imbalanced datasets
Saptarshi Bej ... Markus Wolfien
Machine Learning | VOL. 110
Saptarshi Bej, et. al.Saptarshi Bej ... Markus Wolfien
12 Nov 2020
Machine Learning | VOL. 110

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FCM-CSMOTE: Fuzzy C-Means Center-SMOTE

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications