Extended natural neighborhood for SMOTE and its variants in imbalanced classification

Hongjiao Guan, Long Zhao, Xiangjun Dong, Chuan Chen

doi:10.1016/j.engappai.2023.106570

Abstract

Imbalanced data classification is a challenging problem encountered in many practical applications. The synthetic minority oversampling technique (SMOTE) and its variants are popular resampling methods. However, in most of these methods, the neighborhood determined by k-nearest neighbors (kNN) cannot reflect the local distribution precisely, which leads to the generation of noisy examples. To solve this problem, we propose a neighborhood concept that does not require the parameter k, called extended natural neighbor (ENaN), which is derived from natural neighbor (NaN). ENaN unites kNN and reverse kNN to determine neighbors adaptively according to the sample distribution. Compared with NaN, ENaN explores broader neighborhoods, which helps improve the quality of the generated examples. ENaN-based SMOTE (ENaNSMOTE) can improve the sample distribution obtained by SMOTE and NaNSMOTE. Extensive experiments using 30 synthetic and 20 real-world datasets demonstrate the effectiveness of ENaN in SMOTE and its variants.
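The idea of pairing kNN with reverse kNN and then interpolating SMOTE-style can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the ENaN definition, so the code below assumes (1) that k is grown until every point has at least one reverse neighbor or the number of points without one stops changing, a stopping rule borrowed from the general natural-neighbor search idea, and (2) that a point's neighborhood is the union of its kNN and reverse kNN sets. The function names `adaptive_neighborhoods` and `oversample` are hypothetical, and the neighborhoods are computed on the minority class only.

```python
import numpy as np


def adaptive_neighborhoods(X):
    """Return (neighborhoods, k) without a user-supplied k.

    ASSUMPTION: illustrative stopping rule and union rule, not the
    paper's exact ENaN definition.
    """
    n = len(X)
    if n < 2:
        raise ValueError("need at least two points")
    # Pairwise Euclidean distances; a point is never its own neighbor.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)
    order = np.argsort(dist, axis=1)[:, : n - 1]  # full neighbor ranking

    prev_orphans = -1
    for k in range(1, n):
        # How often each point appears in somebody's kNN list.
        reverse_count = np.bincount(order[:, :k].ravel(), minlength=n)
        orphans = int(np.sum(reverse_count == 0))
        if orphans == 0 or orphans == prev_orphans:
            break
        prev_orphans = orphans

    knn = [set(order[i, :k].tolist()) for i in range(n)]
    rknn = [set() for _ in range(n)]
    for i in range(n):
        for j in knn[i]:
            rknn[j].add(i)  # i has j as a neighbor, so i is a reverse neighbor of j
    # ASSUMPTION: neighborhood = kNN union reverse kNN.
    return [sorted(knn[i] | rknn[i]) for i in range(n)], k


def oversample(X_min, neighborhoods, n_new, rng):
    """SMOTE-style interpolation between a seed and one of its neighbors."""
    synthetic = []
    for _ in range(n_new):
        i = int(rng.integers(len(X_min)))
        j = int(rng.choice(neighborhoods[i]))
        gap = rng.random()  # uniform in [0, 1)
        synthetic.append(X_min[i] + gap * (X_min[j] - X_min[i]))
    return np.array(synthetic)


rng = np.random.default_rng(0)
X_min = rng.normal(size=(20, 2))  # toy minority-class examples
neighborhoods, k_found = adaptive_neighborhoods(X_min)
X_new = oversample(X_min, neighborhoods, n_new=10, rng=rng)
```

Because each synthetic example is a convex combination of two existing minority examples, the quality of the result depends entirely on which pairs the neighborhood allows, which is the part of SMOTE that ENaN targets.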

Journal: Engineering Applications of Artificial Intelligence
Publication Date: Jun 12, 2023
Citations: 9

Similar Papers

Analysis of Student Graduation Prediction Using Machine Learning Techniques on an Imbalanced Dataset: An Approach to Address Class Imbalance
Dedy Hermanto ... Muhammad Rizky Pribadi
Scientific Journal of Informatics | VOL. 11
05 Aug 2024

Instance weighted SMOTE by indirectly exploring the data distribution
Aimin Zhang ... Xibei Yang
Knowledge-Based Systems | VOL. 249
04 May 2022

The Improvement of Stress Level Detection in Twitter: Imbalance Classification Using SMOTE
Mohd Shahrul Nizam Mohd Danuri ... Rohizah Abd Rahman
14 Nov 2022

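K-NEAREST NEIGHBOR DENGAN ADAPTIVE BOOSTING DAN SYNTHETIC MINORITY OVERSAMPLING TECHNIQUE UNTUK KLASIFIKASI DATA TIDAK SEIMBANG [K-Nearest Neighbor with Adaptive Boosting and Synthetic Minority Oversampling Technique for Imbalanced Data Classification]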
Ria Sulistyo Yuliani ... Rukun Santoso
Jurnal Gaussian | VOL. 12
28 Jul 2023
