A Distance-Based Weighted Undersampling Scheme for Support Vector Machines and its Application to Imbalanced Classification.

Qi Kang,Lei Shi,Mengchu Zhou,Xuesong Wang,Qidi Wu,Zhi Wei

doi:10.1109/tnnls.2017.2755595

Abstract

A support vector machine (SVM) plays a prominent role in classic machine learning, especially classification and regression. Through its structural risk minimization, it has enjoyed a good reputation in effectively reducing overfitting, avoiding dimensional disaster, and not falling into local minima. Nevertheless, existing SVMs do not perform well when facing class imbalance and large-scale samples. Undersampling is a plausible alternative to solve imbalanced problems in some way, but suffers from soaring computational complexity and reduced accuracy because of its enormous iterations and random sampling process. To improve their classification performance in dealing with data imbalance problems, this work proposes a weighted undersampling (WU) scheme for SVM based on space geometry distance, and thus produces an improved algorithm named WU-SVM. In WU-SVM, majority samples are grouped into some subregions (SRs) and assigned different weights according to their Euclidean distance to the hyper plane. The samples in an SR with higher weight have more chance to be sampled and put to use in each learning iteration, so as to retain the data distribution information of original data sets as much as possible. Comprehensive experiments are performed to test WU-SVM via 21 binary-class and six multiclass publically available data sets. The results show that it well outperforms the state-of-the-art methods in terms of three popular metrics for imbalanced classification, i.e., area under the curve, F-Measure, and G-Mean.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Distance-Based Weighted Undersampling Scheme for Support Vector Machines and its Application to Imbalanced Classification.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Oct 25, 2017
Citations: 171

Similar Papers

Ensemble Learning with Support Vector Machines for Bond Rating

-

01 Jan 2012
01 Jan 2012

An Adaptive Pre-clustering Support Vector Machine for Binary Imbalanced Classification
Zonglin Di ... Qi Kang
-
Zonglin Di, et. al.Zonglin Di ... Qi Kang
01 Oct 2018
01 Oct 2018

Fuzzy support vector machine for microarray imbalanced data classification
Faroh Ladayya ... Irhamah
-
Faroh Ladayya, et. al.Faroh Ladayya ... Irhamah
01 Jan 2017
01 Jan 2017

Signal discrimination using a support vector machine for genetic syndrome diagnosis
...
-
, et. al. ...
23 Aug 2004
23 Aug 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Distance-Based Weighted Undersampling Scheme for Support Vector Machines and its Application to Imbalanced Classification.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems