ℓ2,1 norm regularized multi-kernel based joint nonlinear feature selection and over-sampling for imbalanced data classification

Peng Cao, Xiaoli Liu, Jian Zhang, Dazhe Zhao, Min Huang, Osmar Zaiane

doi:10.1016/j.neucom.2016.12.036

Abstract

High dimensionality and the classification of imbalanced data sets are two of the most interesting challenges in machine learning. The two issues have been studied independently in the literature. To address feature selection and oversampling simultaneously, we combine the two methodological approaches in a unified kernel framework. Specifically, we propose a novel ℓ2,1 norm balanced multiple kernel feature selection method (ℓ2,1 MKFS) and design a proximal-based optimization algorithm for learning the model efficiently. Moreover, we develop multiple kernel oversampling (MKOS) to generate synthetic instances in the optimal kernel space induced by ℓ2,1 MKFS, so as to compensate for the imbalanced class distribution. Our experimental results on multiple UCI data sets and two real medical applications demonstrate that jointly performing nonlinear feature selection and oversampling within the ℓ2,1 norm multi-kernel learning framework (ℓ2,1 MKFSOS) can lead to promising classification performance.
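For readers unfamiliar with the ℓ2,1 norm named in the abstract, the following is a minimal sketch of its standard definition (the sum of the Euclidean norms of a matrix's rows) and of the row-wise soft-thresholding operator that proximal methods typically use with it. The abstract does not state the paper's actual objective or update rules, so the function names `l21_norm` and `prox_l21` and the step parameter `tau` are assumptions for illustration only, not the authors' algorithm.

```python
import math


def l21_norm(W):
    """Standard l2,1 norm: sum over rows of each row's Euclidean norm.

    W is a matrix given as a list of rows (hypothetical layout).
    """
    return sum(math.sqrt(sum(x * x for x in row)) for row in W)


def prox_l21(W, tau):
    """Proximal operator of tau * ||W||_{2,1} (row-wise group soft-thresholding).

    Each row is shrunk toward zero by tau in Euclidean norm; rows whose norm
    is at most tau become exactly zero, which is what yields row sparsity.
    """
    out = []
    for row in W:
        norm = math.sqrt(sum(x * x for x in row))
        # Guard against division by zero for all-zero rows.
        scale = max(0.0, 1.0 - tau / norm) if norm > 0 else 0.0
        out.append([scale * x for x in row])
    return out


if __name__ == "__main__":
    W = [[3.0, 4.0], [0.1, 0.0]]
    print(l21_norm(W))
    print(prox_l21(W, 1.0))
```

In this sketch, a row with small norm is zeroed out entirely while a row with large norm is only shrunk, which is the mechanism by which ℓ2,1 regularization can discard whole groups of coefficients at once.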

Journal: Neurocomputing
Publication Date: Dec 18, 2016
Citations: 32
License type: publisher-specific-oa

Similar Papers

Feature selection for high-dimensional class-imbalanced data sets using Support Vector Machines
Sebastián Maldonado ... Fazel Famili
Information Sciences | VOL. 286
30 Jul 2014

The problem of classification in imbalanced data sets in knowledge discovery
Haifeng Sui ... Wu Qu
01 Oct 2010

Classification of imbalanced data sets using Multi Objective Genetic Programming
Hardik H Maheta ... Vipul K Dabhi
01 Jan 2015

Unbalanced Data Set Classification Based on Convolutional Neural Network
Hui Xiong
01 Sep 2021
