Abstract
Sparse-regression-based feature selection has been extensively investigated in recent years. However, because the problem carries a non-convex constraint, namely the $\ell_{2,0}$-norm constraint, it is very hard to solve. In this paper, unlike most existing methods, which solve only a relaxed version by forcing a sparsity regularizer into the objective function, we propose a novel framework that solves the original $\ell_{2,0}$-norm constrained sparse-regression-based feature selection problem. Using a new label coding scheme, we transform the objective function into Linear Discriminant Analysis (LDA), which enables the model to compute, for each feature, the ratio of inter-class scatter to intra-class scatter, the most widely used metric for evaluating feature discrimination. Features can then be selected by simply sorting on this ratio. A projected gradient descent algorithm, initialized with the solution obtained above, is introduced to further improve performance; this initialization also ensures the stability of the iterative procedure. We prove that the proposed method attains the global optimum of this non-convex problem when all features are statistically independent. For the general case of statistically dependent features, extensive experiments on six small-sample-size datasets and one large-scale dataset show that, with an SVM classifier, our algorithm achieves classification accuracy comparable to or better than eight other state-of-the-art feature selection methods. We also show that our algorithm attains a low loss value, meaning its solution comes very close to the true solution of this NP-hard problem. Moreover, because we solve the original $\ell_{2,0}$-norm constrained problem, we avoid the heavy burden of tuning a regularization parameter: in our method its meaning is explicit, namely the number of selected features. Finally, we experimentally evaluate the stability of our algorithm from two perspectives, the objective function values and the selected features, and it shows satisfactory stability on both.
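The paper's exact label coding and LDA transformation are not reproduced in the abstract, so the following is only a minimal sketch of the two-stage pipeline it describes: a Fisher-style per-feature scatter ratio for the sorting stage, and projected gradient descent on a least-squares loss $\|XW - Y\|_F^2$ under the $\ell_{2,0}$ constraint for the refinement stage. The function names, the one-hot label assumption, the step size, and the epsilon guard are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def scatter_ratio_scores(X, y):
    """Per-feature ratio of inter-class to intra-class scatter (Fisher-style).
    X: (n_samples, n_features) data matrix; y: (n_samples,) integer labels."""
    overall_mean = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for c in np.unique(y):
        Xc = X[y == c]
        diff = Xc.mean(axis=0) - overall_mean
        between += Xc.shape[0] * diff ** 2
        within += ((Xc - Xc.mean(axis=0)) ** 2).sum(axis=0)
    return between / (within + 1e-12)  # epsilon guards against zero intra-class scatter

def project_l20(W, k):
    """Euclidean projection onto {W : ||W||_{2,0} <= k}: keep the k rows
    with the largest l2 norms and zero out the rest."""
    keep = np.argsort(np.linalg.norm(W, axis=1))[::-1][:k]
    P = np.zeros_like(W)
    P[keep] = W[keep]
    return P

def pgd_feature_selection(X, Y, k, n_iter=200):
    """Projected gradient descent on (1/2)||XW - Y||_F^2 subject to the
    l2,0 constraint, warm-started from the scatter-ratio ranking."""
    y_labels = Y.argmax(axis=1)  # assumes Y is a one-hot label matrix
    top_k = np.argsort(scatter_ratio_scores(X, y_labels))[::-1][:k]
    # Warm start: least-squares fit restricted to the top-k scored features.
    W = np.zeros((X.shape[1], Y.shape[1]))
    W[top_k] = np.linalg.lstsq(X[:, top_k], Y, rcond=None)[0]
    lr = 1.0 / (np.linalg.norm(X, 2) ** 2)  # 1/L, the gradient's Lipschitz constant
    for _ in range(n_iter):
        W = project_l20(W - lr * (X.T @ (X @ W - Y)), k)
    return W  # the selected features are the rows of W with nonzero norm
```

Note that the row-wise hard-thresholding step is the exact Euclidean projection onto the $\ell_{2,0}$ ball, so every iterate is feasible; the constraint parameter $k$ is directly the number of selected features, which is the interpretability advantage over a tuned regularization weight that the abstract emphasizes.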