Special issue on ACML 2015

Chi Sing Leung

doi:10.1016/j.neucom.2017.02.076

Abstract

Extracting a small number of task-relevant features, i.e., feature selection, is often a crucial step in supervised learning problems. Sparse linear regression provides a fast and convenient option for feature selection, where regularization drives the weight parameters of irrelevant features toward zero. However, the regularization also induces undesirable shrinkage in the weights of relevant features. Here, we propose Bayesian masking (BM) to resolve this trade-off between sparsity and shrinkage. Our strategy is not to impose any regularization directly on the weights; instead, BM introduces binary latent variables, called masking variables, into a regression model to maintain sparsity: each feature-sample pair has a binary variable whose value determines whether the feature is masked at that sample. We derive a variational Bayesian inference algorithm for the augmented model based on the factorized information criterion (FIC), a recently proposed asymptotic approximation of the marginal log-likelihood. We analyze the one-dimensional estimators of Lasso, automatic relevance determination (ARD), and BM, and thereby show the superiority of BM in terms of the sparsity-shrinkage trade-off. Finally, we confirm our theoretical analyses through experiments and demonstrate that BM achieves higher feature selection accuracy than Lasso and ARD.
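The sparsity-shrinkage trade-off and the masking idea can be illustrated with a small sketch. It is not the authors' algorithm: it uses the standard one-dimensional Lasso solution (soft thresholding) to show shrinkage, and it assumes the masked model takes the form y_n = sum_j w_j z_nj x_nj + noise, which is one reading of the abstract. The names `soft_threshold`, `masked_predict`, and `keep_prob` are hypothetical, and the masks are drawn from fixed probabilities rather than inferred by FIC-based variational inference.

```python
import numpy as np

def soft_threshold(w_ols, lam):
    """1-D Lasso estimate: the minimizer of 0.5*(w - w_ols)**2 + lam*|w|."""
    return np.sign(w_ols) * np.maximum(np.abs(w_ols) - lam, 0.0)

def masked_predict(X, Z, w):
    """Linear prediction where the binary mask Z[n, j] switches feature j
    on or off at sample n (assumed form of the masked model)."""
    return (Z * X) @ w

# Shrinkage: the small weight is zeroed (sparsity), but the large,
# relevant weight is also pulled toward zero by lam.
w_ols = np.array([0.05, 2.0])
w_lasso = soft_threshold(w_ols, lam=0.5)

# Masking: simulate data in which the third feature is always masked.
rng = np.random.default_rng(0)
N, D = 200, 3
X = rng.normal(size=(N, D))
w_true = np.array([2.0, -1.0, 0.0])
keep_prob = np.array([1.0, 1.0, 0.0])  # hypothetical mask probabilities
Z = (rng.random((N, D)) < keep_prob).astype(float)
y = masked_predict(X, Z, w_true) + 0.1 * rng.normal(size=N)

# If the mask were known, ordinary least squares on the unmasked features
# carries no penalty on the weights, so it does not shrink them.
w_hat, *_ = np.linalg.lstsq(X[:, :2], y, rcond=None)
```

In this toy setting the sparsity comes entirely from the mask, while the weights of the retained features are fitted without a penalty. In BM the masks are latent and must be inferred, which this sketch leaves out.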

Similar Papers

Sparse Bayesian linear regression with latent masking variables
Yohei Kondo ... Shin-Ichi Maeda
Neurocomputing | VOL. 258
07 Mar 2017

Variational Bayesian Orthogonal Nonnegative Matrix Factorization Over the Stiefel Manifold
Abderrahmane Rahiche ... Mohamed Cheriet
IEEE Transactions on Image Processing | VOL. 31
01 Jan 2021

Bayesian approach to feature selection and parameter tuning for support vector machine classifiers
Carl Gold ... Peter Sollich
Neural Networks | VOL. 18
01 Jul 2005

Sparse Bayesian inference methods for decoding 3D reach and grasp kinematics and joint angles with primary motor cortical ensembles
Zhe Chen ... Kazutaka Takahashi
01 Jul 2013
