A transparent and nonlinear method for variable selection

Keyao Wang, Huiwen Wang, Jichang Zhao, Lihong Wang

doi:10.1016/j.eswa.2023.121398

Abstract

Variable selection is the procedure of identifying the truly important predictors among the inputs. Complex nonlinear dependencies and strong coupling make variable selection in high-dimensional data challenging, and real-world applications increasingly demand interpretable selection processes. A pragmatic approach should not only yield the most predictive covariates but also provide ample, easy-to-understand reasons for removing the others. To meet these requirements, this paper proposes an approach for transparent and nonlinear variable selection. To transparently decouple the information within the input predictors, a three-step heuristic search is designed that groups the predictors into four subsets: relevant predictors, which are selected, and uninformative, redundant, and conditionally independent predictors, which are removed. A nonlinear partial correlation coefficient is introduced to better identify predictors that have a nonlinear functional dependence with the response. The selected subset is suitable input for commonly used predictive models. The superiority of the proposed method over state-of-the-art baselines is demonstrated in terms of predictive accuracy and model interpretability.
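The abstract names the four subsets but does not spell out the three steps or define the nonlinear partial correlation coefficient, so the following is only a minimal sketch of how such a grouping could work, not the authors' algorithm. Everything in it is an assumption made for illustration: the function names (`poly_basis`, `residual`, `fit_r2`, `three_step_select`), the three thresholds, the order of the steps, and above all the dependence measure, which here is the R² of an additive cubic polynomial fit standing in for the paper's coefficient.

```python
import numpy as np


def poly_basis(Z, degree=3):
    """Intercept plus powers 1..degree of each column of Z (no cross terms)."""
    n = Z.shape[0]
    return np.hstack([np.ones((n, 1))] + [Z ** d for d in range(1, degree + 1)])


def residual(target, Z, degree=3):
    """Part of target left over after a least-squares polynomial fit on Z."""
    B = poly_basis(Z, degree)
    coef, *_ = np.linalg.lstsq(B, target, rcond=None)
    return target - B @ coef


def fit_r2(target, Z, degree=3):
    """R^2 of the polynomial fit: an assumed stand-in for nonlinear dependence."""
    res = residual(target, Z, degree)
    centered = target - target.mean()
    return 1.0 - (res @ res) / (centered @ centered)


def three_step_select(X, y, relevance_min=0.1, redundancy_max=0.9, partial_min=0.05):
    """Group column indices of X into four subsets (hypothetical thresholds)."""
    p = X.shape[1]
    groups = {"relevant": [], "uninformative": [],
              "redundant": [], "conditionally_independent": []}

    # Step 1: drop predictors with weak marginal dependence on the response.
    marginal = [fit_r2(y, X[:, [j]]) for j in range(p)]
    candidates = []
    for j in range(p):
        if marginal[j] < relevance_min:
            groups["uninformative"].append(j)
        else:
            candidates.append(j)

    # Step 2: visit the strongest predictors first and drop any predictor that
    # is almost a function of a stronger predictor that was already kept.
    candidates.sort(key=lambda j: -marginal[j])
    kept = []
    for j in candidates:
        if any(fit_r2(X[:, j], X[:, [k]]) > redundancy_max for k in kept):
            groups["redundant"].append(j)
        else:
            kept.append(j)

    # Step 3: forward pass; drop predictors that add nothing about the response
    # once the already selected predictors are accounted for.
    selected = []
    for j in kept:
        ry = residual(y, X[:, selected])
        rx = residual(X[:, j], X[:, selected])
        if fit_r2(ry, rx[:, None]) < partial_min:
            groups["conditionally_independent"].append(j)
        else:
            selected.append(j)

    groups["relevant"] = selected
    return groups


# Synthetic data (also an assumption): y depends on x0 nonlinearly and on x1.
rng = np.random.default_rng(0)
n = 2000
x0, x1, x2 = rng.normal(size=(3, n))      # x2 is pure noise
x3 = x0 + 0.7 * rng.normal(size=n)        # related to y only through x0
x4 = x0 + 0.2 * rng.normal(size=n)        # near-duplicate of x0
X = np.column_stack([x0, x1, x2, x3, x4])
y = x0 ** 2 + x1 + 0.1 * rng.normal(size=n)

groups = three_step_select(X, y)
print(groups)
```

In this toy setup a plain Pearson correlation between `x0` and `y` would be close to zero because the dependence is quadratic, which is the kind of case a nonlinear coefficient is meant to catch. The returned dictionary also records why each removed column was removed, which mirrors the transparency goal stated in the abstract.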

Journal: Expert Systems With Applications
Publication Date: Sep 4, 2023
Citations: 1

Similar Papers

Bayesian variable selection for high-dimensional data with an ordinal response: identifying genes associated with prognostic risk group in acute myeloid leukemia
Yiran Zhang ... Kellie J Archer
BMC Bioinformatics | VOL. 22
02 Nov 2021

PALLADIO: a parallel framework for robust variable selection in high-dimensional data
13 Nov 2016

Erratum to: Ultrahigh dimensional variable selection through the penalized maximum trimmed likelihood estimator
N M Neykov ... P Filzmoser
Statistical Papers | VOL. 55
25 May 2013

Nested coordinate descent algorithms for empirical likelihood
Cheng Yong Tang ... Tong Tong Wu
Journal of Statistical Computation and Simulation | VOL. 84
18 Feb 2013
