Abstract

A stochastic search method, the so-called Adaptive Subspace (AdaSub) method, is proposed for variable selection in high-dimensional linear regression models. The method aims at finding the best model with respect to a certain model selection criterion and is based on the idea of adaptively solving low-dimensional sub-problems in order to provide a solution to the original high-dimensional problem. Any of the usual $\ell_0$-type model selection criteria can be used, such as Akaike's Information Criterion (AIC), the Bayesian Information Criterion (BIC) or the Extended BIC (EBIC), with the last being particularly suitable for high-dimensional cases. The limiting properties of the new algorithm are analysed and it is shown that, under certain conditions, AdaSub converges to the best model according to the considered criterion. In a simulation study, the performance of AdaSub is investigated in comparison to alternative methods. The effectiveness of the proposed method is illustrated via various simulated datasets and a high-dimensional real data example.
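To make the adaptive idea concrete, below is a minimal sketch of such a scheme in Python. It illustrates the mechanism described in the abstract and is not the authors' reference implementation: the function names, the initial expected subspace size `q`, the learning rate `K` and the exact form of the probability update are assumptions made for exposition.

```python
from itertools import combinations

import numpy as np

def best_subset(X, y, candidates, criterion, max_size=10):
    """Exhaustively solve the low-dimensional sub-problem on `candidates`,
    minimising the given model selection criterion. The cap `max_size`
    keeps the exhaustive search feasible."""
    best_S, best_val = set(), criterion(X, y, [])
    for k in range(1, min(max_size, len(candidates)) + 1):
        for S in combinations(candidates, k):
            val = criterion(X, y, list(S))
            if val < best_val:
                best_S, best_val = set(S), val
    return best_S

def adasub(X, y, criterion, q=5.0, K=100.0, n_iter=2000, seed=0):
    """Sketch of an adaptive subspace search: repeatedly sample a small
    random subspace, solve it exactly, and adapt the per-variable
    selection probabilities (update rule is an assumption)."""
    rng = np.random.default_rng(seed)
    _, p = X.shape
    r = np.full(p, q / p)        # start with expected subspace size q
    selected = np.zeros(p)       # times j entered the best sub-model
    considered = np.zeros(p)     # times j entered a sampled subspace
    for _ in range(n_iter):
        V = np.flatnonzero(rng.random(p) < r)    # random low-dim subspace
        S = best_subset(X, y, V, criterion)
        considered[V] += 1
        selected[list(S)] += 1
        # pull r_j towards the frequency with which variable j survived
        # the sub-problems it took part in
        r = (q + K * selected) / (p + K * considered)
    return r
```

Here `criterion(X, y, S)` is any $\ell_0$-type criterion to be minimised, e.g. the EBIC sketched further below.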

Highlights

  • The Adaptive Subspace (AdaSub) method is a stochastic search method for variable selection in high-dimensional linear regression, based on adaptively solving low-dimensional sub-problems with respect to an $\ell_0$-type model selection criterion such as AIC, BIC or EBIC

  • If the ordered importance property (OIP) is satisfied, Adaptive Subspace (AdaSub) converges to the optimal solution of the generally NP-hard $\ell_0$-regularized optimization problem

  • AdaSub provides a stable thresholded model even when OIP is not guaranteed to hold (see the short usage sketch after this list). Simulated and real data examples demonstrate that AdaSub is very competitive for high-dimensional variable selection in comparison to state-of-the-art methods such as the Adaptive Lasso, SCAD, Tilting and the Bayesian split-and-merge approach (SAM)
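As a usage illustration for the sketch above, the stable thresholded model mentioned in the last highlight can be read off the final selection probabilities; the cutoff of 0.9 is an assumed choice for exposition, not necessarily the paper's recommendation.

```python
# Continuing the AdaSub sketch above (X, y and criterion as before):
r = adasub(X, y, criterion)                  # final selection probabilities
thresholded_model = np.flatnonzero(r > 0.9)  # indices with r_j above the cutoff
```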

Introduction

Rapid developments during the last decades in fields such as information technology and genetics have led to the collection of huge amounts of data, in which the number of possible explanatory variables can be very large. The classical BIC implicitly places a uniform prior on the space of candidate models; Chen and Chen (2008) argue that this model prior underlying the BIC is not suitable for a high-dimensional framework where the truth is assumed to be sparse. They therefore propose a modified version of the BIC, called the Extended Bayesian Information Criterion (EBIC), with an adjusted underlying prior on the model space: for a fixed additional parameter $\gamma \in [0, 1]$ and a subset $S \subseteq P$ of the predictors, the prior of the corresponding model is taken to be $\pi(S) \propto p^{-\gamma |S|}$.
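For concreteness, here is a hedged sketch of how such a criterion can be evaluated for a candidate subset $S$. It uses the common EBIC form $n \log(\mathrm{RSS}/n) + |S| \log n + 2\gamma |S| \log p$, which corresponds to the prior above; fitting by ordinary least squares via `numpy.linalg.lstsq` is an implementation choice, not something prescribed by the paper.

```python
import numpy as np

def ebic(X, y, S, gamma=1.0):
    """Extended BIC of the linear model with predictor subset S, in the
    common form n*log(RSS/n) + |S|*log(n) + 2*gamma*|S|*log(p)."""
    n, p = X.shape
    S = list(S)
    if S:
        beta, *_ = np.linalg.lstsq(X[:, S], y, rcond=None)
        rss = float(np.sum((y - X[:, S] @ beta) ** 2))
    else:
        rss = float(np.sum((y - y.mean()) ** 2))  # intercept-only baseline
    return n * np.log(rss / n) + len(S) * np.log(n) \
        + 2.0 * gamma * len(S) * np.log(p)
```

With $\gamma = 0$ the criterion reduces to the ordinary BIC, while larger values of $\gamma$ penalise model size more heavily as the number of candidate predictors $p$ grows.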
