Variable selection using shrinkage priors

Hanning Li, Debdeep Pati

doi:10.1016/j.csda.2016.10.008

Open Access

https://doi.org/10.1016/j.csda.2016.10.008

Journal: Computational Statistics & Data Analysis
Publication Date: Oct 19, 2016
Citations: 43
License type: elsevier-specific

Affiliation: Florida State University

Abstract

Variable selection has received widespread attention over the last decade as we routinely encounter high-throughput datasets in complex biological and environmental research. Most Bayesian variable selection methods are restricted to mixture priors having separate components for characterizing the signal and the noise. However, such priors encounter computational issues in high dimensions. This has motivated continuous shrinkage priors, which resemble the two-component priors while facilitating computation and interpretability. While such priors are widely used for estimating high-dimensional sparse vectors, selecting a subset of variables remains a daunting task. A general approach for variable selection with shrinkage priors is proposed. The presence of very few tuning parameters makes our method attractive in comparison to ad hoc thresholding approaches. The approach is not limited to continuous shrinkage priors; it can be used with any shrinkage prior. Theoretical properties for near-collinear design matrices are investigated, and the method is shown to perform well in a wide range of synthetic data examples and in a real data example on selecting genes affecting survival in lymphoma patients.

Full Text