Abstract

Empirical Bayes methods are well suited to selecting massive variables, which may be inter-connected through certain hierarchical structures, because of three attributes: incorporating prior information on model parameters, allowing data-driven hyperparameter values, and being free of tuning parameters. We propose an iterated conditional modes/medians (ICM/M) algorithm to implement empirical Bayes selection of massive variables while incorporating sparsity or more complicated a priori information. The iterated conditional modes are employed to obtain data-driven estimates of hyperparameters, and the iterated conditional medians are used to estimate the model coefficients and therefore enable the selection of massive variables. The ICM/M algorithm is computationally fast and easily extends empirical Bayes thresholding, which is adaptive to parameter sparsity, to complex data. Empirical studies suggest competitive performance of the proposed method, even in the simple case of selecting massive regression predictors.
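To make the mode/median alternation concrete, the following is a minimal Python sketch of an ICM/M-style cycle for linear regression under a simplified two-group prior with a point mass at zero and a Gaussian slab. This is only an illustration under stated assumptions: the published ICM/M algorithm uses empirical Bayes thresholding priors and can carry structured (e.g., Ising) priors, and all names here (icm_m, omega, tau) are illustrative, not taken from the paper.

# Hypothetical ICM/M-style sketch: conditional modes for hyperparameters,
# conditional medians for coefficients, under a spike-and-Gaussian-slab prior.
import numpy as np
from scipy.stats import norm

def mixture_median(p, m, s):
    """Median of the mixture (1 - p) * delta_0 + p * Normal(m, s^2)."""
    below = p * norm.cdf(-m / s)            # posterior mass strictly below 0
    if below >= 0.5:                         # median falls in the negative slab
        return m + s * norm.ppf(0.5 / p)
    if below + (1.0 - p) >= 0.5:             # point mass at 0 covers the median
        return 0.0
    return m + s * norm.ppf((0.5 - (1.0 - p)) / p)   # median in the positive slab

def icm_m(X, y, tau=1.0, n_iter=50):
    """Alternate conditional modes (omega, sigma2) and conditional medians (beta)."""
    n, p = X.shape
    beta = np.zeros(p)
    omega, sigma2 = 0.5, np.var(y)
    col_ss = (X ** 2).sum(axis=0)            # x_j' x_j for each predictor
    incl = np.zeros(p)
    for _ in range(n_iter):
        for j in range(p):
            r = y - X @ beta + X[:, j] * beta[j]          # partial residual
            z = X[:, j] @ r / col_ss[j]                    # univariate estimate of beta_j
            s2_z = sigma2 / col_ss[j]                      # its sampling variance
            # conditional posterior inclusion probability under the two-group prior
            num = omega * norm.pdf(z, 0.0, np.sqrt(s2_z + tau ** 2))
            den = num + (1.0 - omega) * norm.pdf(z, 0.0, np.sqrt(s2_z))
            incl[j] = num / den
            m = z * tau ** 2 / (tau ** 2 + s2_z)           # slab posterior mean
            v = tau ** 2 * s2_z / (tau ** 2 + s2_z)        # slab posterior variance
            beta[j] = mixture_median(incl[j], m, np.sqrt(v))   # conditional median
        # conditional modes of the hyperparameters given the current beta
        omega = np.clip(np.mean(beta != 0.0), 1e-3, 1 - 1e-3)
        sigma2 = np.sum((y - X @ beta) ** 2) / n
    return beta, incl

Because each coefficient update reduces to a univariate posterior median with a closed form, one full sweep costs roughly one pass over the design matrix, which is what makes this style of coordinate-wise algorithm fast for a large number of predictors.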

Highlights

  • Selecting variables from a large number of candidate predictors is a challenging yet critical task in analyzing high-dimensional data

  • Many efforts have been devoted to selecting variables from massive candidates by incorporating rich a priori information accumulated from historical research or practices

  • We propose an iterated conditional modes/medians (ICM/M) algorithm for easy implementation and fast computation of empirical Bayes variable selection (EBVS)



Introduction

Selecting variables from a large number of candidate predictors is a challenging yet critical task in analyzing high-dimensional data. Because high-dimensional data usually come with relatively small sample sizes, successful variable selection demands appropriate incorporation of a priori information, most commonly the sparsity assumption that only a few predictors are truly associated with the response. Many methods have been developed to take full advantage of this sparsity assumption, mostly built upon thresholding procedures (Donoho and Johnstone, 1994); see Tibshirani (1996), Fan and Li (2001), and others. For graph-structured variables, Li and Li (2010) and Pan et al. (2010) proposed to use Laplacian matrices and Lγ norms, respectively. Li and Zhang (2010) and Stingo et al. (2011) both employed Bayesian approaches to incorporate structural information of the variables, both formulating Ising priors.
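For intuition on how such structural information can enter a prior, the snippet below evaluates an unnormalized Ising prior over inclusion indicators on a graph, so that neighbouring variables are encouraged to be selected together. The parameterization (a, b, adjacency matrix A) is a common textbook form used here purely as an assumption; it is not necessarily the exact formulation of Li and Zhang (2010) or Stingo et al. (2011).

# Illustrative (unnormalized) Ising prior over inclusion indicators gamma in {0,1}^p.
import numpy as np

def ising_log_prior(gamma, A, a=-2.0, b=0.5):
    """log pi(gamma) up to a normalizing constant:
    a * sum_j gamma_j + b * sum_{j~k} gamma_j * gamma_k."""
    gamma = np.asarray(gamma, dtype=float)
    sparsity_term = a * gamma.sum()                 # a < 0 favours sparse models
    smoothness_term = 0.5 * b * gamma @ A @ gamma   # each undirected edge counted once
    return sparsity_term + smoothness_term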


