Abstract

We propose a computationally intensive method, the random lasso method, for variable selection in linear models. The method consists of two major steps. In step 1, the lasso method is applied to many bootstrap samples, each using a set of randomly selected covariates; this step yields an importance measure for each covariate. In step 2, a similar procedure is carried out, except that for each bootstrap sample the subset of covariates is randomly selected with unequal selection probabilities determined by the covariates' importance. The adaptive lasso may be used in the second step, with weights determined by the importance measures. The final set of covariates and their coefficients are determined by averaging the bootstrap results from step 2. The proposed method alleviates some of the limitations of the lasso, elastic-net and related methods noted especially in the context of microarray data analysis: it tends to remove highly correlated variables together or select them all, and it maintains maximal flexibility in estimating their coefficients, in particular allowing different signs; the number of selected variables is no longer limited by the sample size; and the resulting prediction accuracy is competitive with or superior to that of the alternatives. We illustrate the proposed method through extensive simulation studies. The proposed method is also applied to the analysis of a glioblastoma microarray data set.
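The following is a minimal sketch of the two-step procedure as described in the abstract, not the authors' implementation. It uses scikit-learn's LassoCV for each bootstrap fit; the subset sizes q1 and q2, the number of bootstrap samples, and the final hard threshold are illustrative assumptions rather than values taken from the paper.

```python
# Sketch of the random lasso procedure outlined above (assumptions noted in comments).
import numpy as np
from sklearn.linear_model import LassoCV

def random_lasso(X, y, q1, q2, n_bootstrap=200, random_state=0):
    rng = np.random.default_rng(random_state)
    n, p = X.shape

    def bootstrap_coefs(selection_probs, q):
        # Average lasso coefficients over bootstrap samples, each fit on a
        # random subset of q covariates drawn with the given probabilities
        # (uniform when selection_probs is None).
        coefs = np.zeros(p)
        for _ in range(n_bootstrap):
            rows = rng.integers(0, n, size=n)          # bootstrap sample of observations
            cols = rng.choice(p, size=q, replace=False, p=selection_probs)
            fit = LassoCV(cv=5).fit(X[np.ix_(rows, cols)], y[rows])
            coefs[cols] += fit.coef_
        return coefs / n_bootstrap                     # unselected covariates count as zero

    # Step 1: equal selection probabilities; importance = |average coefficient|.
    importance = np.abs(bootstrap_coefs(None, q1))
    probs = importance / importance.sum()

    # Step 2: selection probabilities proportional to the importance measures.
    beta = bootstrap_coefs(probs, q2)

    # Discard coefficients with negligible averaged magnitude (threshold 1/n is
    # one simple choice, assumed here for illustration).
    beta[np.abs(beta) < 1.0 / n] = 0.0
    return beta
```

Because each bootstrap fit sees only a random subset of covariates, strongly correlated variables are not forced to compete within every fit, which is what allows the averaged coefficients to retain all of them, possibly with different signs.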
