Robust adaptive variable selection in ultra-high dimensional linear regression models

Abhik Ghosh,María Jaenada,Leandro Pardo

doi:10.1080/00949655.2023.2262669

Abstract

We consider the problem of simultaneous variable selection and parameter estimation in an ultra-high dimensional linear regression model. The adaptive penalty functions are used in this regard to achieve the oracle variable selection property with simpler assumptions and lesser computational burden. Noting the non-robust nature of the usual adaptive procedures (e.g. adaptive LASSO) based on the squared error loss function against data contamination, quite frequent with modern large-scale data sets (e.g. noisy gene expression data, spectra and spectral data), in this paper, we present a new adaptive regularization procedure using a robust loss function based on the density power divergence (DPD) measure under a general class of error distributions. We theoretically prove that the proposed adaptive DPD-LASSO estimator of the regression coefficients is highly robust, consistent, asymptotically normal and leads to robust oracle-consistent variable selection under easily verifiable assumptions. Numerical illustrations are provided for the mostly used normal and heavy-tailed error densities. Finally, the proposal is applied to analyse an interesting spectral dataset, in the field of chemometrics, regarding the electron-probe X-ray microanalysis (EPXMA) of archaeological glass vessels from the 16th and 17th centuries.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust adaptive variable selection in ultra-high dimensional linear regression models

Abstract

Talk to us

Similar Papers

More From: Journal of Statistical Computation and Simulation

Lead the way for us

Journal: Journal of Statistical Computation and Simulation	Publication Date: Sep 29, 2023
Citations: 2

Similar Papers

Variable selection in high-dimensional linear model with possibly asymmetric errors
Gabriela Ciuperca
Computational Statistics & Data Analysis | VOL. 155
Gabriela CiupercaGabriela Ciuperca
14 Oct 2020
Computational Statistics & Data Analysis | VOL. 155

Mixture Modeling of Exponentiated Pareto Distribution in Bayesian Framework With Applications of Wind-Speed and Tensile Strength of Carbon Fiber
Ammara Nawaz Cheema ... Ishfaq Ahmad
IEEE Access | VOL. 8
Ammara Nawaz Cheema, et. al.Ammara Nawaz Cheema ... Ishfaq Ahmad
01 Jan 2020
IEEE Access | VOL. 8

The Effectiveness of the Squared Error and Higgins-Tsokos Loss Functions on the Bayesian Reliability Analysis of Software Failure Times under the Power Law Process
Freeh N Alenezi ... Christ P Tsokos
Engineering | VOL. 11
Freeh N Alenezi, et. al.Freeh N Alenezi ... Christ P Tsokos
01 Jan 2019
Engineering | VOL. 11

Variable selection for functional linear models with strong heredity constraint
Sanying Feng ... Menghan Zhang
Annals of the Institute of Statistical Mathematics | VOL. 74
Sanying Feng, et. al.Sanying Feng ... Menghan Zhang
28 Apr 2021
Annals of the Institute of Statistical Mathematics | VOL. 74

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust adaptive variable selection in ultra-high dimensional linear regression models

Abstract

Talk to us

Similar Papers

More From: Journal of Statistical Computation and Simulation