PALLADIO: a parallel framework for robust variable selection in high-dimensional data

Matteo Barbieri ,Samuele Fiorini ,Federico Tomasi ,Annalisa Barla

doi:10.5555/3019083.3019086

Abstract

The main goal of supervised data analytics is to model a target phenomenon given a limited amount of samples, each represented by an arbitrarily large number of variables. Especially when the number of variables is much larger than the number of available samples, variable selection is a key step as it allows to identify a possibly reduced subset of relevant variables describing the observed phenomenon. Obtaining interpretable and reliable results, in this highly indeterminate scenario, is often a non-trivial task. In this work we present PALLADIO, a framework designed for HPC cluster architectures, that is able to provide robust variable selection in high-dimensional problems. PALLADIO is developed in Python and it integrates CUDA kernels to decrease the computational time needed for several independent element-wise operations. The scalability of the proposed framework is assessed on synthetic data of different sizes, which represent realistic scenarios.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PALLADIO: a parallel framework for robust variable selection in high-dimensional data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Bayesian variable selection for high-dimensional data with an ordinal response: identifying genes associated with prognostic risk group in acute myeloid leukemia
Yiran Zhang ... Kellie J Archer
BMC Bioinformatics | VOL. 22
Yiran Zhang, et. al.Yiran Zhang ... Kellie J Archer
02 Nov 2021
BMC Bioinformatics | VOL. 22

A transparent and nonlinear method for variable selection
Keyao Wang ... Lihong Wang
Expert Systems With Applications | VOL. 237
Keyao Wang, et. al.Keyao Wang ... Lihong Wang
04 Sep 2023
Expert Systems With Applications | VOL. 237

Erratum to: Ultrahigh dimensional variable selection through the penalized maximum trimmed likelihood estimator
N M Neykov ... P Filzmoser
Statistical Papers | VOL. 55
N M Neykov, et. al.N M Neykov ... P Filzmoser
25 May 2013
Statistical Papers | VOL. 55

Nested coordinate descent algorithms for empirical likelihood
Cheng Yong Tang ... Tong Tong Wu
Journal of Statistical Computation and Simulation | VOL. 84
Cheng Yong Tang, et. al.Cheng Yong Tang ... Tong Tong Wu
18 Feb 2013
Journal of Statistical Computation and Simulation | VOL. 84

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PALLADIO: a parallel framework for robust variable selection in high-dimensional data

Abstract

Talk to us

Similar Papers