Abstract

For high-dimensional supervised learning, it is often beneficial to use domain-specific knowledge to improve the performance of statistical learning models. When the problem contains covariates which form groups, researchers can include this grouping information to find parsimonious representations of the relationship between covariates and targets. These groups may arise artificially, as from the polynomial expansion of a smaller feature space, or naturally, as from the anatomical grouping of different brain regions or the geographical grouping of different cities. When the number of features is large compared to the number of observations, one seeks a subset of the features which is sparse at both the group and global level.

Highlights

  • The sparse group lasso (Simon et al., 2013) is a penalized regression technique designed for exactly these situations

  • For the grid search strategy, our implementation is more efficient than using the base estimator with scikit-learn’s GridSearchCV because it makes use of warm-starting: the model is fit along a pre-defined regularization path, and the solution from the previous fit is used as the initial guess for the current hyperparameter value

  • Even without warm-starting, we find that the sequential model-based optimization (SMBO) strategy usually outperforms grid search because far fewer evaluations are needed to arrive at the optimal hyperparameters
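The warm-starting idea in the highlight above can be sketched with scikit-learn's ordinary Lasso (standing in for the sparse group lasso estimator, whose API is not shown here): a single estimator is refit along a decreasing regularization path, and each fit starts from the coefficients of the previous, more regularized fit. The synthetic data below is an illustrative assumption.

```python
import numpy as np
from sklearn.linear_model import Lasso

# Illustrative synthetic data: 100 observations, 20 features, 5 truly active
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 20))
beta_true = np.zeros(20)
beta_true[:5] = 1.0
y = X @ beta_true + 0.1 * rng.standard_normal(100)

# One estimator, refit along the path with warm_start=True: each .fit()
# is initialized with the coefficients of the previous fit, so successive
# fits converge faster than fitting each alpha from scratch.
model = Lasso(warm_start=True, max_iter=10000)
path = np.logspace(0, -3, 20)  # alphas from strong to weak regularization
coefs = []
for alpha in path:
    model.set_params(alpha=alpha)
    model.fit(X, y)
    coefs.append(model.coef_.copy())

# Sparsity relaxes as regularization weakens along the path
print(np.count_nonzero(coefs[0]), "->", np.count_nonzero(coefs[-1]))
```

This is the same mechanism GridSearchCV cannot exploit, because it clones a fresh estimator for every hyperparameter value.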


Summary

For high-dimensional supervised learning, it is often beneficial to use domain-specific knowledge to improve the performance of statistical learning models. When the problem contains covariates which form groups, researchers can include this grouping information to find parsimonious representations of the relationship between covariates and targets. The sparse group lasso (Simon et al., 2013) is a penalized regression technique designed for exactly these situations. It combines the original lasso (Tibshirani, 1996), which induces global sparsity, with the group lasso (Yuan & Lin, 2006), which induces group-level sparsity. It estimates a target variable y from a feature matrix X, using y = Xβ, as depicted, with color encoding the group structure of the covariates in X.

Journal of Open Source Software, 6(58), 3024. https://doi.org/10.
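The combined penalty described above can be written out concretely. Following Simon et al. (2013), the sparse group lasso penalty is a convex combination of the group lasso term (a weighted sum of per-group L2 norms, inducing group-level sparsity) and the lasso term (the L1 norm, inducing global sparsity). The sketch below computes that penalty for a hand-made coefficient vector; the group structure and parameter values are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def sgl_penalty(beta, groups, lam=1.0, alpha=0.5):
    """Sparse group lasso penalty (Simon et al., 2013):
    lam * [(1 - alpha) * sum_l sqrt(p_l) * ||beta_l||_2 + alpha * ||beta||_1],
    where p_l is the size of group l. alpha=1 recovers the lasso,
    alpha=0 recovers the group lasso."""
    group_term = sum(
        np.sqrt(len(idx)) * np.linalg.norm(beta[idx]) for idx in groups
    )
    lasso_term = np.abs(beta).sum()
    return lam * ((1 - alpha) * group_term + alpha * lasso_term)

# Two groups of three covariates: the second group is zeroed out entirely
# (group-level sparsity), and the first group is itself sparse within
# (global sparsity) -- exactly the structure the penalty rewards.
beta = np.array([1.0, 0.0, -2.0, 0.0, 0.0, 0.0])
groups = [np.arange(0, 3), np.arange(3, 6)]
print(sgl_penalty(beta, groups, lam=1.0, alpha=0.5))
```

Only the first group contributes to the group term, and only its two nonzero entries contribute to the lasso term, so coefficient vectors with this doubly sparse pattern incur a smaller penalty than dense ones of the same magnitude.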

