Sparse Group Lasso Research Articles

BackgroundSocial-environmental data obtained from the US Census is an important resource for understanding health disparities, but rarely is the full dataset utilized for analysis. A barrier to incorporating the full data is a lack of solid recommendations for variable selection, with researchers often hand-selecting a few variables. Thus, we evaluated the ability of empirical machine learning approaches to identify social-environmental factors having a true association with a health outcome.MethodsWe compared several popular machine learning methods, including penalized regressions (e.g. lasso, elastic net), and tree ensemble methods. Via simulation, we assessed the methods’ ability to identify census variables truly associated with binary and continuous outcomes while minimizing false positive results (10 true associations, 1000 total variables). We applied the most promising method to the full census data (p = 14,663 variables) linked to prostate cancer registry data (n = 76,186 cases) to identify social-environmental factors associated with advanced prostate cancer.ResultsIn simulations, we found that elastic net identified many true-positive variables, while lasso provided good control of false positives. Using a combined measure of accuracy, hierarchical clustering based on Spearman’s correlation with sparse group lasso regression performed the best overall. Bayesian Adaptive Regression Trees outperformed other tree ensemble methods, but not the sparse group lasso. In the full dataset, the sparse group lasso successfully identified a subset of variables, three of which replicated earlier findings.ConclusionsThis analysis demonstrated the potential of empirical machine learning approaches to identify a small subset of census variables having a true association with the outcome, and that replicate across empiric methods. Sparse clustered regression models performed best, as they identified many true positive variables while controlling false positive discoveries.

Recent works have shown that the resting-state brain functional connectivity hypernetwork, where multiple nodes can be connected, are an effective technique for brain disease diagnosis and classification research. The lasso method was used to construct hypernetworks by solving sparse linear regression models in previous research. But, constructing a hypernetwork based on the lasso method simply selects a single variable, in that it lacks the ability to interpret the grouping effect. Considering the group structure problem, the previous study proposed to create a hypernetwork based on the elastic net and the group lasso methods, and the results showed that the former method had the best classification performance. However, the highly correlated variables selected by the elastic net method were not necessarily in the active set in the group. Therefore, we extended our research to address this issue. Herein, we propose a new method that introduces the sparse group lasso method to improve the construction of the hypernetwork by solving the group structure problem of the brain regions. We used the traditional lasso, group lasso method, and sparse group lasso method to construct a hypernetwork in patients with depression and normal subjects. Meanwhile, other clustering coefficients (clustering coefficients based on pairs of nodes) were also introduced to extract features with traditional clustering coefficients. Two types of features with significant differences obtained after feature selection were subjected to multi-kernel learning for feature fusion and classification using each method, respectively. The network topology results revealed differences among the three networks, where hypernetwork using the lasso method was the strictest; the group lasso, most lenient; and the sgLasso method, moderate. The network topology of the sparse group lasso method was similar to that of the group lasso method but different from the lasso method. The classification results show that the sparse group lasso method achieves the best classification accuracy by using multi-kernel learning, which indicates that better classification performance can be achieved when the group structure exists and is properly extended.

Sparse Group Lasso Research Articles

Related Topics

Articles published on Sparse Group Lasso

Variable selection in social-environmental data: sparse regression and tree ensemble machine learning approaches

Convex clustering method for compositional data via sparse group lasso

Granger causality detection in high-dimensional systems using feedforward neural networks

Sparse Multicategory Generalized Distance Weighted Discrimination in Ultra-High Dimensions.

Cancer Diagnosis and Disease Gene Identification via Statistical Machine Learning

A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM

Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent

The sparse group lasso for high-dimensional integrative linear discriminant analysis with application to alzheimer's disease prediction

Adaptive sparse group LASSO in quantile regression

Sparse Elitist Group Lasso Denoising in Frequency Domain for Bearing Fault Diagnosis

Seismic Absorption Qualitative Indicator via Sparse Group-Lasso-Based Time–Frequency Representation

Accounting for grouped predictor variables or pathways in high-dimensional penalized Cox regression models

High-dimensional penalized arch processes

Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin

Penalized models for analysis of multiple mediators.

Genetic Variants Detection Based on Weighted Sparse Group Lasso

Hypernetwork Construction and Feature Fusion Analysis Based on Sparse Group Lasso Method on fMRI Dataset

Group Guided Fused Laplacian Sparse Group Lasso for Modeling Alzheimer’s Disease Progression

Schizophrenia Identification Using Multi-View Graph Measures of Functional Brain Networks.

A group lasso based sparse KNN classifier

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Sparse Group Lasso Research Articles

Related Topics

Articles published on Sparse Group Lasso

Variable selection in social-environmental data: sparse regression and tree ensemble machine learning approaches

Convex clustering method for compositional data via sparse group lasso

Granger causality detection in high-dimensional systems using feedforward neural networks

Sparse Multicategory Generalized Distance Weighted Discrimination in Ultra-High Dimensions.

Cancer Diagnosis and Disease Gene Identification via Statistical Machine Learning

A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM

Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent

The sparse group lasso for high-dimensional integrative linear discriminant analysis with application to alzheimer's disease prediction

Adaptive sparse group LASSO in quantile regression

Sparse Elitist Group Lasso Denoising in Frequency Domain for Bearing Fault Diagnosis

Seismic Absorption Qualitative Indicator via Sparse Group-Lasso-Based Time–Frequency Representation

Accounting for grouped predictor variables or pathways in high-dimensional penalized Cox regression models

High-dimensional penalized arch processes

Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza A(H1N1) virus using mutations and variations in glycosylation of Hemagglutinin

Penalized models for analysis of multiple mediators.

Genetic Variants Detection Based on Weighted Sparse Group Lasso

Hypernetwork Construction and Feature Fusion Analysis Based on Sparse Group Lasso Method on fMRI Dataset

Group Guided Fused Laplacian Sparse Group Lasso for Modeling Alzheimer’s Disease Progression

Schizophrenia Identification Using Multi-View Graph Measures of Functional Brain Networks.

A group lasso based sparse KNN classifier