AUTALASSO: an automatic adaptive LASSO for genome-wide prediction

Patrik Waldmann, Maja Ferenčaković, Gábor Mészáros, Ino Curik, Negar Khayatzadeh, Johann Sölkner

doi:10.1186/s12859-019-2743-3

Abstract

Background: Genome-wide prediction (GWP) has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes is routinely performed using large genomic data sets, with the number of markers ranging from several thousand to millions. The number of evaluated individuals is usually smaller, which results in problems where model sparsity is a major concern. The LASSO technique has proven to be very well suited for sparse problems, often providing excellent prediction accuracy. Several computationally efficient LASSO algorithms have been developed, but optimization of hyper-parameters can be demanding.

Results: We have developed a novel automatic adaptive LASSO (AUTALASSO) based on the alternating direction method of multipliers (ADMM) optimization algorithm. The two major hyper-parameters of ADMM are the learning rate and the regularization factor. The learning rate is automatically tuned with line search, and the regularization factor is optimized using golden section search. Results show that AUTALASSO provides superior prediction accuracy when evaluated on simulated and real bull data compared to the adaptive LASSO, LASSO and ridge regression implemented in the popular glmnet software.

Conclusions: The AUTALASSO provides a very flexible and computationally efficient approach to GWP, especially when it is important to obtain high prediction accuracy and genetic gain. The AUTALASSO also has the capability to perform genome-wide association studies (GWAS) of both additive and dominance effects with smaller prediction error than the ordinary LASSO.
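The golden section search mentioned in the abstract can be sketched in a few lines. This is a minimal, generic illustration and not the authors' implementation: the function `toy_validation_error`, the search interval, and the tolerance are hypothetical stand-ins, since the abstract does not state which objective or interval AUTALASSO uses when tuning the regularization factor.

```python
import math

def golden_section_search(f, lo, hi, tol=1e-6):
    """Minimize a unimodal function f on the interval [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0  # 1/phi, about 0.618
    # Two interior probe points placed at golden-ratio positions.
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:
            # Minimum lies in [lo, d]; the old c becomes the new d.
            hi, d, fd = d, c, fc
            c = hi - inv_phi * (hi - lo)
            fc = f(c)
        else:
            # Minimum lies in [c, hi]; the old d becomes the new c.
            lo, c, fc = c, d, fd
            d = lo + inv_phi * (hi - lo)
            fd = f(d)
    return (lo + hi) / 2.0

def toy_validation_error(log_lam):
    # Hypothetical stand-in for a validation error curve over log10(lambda);
    # in practice each evaluation would require fitting a model.
    return (log_lam - 0.5) ** 2 + 1.0

best_log_lam = golden_section_search(toy_validation_error, -3.0, 3.0)
print(round(best_log_lam, 4))
```

Each iteration reuses one of the two previous function evaluations, so only one new evaluation is needed per step. That property matters when every evaluation means refitting a model for a candidate regularization value.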

Highlights

Genome-wide prediction has become the method of choice in animal and plant breeding
The purpose of this study is to introduce proximal algorithms, with a special focus on the alternating direction method of multipliers (ADMM), into a genome-wide prediction (GWP) framework, and to develop a general approach that automatically finds the optimal values of the learning rate and the regularization parameters of an adaptive LASSO
The AUTALASSO completed in 190 s and resulted in an MSE_test of 64.34 and an r_test of 0.676
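The two test-set metrics reported in the last highlight can be computed as in the following sketch. It assumes that MSE_test is the mean squared error and r_test the Pearson correlation between observed and predicted phenotypes on held-out data; this page does not define them explicitly. The vectors `y_test` and `y_hat` are made-up illustrative values, not data from the paper.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error between observed and predicted values."""
    n = len(y_true)
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n

def pearson_r(y_true, y_pred):
    """Pearson correlation between observed and predicted values."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)

# Hypothetical observed and predicted test phenotypes.
y_test = [1.0, 2.0, 3.0, 4.0]
y_hat = [1.5, 1.5, 3.5, 3.5]

mse_test = mse(y_test, y_hat)
r_test = pearson_r(y_test, y_hat)
print(mse_test, round(r_test, 4))
```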

Summary

Introduction

Prediction of breeding values and phenotypes is routinely performed using large genomic data sets, with the number of markers ranging from several thousand to millions. Since the number of individuals is usually smaller, in the range of a few hundred to a few thousand, the result is a multivariate high-dimensional statistical problem that is often referred to as the p >> n problem [4, 5]. Regularization is a mathematical technique for imposing prior information on the structure of the solution to an optimization problem. It closely resembles the use of priors in Bayesian statistics. It is well established that the LASSO usually results in better prediction accuracy than ridge regression if the predictors display low to moderate correlation with each other [4, 9, 10].
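The different effects of the two penalties are easiest to see in the textbook special case of an orthonormal design (X'X = I), where both estimators have closed forms. With objective (1/2)||y - Xb||^2 + lam * ||b||_1, the LASSO soft-thresholds each least-squares coefficient; with (1/2)||y - Xb||^2 + (lam/2) * ||b||^2, ridge rescales each coefficient by 1/(1 + lam). The sketch below illustrates this under those assumptions. The coefficient values and `lam` are hypothetical, and real marker data are not orthonormal.

```python
def soft_threshold(b, lam):
    """Proximal operator of lam * |b|: shrink toward zero, exactly zero if |b| <= lam."""
    if b > lam:
        return b - lam
    if b < -lam:
        return b + lam
    return 0.0

def ridge_shrink(b, lam):
    """Ridge solution for one coefficient under an orthonormal design."""
    return b / (1.0 + lam)

# Hypothetical least-squares coefficients and penalty strength.
ols = [3.0, -1.5, 0.4, -0.2]
lam = 0.5

lasso = [soft_threshold(b, lam) for b in ols]
ridge = [ridge_shrink(b, lam) for b in ols]

print(lasso)  # small coefficients are set exactly to zero
print(ridge)  # all coefficients shrink proportionally, none reach zero
```

The soft-thresholding function is also the proximal operator of the L1 norm, which is the building block that proximal algorithms such as ADMM apply repeatedly when solving LASSO problems on general designs.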

Journal: BMC Bioinformatics	Publication Date: Apr 2, 2019
Citations: 21	License type: open-access
