Asynchronous Stochastic Proximal Optimization Algorithms with Variance Reduction

Qi Meng, Wei Chen, Zhi-Ming Ma, Jingcheng Yu, Tie-Yan Liu, Taifeng Wang

doi:10.1609/aaai.v31i1.10910

Abstract

Regularized empirical risk minimization (R-ERM) is an important branch of machine learning, since it constrains the capacity of the hypothesis space and guarantees the generalization ability of the learning algorithm. Two classic proximal optimization algorithms, proximal stochastic gradient descent (ProxSGD) and proximal stochastic coordinate descent (ProxSCD), have been widely used to solve the R-ERM problem. Recently, variance reduction techniques were proposed to improve ProxSGD and ProxSCD, and the resulting algorithms, ProxSVRG and ProxSVRCD, have better convergence rates. These proximal algorithms with variance reduction have also achieved great success in applications at small and moderate scales. However, to solve large-scale R-ERM problems and have more practical impact, parallel versions of these algorithms are sorely needed. In this paper, we propose asynchronous ProxSVRG (Async-ProxSVRG) and asynchronous ProxSVRCD (Async-ProxSVRCD) algorithms, and prove that Async-ProxSVRG can achieve near-linear speedup when the training data is sparse, while Async-ProxSVRCD can achieve near-linear speedup regardless of sparsity, as long as the number of block partitions is appropriately set. We have conducted experiments on a regularized logistic regression task. The results verified our theoretical findings and demonstrated the practical efficiency of the asynchronous stochastic proximal algorithms with variance reduction.
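
As a rough illustration of the variance-reduced proximal update the abstract refers to, the following is a minimal sequential sketch of a ProxSVRG-style loop for logistic regression. It is not the authors' implementation and it is not asynchronous. The L1 regularizer, the function names, the use of the last inner iterate as the next snapshot, and all parameter values are assumptions made for the example.

```python
import math
import random


def soft_threshold(v, t):
    """Proximal operator of t * |v| for a scalar v (assumes an L1 regularizer)."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def logistic_grad(w, x, y):
    """Gradient of log(1 + exp(-y * <w, x>)) w.r.t. w, with label y in {-1, +1}."""
    margin = y * sum(wj * xj for wj, xj in zip(w, x))
    # Numerically stable evaluation of sigmoid(-margin).
    if margin >= 0:
        e = math.exp(-margin)
        s = e / (1.0 + e)
    else:
        s = 1.0 / (1.0 + math.exp(margin))
    return [-y * s * xj for xj in x]


def objective(w, X, Y, lam):
    """Average logistic loss plus lam * ||w||_1."""
    total = 0.0
    for x, y in zip(X, Y):
        m = y * sum(wj * xj for wj, xj in zip(w, x))
        total += max(-m, 0.0) + math.log1p(math.exp(-abs(m)))
    return total / len(X) + lam * sum(abs(wj) for wj in w)


def prox_svrg(X, Y, lam, step, epochs, inner_steps, seed=0):
    """Sequential ProxSVRG-style sketch (hypothetical helper, not from the paper)."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    w_snap = [0.0] * d
    for _ in range(epochs):
        # Full gradient of the smooth part at the snapshot point.
        full = [0.0] * d
        for x, y in zip(X, Y):
            g = logistic_grad(w_snap, x, y)
            for j in range(d):
                full[j] += g[j] / n
        w = list(w_snap)
        for _ in range(inner_steps):
            i = rng.randrange(n)
            g_cur = logistic_grad(w, X[i], Y[i])
            g_old = logistic_grad(w_snap, X[i], Y[i])
            # Variance-reduced gradient estimate, then a proximal (soft-threshold) step.
            w = [
                soft_threshold(w[j] - step * (g_cur[j] - g_old[j] + full[j]), step * lam)
                for j in range(d)
            ]
        # Assumption: reuse the last inner iterate as the next snapshot.
        w_snap = w
    return w_snap
```

Calling `prox_svrg(X, Y, lam=0.01, step=0.2, epochs=10, inner_steps=8)` on a small list-of-lists dataset with labels in {-1, +1} returns a weight vector whose objective can be checked with `objective`. An asynchronous variant of the kind the paper studies would let several workers run the inner loop concurrently against shared parameters, which this sketch does not attempt.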

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Feb 13, 2017
Citations: 6

Similar Papers

Asynchronous Mini-Batch Gradient Descent with Variance Reduction for Non-Convex Optimization
Zhouyuan Huo ... Heng Huang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 31
13 Feb 2017

Convergence analysis of asynchronous stochastic recursive gradient algorithms
Pengfei Wang ... Nenggan Zheng
Knowledge-Based Systems | VOL. 252
29 Jun 2022

A proximal gradient algorithm for decentralized nondifferentiable optimization
Wei Shi ... Qing Ling
01 Apr 2015

Riemannian proximal stochastic gradient descent for sparse 2DPCA
Zhuan Zhang ... Ting Yang
Digital Signal Processing | VOL. 122
23 Nov 2021
