Abstract

Variance reduction (VR) techniques for accelerating the convergence rate of the stochastic gradient descent (SGD) algorithm have received considerable attention recently. Two variants of VR, stochastic variance-reduced gradient (SVRG-SGD) and importance sampling (IS-SGD), have achieved remarkable progress. Meanwhile, asynchronous SGD (ASGD) is becoming increasingly important due to the ever-growing scale of optimization problems. Applying VR to ASGD to accelerate its convergence rate has therefore attracted much interest, and SVRG-based ASGD methods (SVRG-ASGD) have been proposed. However, we find that SVRG performs unsatisfactorily in accelerating ASGD when datasets are sparse and large-scale: in this case, the per-iteration computation cost of SVRG-ASGD is orders of magnitude higher than that of plain ASGD, which makes it very inefficient. In contrast, IS achieves an improved convergence rate with little extra computation cost and is invariant to the sparsity of the dataset. These advantages make it well suited to accelerating ASGD on large-scale sparse datasets. In this paper we propose a novel IS-combined ASGD, namely IS-ASGD, for efficient convergence-rate acceleration. We theoretically prove the superior convergence bound of IS-ASGD, and our experimental results support these claims.
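
To make the contrast concrete, the following is a minimal sketch of importance-sampling SGD for a simple least-squares objective: each example is drawn with probability proportional to its row norm (a common surrogate for the per-example Lipschitz constant) and the gradient is reweighted to stay unbiased. The loss, sampling heuristic, and all names here are illustrative assumptions, not the paper's exact algorithm; note that the reweighting touches only the sampled example, so the per-iteration cost matches plain SGD even on sparse data.

```python
import numpy as np

def is_sgd(X, y, lr=0.1, epochs=10, seed=0):
    """Illustrative importance-sampling SGD for least-squares loss.

    Example i is sampled with probability p_i proportional to its row
    norm; the gradient is scaled by 1/(n * p_i) so that the stochastic
    gradient remains an unbiased estimate of the full gradient.
    """
    n, d = X.shape
    w = np.zeros(d)

    # Sampling distribution: proportional to per-example row norms
    # (assumed heuristic, standing in for per-example smoothness).
    norms = np.linalg.norm(X, axis=1)
    p = norms / norms.sum()

    rng = np.random.default_rng(seed)
    for _ in range(epochs * n):
        i = rng.choice(n, p=p)
        # Per-example least-squares gradient: (x_i^T w - y_i) * x_i
        g = (X[i] @ w - y[i]) * X[i]
        # Reweight by 1/(n * p_i) to keep the update unbiased.
        w -= lr * g / (n * p[i])
    return w
```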
