Abstract
Stochastic gradient descent (SGD) and its variants have become increasingly popular in machine learning because of their efficiency and effectiveness. To handle large-scale problems, researchers have recently proposed several parallel SGD methods for multicore systems. However, existing parallel SGD methods cannot achieve satisfactory performance in real applications. In this paper, we propose a fast asynchronous parallel SGD method, called AsySVRG, which parallelizes the recently proposed SGD variant stochastic variance reduced gradient (SVRG) with an asynchronous strategy. AsySVRG adopts a lock-free update strategy that is more efficient than lock-based alternatives. Furthermore, we theoretically prove that AsySVRG converges with a linear convergence rate. Both theoretical and empirical results show that AsySVRG outperforms existing state-of-the-art parallel SGD methods such as Hogwild! in terms of convergence rate and computation cost.
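To make the update scheme concrete, below is a minimal, illustrative sketch of a lock-free asynchronous SVRG loop on a synthetic least-squares problem. This is not the paper's implementation: the thread count, step size, and iteration budgets are arbitrary choices, and Python's global interpreter lock means the threads mostly interleave rather than run truly in parallel. The sketch only shows the structure of the variance-reduced update applied to shared parameters without any locks.

```python
# Illustrative lock-free asynchronous SVRG sketch (not the paper's code).
import threading
import numpy as np

rng = np.random.default_rng(0)
n, d = 1000, 20
A = rng.standard_normal((n, d))
b = A @ rng.standard_normal(d) + 0.01 * rng.standard_normal(n)

w = np.zeros(d)  # shared parameter vector, updated by all threads without locks


def grad_i(v, i):
    # Gradient of the i-th least-squares term 0.5 * (a_i^T v - b_i)^2.
    return (A[i] @ v - b[i]) * A[i]


def worker(w_shared, w_snapshot, mu, inner_iters, step_size):
    # Each thread runs SVRG-style inner updates on the shared vector, no locks.
    local_rng = np.random.default_rng()
    for _ in range(inner_iters):
        i = local_rng.integers(n)
        v = w_shared.copy()  # possibly inconsistent read of shared state
        g = grad_i(v, i) - grad_i(w_snapshot, i) + mu  # variance-reduced gradient
        w_shared -= step_size * g  # in-place, lock-free update of shared memory


# Hyperparameters below are illustrative assumptions, not values from the paper.
num_threads, outer_iters, inner_iters, step_size = 4, 30, 500, 1e-3
for _ in range(outer_iters):
    w_snapshot = w.copy()                # outer-loop snapshot of the parameters
    mu = A.T @ (A @ w_snapshot - b) / n  # full gradient at the snapshot
    threads = [threading.Thread(target=worker,
                                args=(w, w_snapshot, mu, inner_iters, step_size))
               for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

print("final mean squared residual:", np.mean((A @ w - b) ** 2))
```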