Abstract

We implement a simple method to accelerate convergence to the steady state and to enhance the mixing rate of the stochastic gradient Langevin method. The ordinary stochastic gradient method is based on mini-batch learning to reduce the computational cost when the amount of data is extraordinarily large. The stochasticity of the gradient can be mitigated by the injection of Gaussian noise, which yields the stochastic gradient Langevin method; this method can be used for Bayesian posterior sampling. However, the performance of the stochastic gradient Langevin method depends on the mixing rate of the stochastic dynamics. In this study, we propose violating the detailed balance condition to enhance the mixing rate. Recent studies have revealed that violating the detailed balance condition accelerates convergence to a stationary state and reduces the correlation time between samplings. We implement this violation of the detailed balance condition in the stochastic gradient Langevin method and test the method on a simple model to demonstrate its performance.
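For concreteness, the stochastic gradient Langevin update in the conventional form introduced by Welling and Teh can be written as below; the notation here is the standard one rather than the paper's own.

```latex
% Conventional stochastic gradient Langevin update (Welling & Teh):
% theta_t: parameters, N: total number of data, n: mini-batch size,
% eps_t: step size, eta_t: injected Gaussian noise.
\begin{equation}
  \Delta\theta_t =
    \frac{\epsilon_t}{2}\left(
      \nabla\log p(\theta_t)
      + \frac{N}{n}\sum_{i=1}^{n}\nabla\log p(x_{t_i}\mid\theta_t)
    \right) + \eta_t,
  \qquad \eta_t \sim \mathcal{N}(0,\epsilon_t),
\end{equation}
% with step sizes decreased so that \sum_t \epsilon_t = \infty
% and \sum_t \epsilon_t^2 < \infty.
```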

Highlights

  • Since massive amounts of data can be acquired from various sources, the importance of so-called big-data analysis is rapidly increasing

  • In this study, we propose the application of the Ohzeki-Ichiki method to the stochastic gradient Langevin method

  • The Ohzeki-Ichiki method violates the detailed balance condition (DBC), which is a sufficient condition for convergence to a stationary state, and shows remarkably faster attainment of the stationary state than the standard equilibrium dynamics satisfying the DBC


Introduction

Since massive amounts of data can be acquired from various sources, the importance of so-called big-data analysis is rapidly increasing. Two sources of randomness appear in this setting: the stochasticity of the mini-batch gradient and the Gaussian noise injected into the dynamics. The former stochasticity is a resultant property of reducing the computational cost for large-scale data; in the latter, the noise makes the trajectory of the parameters converge to the full posterior distribution rather than just the maximum a posteriori mode. Welling and Teh proposed the combination of Langevin dynamics with the stochastic gradient method, i.e., the stochastic gradient Langevin method, to generate the posterior distribution when learning from large-scale data [16]. By decreasing the step size gradually, the injected noise becomes dominant and the effective dynamics converge to the Langevin equation with the exact gradient. An increase in the convergence speed improves the performance and reduces the computational cost of the method. This fact motivates the study of the stochastic gradient Langevin method from the point of view of nonequilibrium statistical physics. We introduce accelerated stochastic dynamics with faster convergence to a stationary state.
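As a rough illustration of the scheme described above, the following sketch implements the stochastic gradient Langevin update for a toy Gaussian-mean model; the model, the variable names, and the step-size schedule are illustrative assumptions rather than details taken from the paper.

```python
# Minimal sketch of stochastic gradient Langevin dynamics (SGLD) for a toy
# Gaussian-mean model; all names and settings here are illustrative.
import numpy as np

rng = np.random.default_rng(0)

# Toy data: N observations from a unit-variance Gaussian with unknown mean.
N, true_mean = 10_000, 2.0
data = rng.normal(true_mean, 1.0, size=N)

def grad_log_prior(theta):
    # Standard-normal prior N(0, 1): d/dtheta log p(theta) = -theta.
    return -theta

def grad_log_lik(theta, batch):
    # Unit-variance Gaussian likelihood: d/dtheta log p(x | theta) = x - theta.
    return np.sum(batch - theta)

def sgld_step(theta, step, batch):
    # Mini-batch estimate of the full-data gradient, rescaled by N / |batch|.
    grad = grad_log_prior(theta) + (N / len(batch)) * grad_log_lik(theta, batch)
    # Injected Gaussian noise with variance equal to the step size.
    return theta + 0.5 * step * grad + rng.normal(0.0, np.sqrt(step))

theta, samples = 0.0, []
for t in range(5_000):
    # Polynomially decreasing step size: sum(eps) diverges, sum(eps^2) converges.
    step = 1e-4 * (t + 10.0) ** (-0.55)
    batch = rng.choice(data, size=100, replace=False)
    theta = sgld_step(theta, step, batch)
    samples.append(theta)

print("posterior mean estimate:", np.mean(samples[1000:]))
```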

Langevin equation and its corresponding Fokker-Planck equation
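The derivations of this section are not reproduced here, but the standard overdamped Langevin equation and its Fokker-Planck counterpart, which the section title refers to, take the conventional forms below; the paper's own notation may differ in detail.

```latex
% Standard overdamped Langevin dynamics and its Fokker-Planck equation
% (conventional forms; the paper's own notation may differ).
\begin{align}
  \mathrm{d}x_t &= -\nabla U(x_t)\,\mathrm{d}t + \sqrt{2T}\,\mathrm{d}W_t,\\
  \frac{\partial P(x,t)}{\partial t}
    &= \nabla\cdot\bigl(\nabla U(x)\,P(x,t) + T\,\nabla P(x,t)\bigr),
\end{align}
% whose stationary solution is the Gibbs distribution
\begin{equation}
  P_{\mathrm{ss}}(x) \propto e^{-U(x)/T}.
\end{equation}
```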
Ohzeki-Ichiki method for the replicated system
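The Ohzeki-Ichiki construction itself is not reproduced here. As a hedged illustration of the general idea of sampling without detailed balance, the sketch below adds a rotational drift built from a constant antisymmetric matrix A to an unadjusted Langevin update. For a constant antisymmetric A, the extra drift is divergence-free under the Gibbs measure, so the target distribution is preserved while the DBC is broken; the 2-D Gaussian target, gamma, and all variable names are illustrative assumptions, not the paper's exact construction.

```python
# Illustrative irreversible Langevin sampler: an extra drift gamma * (A @ grad),
# with A antisymmetric, breaks detailed balance without changing the target.
import numpy as np

rng = np.random.default_rng(1)

# Toy target: a zero-mean, anisotropic two-dimensional Gaussian.
cov = np.array([[1.0, 0.8],
                [0.8, 2.0]])
prec = np.linalg.inv(cov)

def grad_log_target(x):
    # Gradient of log N(0, cov) up to a constant: -prec @ x.
    return -prec @ x

# Constant antisymmetric matrix; gamma controls how strongly the DBC is broken.
A = np.array([[0.0, 1.0],
              [-1.0, 0.0]])
gamma = 1.0

def irreversible_step(x, step):
    g = grad_log_target(x)
    drift = g + gamma * (A @ g)              # reversible part + rotational part
    noise = rng.normal(0.0, np.sqrt(2.0 * step), size=x.shape)
    return x + step * drift + noise

x, samples = np.zeros(2), []
for _ in range(100_000):
    x = irreversible_step(x, 1e-2)
    samples.append(x.copy())

# After discarding burn-in, the empirical covariance should approximate cov
# (up to discretization bias), despite the irreversible drift.
samples = np.array(samples[10_000:])
print("empirical covariance:\n", np.cov(samples.T))
```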
