Exploration of the (non-)asymptotic bias and variance of stochastic gradient Langevin dynamics

Sebastian J Vollmer, Konstantinos C Zygalakis, Yee Whye Teh

doi:10.5555/2946645.3053441

https://doi.org/10.5555/2946645.3053441

Abstract

Applying standard Markov chain Monte Carlo (MCMC) algorithms to large data sets is computationally infeasible. The recently proposed stochastic gradient Langevin dynamics (SGLD) method circumvents this problem in three ways: it generates proposed moves using only a subset of the data, it skips the Metropolis-Hastings accept-reject step, and it uses sequences of decreasing step sizes. In Teh et al. (2014), we provided the mathematical foundations for the decreasing step size SGLD, including consistency and a central limit theorem. However, in practice the SGLD is run for a relatively small number of iterations, and its step size is not decreased to zero. The present article investigates the behaviour of the SGLD with fixed step size. In particular we characterise the asymptotic bias explicitly, along with its dependence on the step size and the variance of the stochastic gradient. On that basis a modified SGLD which removes the asymptotic bias due to the variance of the stochastic gradients up to first order in the step size is derived. Moreover, we are able to obtain bounds on the finite-time bias, variance and mean squared error (MSE). The theory is illustrated with a Gaussian toy model for which the bias and the MSE for the estimation of moments can be obtained explicitly. For this toy model we study the gain of the SGLD over the standard Euler method in the limit of large data sets.
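
The fixed-step-size behaviour described in the abstract can be illustrated with a minimal sketch. It assumes a one-dimensional Gaussian model with known noise variance and a zero-mean Gaussian prior on the mean, so the exact posterior is available in closed form. The function names, step size, batch size and data-generating values below are illustrative assumptions and are not taken from the article; the modified SGLD mentioned in the abstract is not implemented here.

```python
import numpy as np


def exact_posterior(x, prior_var=1.0, noise_var=1.0):
    """Closed-form posterior N(mean, var) of theta for x_i ~ N(theta, noise_var)
    with prior theta ~ N(0, prior_var)."""
    precision = 1.0 / prior_var + x.shape[0] / noise_var
    post_var = 1.0 / precision
    post_mean = post_var * np.sum(x) / noise_var
    return post_mean, post_var


def sgld_fixed_step(x, batch_size, step, n_iter,
                    prior_var=1.0, noise_var=1.0, seed=0):
    """SGLD with a constant step size and no accept-reject step (illustrative)."""
    rng = np.random.default_rng(seed)
    n_data = x.shape[0]
    theta = 0.0
    samples = np.empty(n_iter)
    for k in range(n_iter):
        # Subsample the data without replacement.
        batch = rng.choice(x, size=batch_size, replace=False)
        # Gradient of the log prior plus a rescaled, unbiased estimate
        # of the gradient of the log likelihood.
        grad_prior = -theta / prior_var
        grad_lik = (n_data / batch_size) * np.sum(batch - theta) / noise_var
        # Langevin update: half-step along the gradient plus Gaussian noise.
        theta += 0.5 * step * (grad_prior + grad_lik)
        theta += np.sqrt(step) * rng.standard_normal()
        samples[k] = theta
    return samples


# Hypothetical data set: 1000 draws from N(0.5, 1).
x = np.random.default_rng(1).normal(loc=0.5, scale=1.0, size=1000)
post_mean, post_var = exact_posterior(x)

# Discard the first 2000 iterations as burn-in.
samples = sgld_fixed_step(x, batch_size=100, step=1e-4, n_iter=20000)[2000:]

print("posterior mean:", post_mean, "SGLD estimate:", samples.mean())
print("posterior var: ", post_var, "SGLD estimate:", samples.var())
```

Comparing the sample variance with `post_var` gives a rough view of the bias that remains when the step size is held fixed. Setting `batch_size` equal to the number of data points makes every gradient exact, which corresponds to the full-gradient Euler discretisation that the abstract compares against.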

Journal: Journal of Machine Learning Research
Publication Date: Jan 1, 2016
Citations: 54

Similar Papers

Consistency and fluctuations for stochastic gradient Langevin dynamics
...
Journal of Machine Learning Research | VOL. 17
01 Jan 2015

Characterizing Membership Privacy in Stochastic Gradient Langevin Dynamics
Bingzhe Wu ... Xiaolu Zhang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
03 Apr 2020

Functional Central Limit Theorem and Strong Law of Large Numbers for Stochastic Gradient Langevin Dynamics
A Lovas ... M Rásonyi
Applied Mathematics & Optimization | VOL. 88
28 Aug 2023

Stochastic gradient Langevin dynamics with adaptive drifts
Sehwan Kim ... Faming Liang
Journal of Statistical Computation and Simulation | VOL. ahead-of-print
29 Jul 2021
