Abstract

We formulate gradient-based Markov chain Monte Carlo (MCMC) sampling as optimization on the space of probability measures, with Kullback–Leibler (KL) divergence as the objective functional. We show that an underdamped form of the Langevin algorithm performs accelerated gradient descent in this metric. To characterize the convergence of the algorithm, we construct a Lyapunov functional and exploit hypocoercivity of the underdamped Langevin algorithm. As an application, we show that accelerated rates can be obtained for a class of nonconvex functions with the underdamped Langevin algorithm.
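For intuition about the algorithm the abstract refers to, the following is a minimal sketch of an Euler–Maruyama discretization of underdamped Langevin dynamics, which targets (in its position marginal) a density proportional to exp(-f(x)). The function name, the parameters `step`, `gamma` (friction), and `u` (inverse-mass), and the Gaussian usage example are illustrative assumptions, not the authors' specific discretization or analysis.

```python
import numpy as np

def underdamped_langevin(grad_f, x0, step=1e-2, gamma=2.0, u=1.0,
                         n_steps=10_000, rng=None):
    """Euler-Maruyama discretization of underdamped Langevin dynamics.

    Targets the x-marginal of a density proportional to exp(-f(x));
    `grad_f` is the gradient of f, `gamma` the friction coefficient,
    and `u` an inverse-mass parameter.
    """
    rng = np.random.default_rng() if rng is None else rng
    x = np.array(x0, dtype=float)
    v = np.zeros_like(x)
    samples = []
    for _ in range(n_steps):
        noise = rng.standard_normal(x.shape)
        # Velocity update: friction, potential force, and injected Gaussian noise.
        v = v - step * (gamma * v + u * grad_f(x)) + np.sqrt(2 * gamma * u * step) * noise
        # Position update driven by the velocity.
        x = x + step * v
        samples.append(x.copy())
    return np.array(samples)

# Usage example: sample a standard Gaussian, where f(x) = ||x||^2 / 2.
if __name__ == "__main__":
    samples = underdamped_langevin(grad_f=lambda x: x, x0=np.zeros(2))
    print(samples[2000:].mean(axis=0), samples[2000:].var(axis=0))
```

The coupled position–velocity updates are what distinguish this scheme from the overdamped Langevin algorithm; the momentum variable plays a role analogous to that in Nesterov-style accelerated gradient descent.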
