Non-reversible Parallel Tempering for Deep Posterior Approximation

Wei Deng,Qian Zhang,Qi Feng,Guang Lin,Faming Liang

doi:10.1609/aaai.v37i6.25893

Abstract

Parallel tempering (PT), also known as replica exchange, is the go-to workhorse for simulations of multi-modal distributions. The key to the success of PT is to adopt efficient swap schemes. The popular deterministic even-odd (DEO) scheme exploits the non-reversibility property and has successfully reduced the communication cost from quadratic to linear given the sufficiently many chains. However, such an innovation largely disappears in big data due to the limited chains and few bias-corrected swaps. To handle this issue, we generalize the DEO scheme to promote non-reversibility and propose a few solutions to tackle the underlying bias caused by the geometric stopping time. Notably, in big data scenarios, we obtain a nearly linear communication cost based on the optimal window size. In addition, we also adopt stochastic gradient descent (SGD) with large and constant learning rates as exploration kernels. Such a user-friendly nature enables us to conduct approximation tasks for complex posteriors without much tuning costs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Non-reversible Parallel Tempering for Deep Posterior Approximation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Examination of Optimal Window Size and Acquisition Time of Respiratory-gated PET Image: Phantom Study with a SiPM-based PET/CT Scanner
Ryotaro Sato ... Akihito Usui
Nihon Hoshasen Gijutsu Gakkai zasshi | VOL. 76
Ryotaro Sato, et. al.Ryotaro Sato ... Akihito Usui
01 Jan 2020
Nihon Hoshasen Gijutsu Gakkai zasshi | VOL. 76

Iteration and stochastic first-order oracle complexities of stochastic gradient descent using constant and decaying learning rates
Kento Imaizumi ... Hideaki Iiduka
Optimization | VOL. ahead-of-print
Kento Imaizumi, et. al.Kento Imaizumi ... Hideaki Iiduka
19 Jun 2024
Optimization | VOL. ahead-of-print

Stochastic Gradient Descent as Approximate Bayesian Inference
...
Journal of Machine Learning Research | VOL. 18
, et. al. ...
01 Jan 2017
Journal of Machine Learning Research | VOL. 18

Analysis of stochastic gradient descent in continuous time
Jonas Latz
Statistics and Computing | VOL. 31
Jonas LatzJonas Latz
09 May 2021
Statistics and Computing | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Non-reversible Parallel Tempering for Deep Posterior Approximation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence