Solving high-dimensional Hamilton\u2013Jacobi\u2013Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space

Nikolas Nüsken,Lorenz Richter

doi:10.1007/s42985-021-00102-x

Abstract

Optimal control of diffusion processes is intimately connected to the problem of solving certain Hamilton–Jacobi–Bellman equations. Building on recent machine learning inspired approaches towards high-dimensional PDEs, we investigate the potential of iterative diffusion optimisation techniques, in particular considering applications in importance sampling and rare event simulation, and focusing on problems without diffusion control, with linearly controlled drift and running costs that depend quadratically on the control. More generally, our methods apply to nonlinear parabolic PDEs with a certain shift invariance. The choice of an appropriate loss function being a central element in the algorithmic design, we develop a principled framework based on divergences between path measures, encompassing various existing methods. Motivated by connections to forward-backward SDEs, we propose and study the novel log-variance divergence, showing favourable properties of corresponding Monte Carlo estimators. The promise of the developed approach is exemplified by a range of high-dimensional and metastable numerical examples.

Highlights

Hamilton–Jacobi–Bellman partial differential equations (HJB-PDEs) are of central importance in applied mathematics
Building on recent machine learning inspired approaches towards high-dimensional PDEs, we investigate the potential of iterative diffusion optimisation techniques, in particular considering applications in importance sampling and rare event simulation, and focusing on problems without diffusion control, with linearly controlled drift and running costs that depend quadratically on the control
We show that a variety of loss functions can be constructed and analysed in terms of divergences between probability measures on the path space associated to solutions of (1), providing a unifying framework for iterative diffusion optimisation (IDO) and extending on previous works in that direction [59,73,128]

Summary

Introduction

Hamilton–Jacobi–Bellman partial differential equations (HJB-PDEs) are of central importance in applied mathematics. We show that a variety of loss functions can be constructed and analysed in terms of divergences between probability measures on the path space associated to solutions of (1), providing a unifying framework for IDO and extending on previous works in that direction [59,73,128]. As this perspective entails the approximation of a target probability measure as a core element, our approach exposes connections to the theory of variational inference [15,124]. The aforementioned adjustments needed to establish the path space perspective often lead to faster convergence and more accurate approximation of the optimal control, as we show by means of numerical experiments

48 Page 4 of 48

Optimal control

Conditioning and rare events

Sampling problems

48 Page 8 of 48

48 Page 10 of 48

Algorithms and previous work

48 Page 12 of 48

Approximating probability measures on path space

Divergences and loss functions

48 Page 14 of 48

48 Page 16 of 48

FBSDEs and the log-variance loss

Algorithmic outline and empirical estimators

Equivalence properties in the limit of infinite batch size

48 Page 20 of 48

48 Page 22 of 48

Finite sample properties and the variance of estimators

Stability in high dimensions—robustness under tensorisation

Numerical experiments

Computational aspects

Ornstein–Uhlenbeck dynamics with linear costs

48 Page 30 of 48

Ornstein–Uhlenbeck dynamics with quadratic costs

48 Page 32 of 48

48 Page 34 of 48

Conclusion and outlook

A Appendix

48 Page 38 of 48

48 Page 40 of 48

N VarPM dPM dQM dPM dPM

Optimal control for Ornstein–Uhlenbeck dynamics with linear cost

48 Page 46 of 48

48 Page 48 of 48

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Partial Differential Equations and Applications	Publication Date: Jun 21, 2021
Citations: 20	License type: open-access

R Discovery Prime

R Discovery Prime

Solving high-dimensional Hamilton\u2013Jacobi\u2013Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Partial Differential Equations and Applications

Lead the way for us

Similar Papers

Hypothesis Testing for Rare-Event Simulation: Limitations and Possibilities
Pieter-Tjerk De Boer ... Daniël Reijsbergen
-
Pieter-Tjerk De Boer, et. al.Pieter-Tjerk De Boer ... Daniël Reijsbergen
01 Jan 2015
01 Jan 2015

Simulation Probability Density Function Design for Turbo Codes
T Sakai
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences | VOL. E88-A
T SakaiT Sakai
01 Oct 2005
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences | VOL. E88-A

Estimation variance bounds of importance sampling simulations in digital communication systems
K Yao ... D Lu
IEEE Transactions on Communications | VOL. 39
K Yao, et. al.K Yao ... D Lu
01 Jan 1991
IEEE Transactions on Communications | VOL. 39

Learning-based importance sampling via stochastic optimal control for stochastic reaction networks
Sophia Wiechert ... Chiheb Ben Hammouda
Statistics and Computing | VOL. 33
Sophia Wiechert, et. al.Sophia Wiechert ... Chiheb Ben Hammouda
28 Mar 2023
Statistics and Computing | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Solving high-dimensional Hamilton\u2013Jacobi\u2013Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Partial Differential Equations and Applications