Abstract

Two novel numerical estimators are proposed for solving forward–backward stochastic differential equations (FBSDEs) appearing in the Feynman–Kac representation of the value function in stochastic optimal control problems. In contrast to current numerical approaches, which are based on discretization of the continuous-time FBSDE, we propose a converse approach: we obtain a discrete-time approximation of the value function, and then derive a discrete-time estimator that resembles the continuous-time counterpart. The proposed approach allows for the construction of higher-accuracy estimators along with an error analysis. The approach is applied to the policy improvement step in a reinforcement learning framework. Numerical results, along with the corresponding error analysis, demonstrate that the proposed estimators show significant improvement in accuracy over classical Euler–Maruyama-based estimators. In the case of linear–quadratic (LQ) problems, we demonstrate that our estimators achieve near machine-precision accuracy, in contrast to previously proposed methods that can diverge on the same problems.
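For context, the Euler–Maruyama scheme mentioned above as the classical baseline can be sketched as follows for a scalar SDE. This is an illustrative sketch only, not the paper's estimator; the drift `mu`, diffusion `sigma`, and step count `N` are assumed values for demonstration.

```python
import numpy as np

def euler_maruyama(mu, sigma, x0, T, N, rng):
    """Simulate dX_t = mu*X_t dt + sigma*X_t dW_t on [0, T] with N Euler steps."""
    dt = T / N
    x = x0
    for _ in range(N):
        dW = rng.normal(0.0, np.sqrt(dt))  # Brownian increment ~ N(0, dt)
        x = x + mu * x * dt + sigma * x * dW  # explicit Euler–Maruyama update
    return x

# Example usage with illustrative parameters (geometric Brownian motion).
rng = np.random.default_rng(0)
xT = euler_maruyama(mu=0.05, sigma=0.2, x0=1.0, T=1.0, N=1000, rng=rng)
```

The scheme's strong convergence order is only 1/2 in the step size, which is one reason discretizing the continuous-time FBSDE directly can limit accuracy.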
