Abstract

In this article, we describe an approximate dynamic programming (ADP) approach to computing lower bounds on the optimal value function in a discrete-time, continuous-space, infinite-horizon setting. The approach iteratively constructs a family of lower-bounding approximate value functions using the so-called Bellman inequality. The novelty of our approach is that, at each iteration, we seek an approximate value function that maximizes the point-wise maximum taken over the new candidate together with the family of approximate value functions computed thus far. This leads to a nonconvex objective, and we propose a gradient ascent algorithm that finds stationary points by solving a sequence of convex optimization problems. We provide convergence guarantees for our algorithm and an interpretation of how the gradient computation relates to the state-relevance weighting parameter appearing in related ADP approaches. We demonstrate through numerical examples that, compared to existing approaches, the proposed algorithm computes tighter suboptimality bounds with comparable computation time.
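To make the point-wise-maximum idea concrete, the following minimal Python sketch illustrates the shape of such an iteration; it is not the authors' implementation. It assumes, purely for illustration, a 1-D state, quadratic approximate value functions with made-up coefficients, states sampled from an assumed state-relevance density, and a plain subgradient ascent step; the projection onto the Bellman-inequality feasible set (required for the result to remain a valid lower bound) is omitted for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical family of previously computed lower bounds
# V_j(x) = p_j * x**2 + q_j * x + r_j (coefficients are illustrative only).
family = np.array([
    [0.5,  0.0, -1.0],
    [0.2,  1.0, -0.5],
    [0.8, -1.0,  0.0],
])

def v_quad(theta, x):
    """Evaluate a quadratic value function with parameters theta = (p, q, r)."""
    p, q, r = theta
    return p * x**2 + q * x + r

def v_family_max(x):
    """Point-wise maximum over the existing family, itself a lower bound."""
    return max(v_quad(theta, x) for theta in family)

# Sampled surrogate of the objective E_c[ max(V_theta(x), max_j V_j(x)) ],
# with states drawn from an assumed state-relevance density c (here N(0, 1)).
samples = rng.normal(0.0, 1.0, size=200)

def objective_and_grad(theta):
    """Surrogate objective and its subgradient; the objective is nonconvex in
    theta, and the subgradient at a sample x is grad V_theta(x) wherever the
    new candidate is active in the point-wise maximum, and zero otherwise."""
    obj, grad = 0.0, np.zeros(3)
    for x in samples:
        v_new, v_old = v_quad(theta, x), v_family_max(x)
        obj += max(v_new, v_old)
        if v_new > v_old:                    # theta's component is active
            grad += np.array([x**2, x, 1.0])
    n = len(samples)
    return obj / n, grad / n

# Plain (sub)gradient ascent; the Bellman-inequality projection that the
# abstract's convex subproblems would enforce is omitted in this sketch.
theta = np.array([0.9, 0.0, -0.5])
for it in range(50):
    obj, grad = objective_and_grad(theta)
    theta += 0.1 * grad

print("final surrogate objective:", obj)
```

Note that the subgradient at each sample is nonzero only where the new candidate is active in the point-wise maximum, so the update effectively concentrates weight on the region of the state space that the new function improves; this is one way to read the abstract's remark connecting the gradient computation to the state-relevance weighting parameter.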
