A New Optimal Stepsize for Approximate Dynamic Programming

Ilya O Ryzhov,Peter I Frazier,Warren B Powell

doi:10.1109/tac.2014.2357134

Abstract

Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems. The design of effective ADP algorithms has many dimensions, but one crucial factor is the stepsize rule used to update a value function approximation. Many operations research applications are computationally intensive, and it is important to obtain good results quickly. Furthermore, the most popular stepsize formulas use tunable parameters and can produce very poor results if tuned improperly. We derive a new stepsize rule that optimizes the prediction error in order to improve the short-term performance of an ADP algorithm. With only one, relatively insensitive tunable parameter, the new rule adapts to the level of noise in the problem and produces faster convergence in numerical experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A New Optimal Stepsize for Approximate Dynamic Programming

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automatic Control

Lead the way for us

Journal: IEEE Transactions on Automatic Control	Publication Date: Mar 1, 2015
Citations: 61

Similar Papers

The Delivery Dispatching Problem with Time Windows for Urban Consolidation Centers
W J A Van Heeswijk ... M R K Mes
Transportation Science | VOL. 53
W J A Van Heeswijk, et. al.W J A Van Heeswijk ... M R K Mes
11 May 2017
Transportation Science | VOL. 53

Use of Approximate Dynamic Programming for Production Optimization
Benjamin Van Roy ... Zheng Wen
-
Benjamin Van Roy, et. al.Benjamin Van Roy ... Zheng Wen
21 Feb 2011
21 Feb 2011

Real-time Energy Management for the Integrated Heat and Power System Using Approximate Dynamic Programming
Xizhen Xue ... Hang Shuai
-
Xizhen Xue, et. al.Xizhen Xue ... Hang Shuai
02 Aug 2020
02 Aug 2020

Sourcing strategies in supply risk management: An approximate dynamic programming approach
Jiarui Fang ... Tom Van Woensel
Computers & Operations Research | VOL. 40
Jiarui Fang, et. al.Jiarui Fang ... Tom Van Woensel
04 Sep 2012
Computers & Operations Research | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A New Optimal Stepsize for Approximate Dynamic Programming

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Automatic Control