Abstract

This chapter discusses mixed-density reinforcement learning (RL)-based approximate optimal control methods applied to deterministic systems. Such methods typically require a persistence of excitation (PE) condition for convergence. This chapter discusses data-driven methods that relax the stringent PE condition by learning via simulation-based extrapolation. The development is based on the observation that, given a model of the system, RL can be implemented by evaluating the Bellman error (BE) at any number of desired points in the state space, thereby virtually simulating the system. The sections discuss necessary and sufficient conditions for optimality, regional model-based RL, local (StaF) RL, the combination of regional and local model-based RL, and RL with sparse BE extrapolation. Stability notes are included in each method's respective section.
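The central observation, that a system model lets the BE be evaluated at arbitrary state-space points rather than only along the measured trajectory, can be illustrated with a minimal sketch. The example below uses a scalar linear system and a quadratic cost with hand-picked constants (`a`, `b`, `Q`, `R`) and a one-element polynomial basis, all of which are illustrative assumptions rather than the chapter's notation; the weight fit is a crude grid search standing in for the chapter's adaptive update laws.

```python
import numpy as np

# Illustrative scalar system dx/dt = a*x + b*u with cost integrand
# Q*x^2 + R*u^2 (all constants are assumptions for this sketch).
a, b, Q, R = -1.0, 1.0, 1.0, 1.0

def sigma(x):
    # Basis for the value-function approximation V(x) ~ W . sigma(x)
    return np.array([x**2])

def grad_sigma(x):
    return np.array([2.0 * x])

def bellman_error(W, x):
    # Control derived from the current value estimate: u = -(1/2R) b dV/dx
    u = -0.5 / R * b * (W @ grad_sigma(x))
    # Continuous-time BE: dV/dx * (f + g*u) + Q*x^2 + R*u^2
    return (W @ grad_sigma(x)) * (a * x + b * u) + Q * x**2 + R * u**2

# Because the model (a, b) is known, the BE can be evaluated at ANY
# chosen states -- the system need not visit them. This is the
# "virtual simulation" idea that softens the PE requirement.
extrapolation_points = np.linspace(-2.0, 2.0, 9)

# Crude stand-in for the chapter's update laws: grid-search the weight
# that minimizes the squared BE over the extrapolation points.
candidates = np.linspace(0.0, 1.0, 10001)
costs = [sum(bellman_error(np.array([w]), x) ** 2
             for x in extrapolation_points)
         for w in candidates]
W_hat = candidates[np.argmin(costs)]
```

For this particular system the scalar Riccati equation gives the exact value-function weight W = sqrt(2) - 1, so the grid search recovers approximately 0.4142, confirming that minimizing the extrapolated BE yields the optimal value function.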
