Deep Q‐learning: A robust control approach

Balázs Varga,Morteza Haghir Chehreghani,Balázs Kulcsár

doi:10.1002/rnc.6457

Balázs Varga, Morteza Haghir Chehreghani + Show 1 more

Open Access

https://doi.org/10.1002/rnc.6457

Copy DOI

Abstract

AbstractThis work aims at constructing a bridge between robust control theory and reinforcement learning. Although, reinforcement learning has shown admirable results in complex control tasks, the agent's learning behavior is opaque. Meanwhile, system theory has several tools for analyzing and controlling dynamical systems. This article places deep Q‐learning is into a control‐oriented perspective to study its learning dynamics with well‐established techniques from robust control. An uncertain linear time‐invariant model is formulated by means of the neural tangent kernel to describe learning. This novel approach allows giving conditions for stability (convergence) of the learning and enables the analysis of the agent's behavior in frequency‐domain. The control‐oriented approach makes it possible to formulate robust controllers that inject dynamical rewards as control input in the loss function to achieve better convergence properties. Three output‐feedback controllers are synthesized: gain scheduling , dynamical , and fixed‐structure controllers. Compared to traditional deep Q‐learning techniques, which involve several heuristics, setting up the learning agent with a control‐oriented tuning methodology is more transparent and has well‐established literature. The proposed approach does not use a target network and randomized replay memory. The role of the target network is overtaken by the control input, which also exploits the temporal dependency of samples (opposed to a randomized memory buffer). Numerical simulations in different OpenAI Gym environments suggest that the controlled learning can converge faster and receive higher scores (depending on the environment) compared to the benchmark double deep Q‐learning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Robust and Nonlinear Control	Publication Date: Oct 29, 2022
Citations: 7	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Deep Q‐learning: A robust control approach

Abstract

Talk to us

Similar Papers

More From: International Journal of Robust and Nonlinear Control

Lead the way for us

Similar Papers

Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning.
Wenjie Shi ... Cheng Wu
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44
Wenjie Shi, et. al.Wenjie Shi ... Cheng Wu
01 Dec 2022
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44

Switching dynamics of multi-agent learning
...
-
, et. al. ...
12 May 2008
12 May 2008

The driver and the engineer: Reinforcement learning and robust control
Natalie Bernat ... John Doyle
-
Natalie Bernat, et. al.Natalie Bernat ... John Doyle
01 Jul 2020
01 Jul 2020

A Robust Reinforcement Learning Control Design Method for Nonlinear System with Partially Unknown Structure
Kazuhiro Nakano ... Takashi Kuremoto
IEEJ Transactions on Electronics, Information and Systems | VOL. 130
Kazuhiro Nakano, et. al.Kazuhiro Nakano ... Takashi Kuremoto
01 Jan 2009
IEEJ Transactions on Electronics, Information and Systems | VOL. 130

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Q‐learning: A robust control approach

Abstract

Talk to us

Similar Papers

More From: International Journal of Robust and Nonlinear Control