Robust Adversarial Reinforcement Learning with Dissipation Inequation Constraint

Peng Zhai,Shunli Wang,Dingkang Yang,Zhiyan Dong,Lihua Zhang,Jie Luo

doi:10.1609/aaai.v36i5.20481

Abstract

Robust adversarial reinforcement learning is an effective method to train agents to manage uncertain disturbance and modeling errors in real environments. However, for systems that are sensitive to disturbances or those that are difficult to stabilize, it is easier to learn a powerful adversary than establish a stable control policy. An improper strong adversary can destabilize the system, introduce biases in the sampling process, make the learning process unstable, and even reduce the robustness of the policy. In this study, we consider the problem of ensuring system stability during training in the adversarial reinforcement learning architecture. The dissipative principle of robust H-inﬁnity control is extended to the Markov Decision Process, and robust stability constraints are obtained based on L2 gain performance in the reinforcement learning system. Thus, we propose a dissipation-inequation-constraint-based adversarial reinforcement learning architecture. This architecture ensures the stability of the system during training by imposing constraints on the normal and adversarial agents. Theoretically, this architecture can be applied to a large family of deep reinforcement learning algorithms. Results of experiments in MuJoCo and GymFc environments show that our architecture effectively improves the robustness of the controller against environmental changes and adapts to more powerful adversaries. Results of the flight experiments on a real quadcopter indicate that our method can directly deploy the policy trained in the simulation environment to the real environment, and our controller outperforms the PID controller based on hardware-in-the-loop. Both our theoretical and empirical results provide new and critical outlooks on the adversarial reinforcement learning architecture from a rigorous robust control perspective.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust Adversarial Reinforcement Learning with Dissipation Inequation Constraint

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 28, 2022
Citations: 9

Similar Papers

Path-based multi-hop reasoning over knowledge graph for answering questions via adversarial reinforcement learning
Hai Cui ... Lu Liu
Knowledge-Based Systems | VOL. 276
Hai Cui, et. al.Hai Cui ... Lu Liu
30 Jun 2023
Knowledge-Based Systems | VOL. 276

Improving Speech Separation with Adversarial Network and Reinforcement Learning
Guangcan Liu ... Xiuyi Chen
-
Guangcan Liu, et. al.Guangcan Liu ... Xiuyi Chen
01 Jul 2018
01 Jul 2018

Robust Adaptive Ensemble Adversary Reinforcement Learning
Peng Zhai ... Xiaopeng Ji
IEEE Robotics and Automation Letters | VOL. 7
Peng Zhai, et. al.Peng Zhai ... Xiaopeng Ji
01 Oct 2022
IEEE Robotics and Automation Letters | VOL. 7

Semi-Supervised Intrusion Detection System for In-Vehicle Networks Based on Variational Autoencoder and Adversarial Reinforcement Learning
Trieu-Phong Nguyen ... Daehee Kim
Knowledge-Based Systems | VOL. 304
Trieu-Phong Nguyen, et. al.Trieu-Phong Nguyen ... Daehee Kim
01 Nov 2024
Knowledge-Based Systems | VOL. 304

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust Adversarial Reinforcement Learning with Dissipation Inequation Constraint

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence