Abstract
In the domain of real-world agents, the application of Reinforcement Learning (RL) remains challenging due to the necessity for safety constraints. Prior work on Constrained Reinforcement Learning (CRL) has predominantly focused on on-policy algorithms. Although these algorithms exhibit a degree of efficacy, their interaction efficiency in real-world settings is sub-optimal, highlighting the demand for more efficient off-policy methods. However, off-policy CRL algorithms struggle to estimate the C-function precisely, particularly because of fluctuations in the constrained Lagrange multiplier. Addressing this gap, our study examines the nuances of C-value estimation in off-policy CRL and introduces the Adaptive Ensemble C-learning (AEC) approach to reduce these inaccuracies. Building on state-of-the-art off-policy algorithms, we propose AEC-based CRL algorithms designed for enhanced task optimization. Extensive experiments on nine constrained robotics tasks demonstrate the superior interaction efficiency and performance of our algorithms in comparison to preceding methods.
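The abstract refers to a Lagrangian-relaxed, off-policy setting in which a constraint value function (the C-function) is learned alongside the reward critic, and an ensemble of C-estimates is used to damp estimation error. The sketch below is not the paper's AEC algorithm; it is a minimal illustration of those two ingredients, assuming a deterministic actor, a mean-aggregated ensemble of C-critics, and illustrative hyperparameters (network sizes, cost limit, learning rates) that do not come from the paper.

```python
import torch
import torch.nn as nn

OBS_DIM, ACT_DIM, N_CRITICS, GAMMA = 8, 2, 5, 0.99
COST_LIMIT, DUAL_LR = 25.0, 1e-3   # illustrative values, not taken from the paper


def mlp(in_dim, out_dim):
    return nn.Sequential(nn.Linear(in_dim, 64), nn.ReLU(), nn.Linear(64, out_dim))


# Ensemble of constraint-value critics C_i(s, a), one reward critic Q(s, a),
# a deterministic actor, and a Lagrange multiplier kept in log-space.
c_critics = nn.ModuleList([mlp(OBS_DIM + ACT_DIM, 1) for _ in range(N_CRITICS)])
q_critic = mlp(OBS_DIM + ACT_DIM, 1)
actor = mlp(OBS_DIM, ACT_DIM)
log_lambda = torch.zeros(1, requires_grad=True)

critic_opt = torch.optim.Adam(
    list(c_critics.parameters()) + list(q_critic.parameters()), lr=3e-4)
actor_opt = torch.optim.Adam(actor.parameters(), lr=3e-4)
dual_opt = torch.optim.Adam([log_lambda], lr=DUAL_LR)


def ensemble_c(obs, act):
    """Aggregate the C-critic ensemble; the mean is one simple choice."""
    sa = torch.cat([obs, act], dim=-1)
    return torch.stack([c(sa) for c in c_critics]).mean(dim=0)


def update(obs, act, rew, cost, next_obs):
    # Critic update: TD targets for Q and for every member of the C ensemble.
    with torch.no_grad():
        next_act = actor(next_obs)
        q_target = rew + GAMMA * q_critic(torch.cat([next_obs, next_act], -1))
        c_target = cost + GAMMA * ensemble_c(next_obs, next_act)
    sa = torch.cat([obs, act], dim=-1)
    critic_loss = (q_critic(sa) - q_target).pow(2).mean()
    for c in c_critics:  # each member regresses to the shared target
        critic_loss = critic_loss + (c(sa) - c_target).pow(2).mean()
    critic_opt.zero_grad()
    critic_loss.backward()
    critic_opt.step()

    # Actor update: maximise Q - lambda * C (the Lagrangian relaxation).
    pi_act = actor(obs)
    lam = log_lambda.exp().detach()
    actor_loss = (-q_critic(torch.cat([obs, pi_act], -1))
                  + lam * ensemble_c(obs, pi_act)).mean()
    actor_opt.zero_grad()
    actor_loss.backward()
    actor_opt.step()

    # Dual update: raise lambda when the estimated cost exceeds the limit.
    with torch.no_grad():
        est_cost = ensemble_c(obs, actor(obs)).mean()
    dual_loss = -(log_lambda.exp() * (est_cost - COST_LIMIT)).sum()
    dual_opt.zero_grad()
    dual_loss.backward()
    dual_opt.step()


# Usage with a random off-policy batch of 32 transitions.
batch = (torch.randn(32, OBS_DIM), torch.randn(32, ACT_DIM),
         torch.randn(32, 1), torch.rand(32, 1), torch.randn(32, OBS_DIM))
update(*batch)
```

The mean aggregation in `ensemble_c` is only one possible rule; a pessimistic choice such as taking the maximum over ensemble members would bias the constraint estimate upward and make the dual update more conservative.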