GLDAP: Global Dynamic Action Persistence Adaptation for Deep Reinforcement Learning

Junbo Tong,Yi Liu,Wenhui Fan,Daming Shi

doi:10.1145/3590154

Abstract

In the implementation of deep reinforcement learning (DRL), action persistence strategies are often adopted so agents maintain their actions for a fixed or variable number of steps. The choice of the persistent duration for agent actions usually has notable effects on the performance of reinforcement learning algorithms. Aiming at the research gap of global dynamic optimal action persistence and its application in multi-agent systems, we propose a novel framework: global dynamic action persistence (GLDAP), which achieves global action persistence adaptation for deep reinforcement learning. We introduce a closed-loop method that is used to learn the estimated value and the corresponding policy of each candidate action persistence. Our experiment shows that GLDAP achieves an average of 2.5%~90.7% performance improvement and 3~20 times higher sampling efficiency over several baselines across various single-agent and multi-agent domains. We also validate the ability of GLDAP to determine the optimal action persistence through multiple experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

GLDAP: Global Dynamic Action Persistence Adaptation for Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Autonomous and Adaptive Systems

Lead the way for us

Journal: ACM Transactions on Autonomous and Adaptive Systems	Publication Date: May 28, 2023
Citations: 1

Similar Papers

Sample effficient deep reinforcement learning for control

-

15 Dec 2019
15 Dec 2019

Deep Reinforcement Learning: A New Frontier in Computer Vision Research
Sejuti Rahman ... A K M Nadimul Haque
-
Sejuti Rahman, et. al.Sejuti Rahman ... A K M Nadimul Haque
01 Jan 2020
01 Jan 2020

Deep reinforcement learning and its applications in medical imaging and radiation therapy: a survey
Lanyu Xu ... Ning Wen
Physics in Medicine & Biology | VOL. 67
Lanyu Xu, et. al.Lanyu Xu ... Ning Wen
11 Nov 2022
Physics in Medicine & Biology | VOL. 67

Break through the limits of learning by machines
Zhongzhi Shi
Chinese Science Bulletin | VOL. 61
Zhongzhi ShiZhongzhi Shi
20 Sep 2016
Chinese Science Bulletin | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

GLDAP: Global Dynamic Action Persistence Adaptation for Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Autonomous and Adaptive Systems