Abstract

Deep Reinforcement Learning (DRL), one of the most popular research topics in artificial intelligence, has achieved breakthroughs in continuous control tasks. Nonetheless, the instability and local optimality of DRL algorithms harm their performance. The Deep Deterministic Policy Gradient (DDPG) algorithm alleviates this problem by using a "soft" update to slow the rate of change of the target values. However, a target approximation error variance remains; this variance increases the dispersion of the data and reduces the stability of the model. This paper proposes the DDPG with averaged state-action estimation (Averaged-DDPG) algorithm. It aims to minimize the adverse effects of this variance by computing the action-value target as the average of previously learned Q-value estimates, thus reducing fluctuation during training and improving the algorithm's performance. Evaluation results on continuous control tasks show that Averaged-DDPG improves the agent's learning efficiency and training stability more effectively than the original DDPG algorithm.
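To illustrate the averaging idea described above, the following is a minimal sketch (not the authors' code): the DDPG target is formed from the mean of the Q estimates of several previously learned target-critic snapshots instead of a single target critic. The network architecture, the number of snapshots K, the batch size, and the other hyperparameters are illustrative assumptions.

# Sketch of an averaged target Q estimate for DDPG (assumed details, not the paper's implementation).
import copy
from collections import deque

import torch
import torch.nn as nn

class Critic(nn.Module):
    """A small Q-network Q(s, a); the hidden size is an arbitrary choice."""
    def __init__(self, state_dim, action_dim, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, state, action):
        return self.net(torch.cat([state, action], dim=-1))

def averaged_target_q(target_critics, next_state, next_action):
    """Average the Q estimates over the stored target-critic snapshots."""
    qs = torch.stack([c(next_state, next_action) for c in target_critics])
    return qs.mean(dim=0)

# Usage: keep a small buffer of the K most recent target-critic snapshots and
# average them when forming the TD target y = r + gamma * mean_k Q_k(s', mu'(s')).
state_dim, action_dim, K, gamma = 3, 1, 5, 0.99
critic = Critic(state_dim, action_dim)
target_critics = deque([copy.deepcopy(critic) for _ in range(K)], maxlen=K)

reward = torch.zeros(32, 1)
next_state = torch.randn(32, state_dim)
next_action = torch.randn(32, action_dim)  # would come from the target actor
with torch.no_grad():
    y = reward + gamma * averaged_target_q(target_critics, next_state, next_action)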
