Twin-Delayed DDPG

Stephen Dankwa,Wenfeng Zheng

doi:10.1145/3387168.3387199

Abstract

In this current research, Twin-Delayed DDPG (TD3) algorithm has been used to solve the most challenging virtual Artificial Intelligence application by training a 4-ant-legged robot as an Intelligent Agent to run across a field. Twin-Delayed DDPG (TD3) is an incredibly smart AI model of a Deep Reinforcement Learning which combines the state-of-the-art methods in Artificial Intelligence. These includes Policy gradient, Actor-Critics, and continuous Double Deep Q-Learning. These Deep Reinforcement Learning approaches trained an Intelligent agent to interact with an environment with automatic feature engineering, that is, necessitating minimal domain knowledge. For the implementation of the TD3, we used a two-layer feedforward neural network of 400 and 300 hidden nodes respectively, with Rectified Linear Units (ReLU) as an activation function between each layer for both the Actor and Critics. We, then added a final tanh unit after the output of the Actor. The Critic receives both the state and action as input to the first layer. Both the network parameters were updated using Adam optimizer. The idea behind the Twin-Delayed DDPG (TD3) is to reduce overestimation bias in Deep Q-Learning with discrete actions which are ineffective in an Actor-Critic domain setting. Based on the Maximum Average Reward over the evaluation time-step, our model achieved an approximate maximum of 2364. Therefore, we can truly say that, TD3 has obviously improved on both the learning speed and performance of the Deep Deterministic Policy Gradient (DDPG) in a challenging environment in a continuous control domain.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Twin-Delayed DDPG

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Modeling a Continuous Locomotion Behavior of an Intelligent Agent Using Deep Reinforcement Technique
Stephen Dankwa ... Wenfeng Zheng
-
Stephen Dankwa, et. al.Stephen Dankwa ... Wenfeng Zheng
01 Aug 2019
01 Aug 2019

Morphing control of a new bionic morphing UAV with deep reinforcement learning
Dan Xu ... Gang Chen
Aerospace Science and Technology | VOL. 92
Dan Xu, et. al.Dan Xu ... Gang Chen
28 May 2019
Aerospace Science and Technology | VOL. 92

Mixed Deep Reinforcement Learning Considering Discrete-continuous Hybrid Action Space for Smart Home Energy Management
Chao Huang ... Xiong Luo
Journal of Modern Power Systems and Clean Energy | VOL. 10
Chao Huang, et. al.Chao Huang ... Xiong Luo
01 Jan 2021
Journal of Modern Power Systems and Clean Energy | VOL. 10

Deep Reinforcement Learning for Automatic Drilling Optimization Using an Integrated Reward Function
Xu Huang ... Ted Furlong
-
Xu Huang, et. al.Xu Huang ... Ted Furlong
27 Feb 2024
27 Feb 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Twin-Delayed DDPG

Abstract

Talk to us

Similar Papers