Asynchronous Advantage Actor-Critic Algorithms Based on Residual Networks

Lili Tang

doi:10.1088/1742-6596/1302/2/022061

Abstract

Deep reinforcement learning is one of the fastest-growing areas of machine learning. The Asynchronous Advantage Actor-Critic (A3C) algorithm builds on the actor-critic framework and adds asynchronous training, which greatly speeds up training and improves performance. Although A3C runs actor-critic learners in multiple parallel threads, making effective use of computing resources and improving training effectiveness, it remains difficult to train with deep neural networks. Deep networks have been shown to scale to thousands of layers while still improving performance. However, each one-percent gain in accuracy nearly doubles the number of layers required, so it is not easy for A3C to train both the actor and the critic networks. To address this problem, we apply the residual network to the asynchronous advantage actor-critic algorithm and achieve a substantial improvement on the inverted pendulum problem.
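The abstract does not give the network architecture, so the following is only a minimal sketch of how residual blocks might sit inside an actor-critic network. It is not the author's implementation. The shared trunk, the discrete softmax policy, the layer sizes, the one-step advantage estimate, and all names (`ResidualActorCritic`, `residual_block`, `advantage`) are assumptions made for illustration. It uses only the Python standard library and shows the forward pass, not the asynchronous multi-threaded training loop.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def relu(x):
    return [max(0.0, v) for v in x]


def softmax(z):
    m = max(z)  # subtract the max for numerical stability
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def init_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; the scale is an arbitrary illustrative choice."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def residual_block(x, W1, W2):
    """Compute y = x + W2 * relu(W1 * x); the skip connection adds x back."""
    h = relu(matvec(W1, x))
    f = matvec(W2, h)
    return [xi + fi for xi, fi in zip(x, f)]


class ResidualActorCritic:
    """Hypothetical actor-critic with a shared trunk of residual blocks."""

    def __init__(self, state_dim, hidden_dim, n_actions, n_blocks=2, seed=0):
        rng = random.Random(seed)
        self.W_in = init_matrix(hidden_dim, state_dim, rng)
        self.blocks = [
            (init_matrix(hidden_dim, hidden_dim, rng),
             init_matrix(hidden_dim, hidden_dim, rng))
            for _ in range(n_blocks)
        ]
        self.W_actor = init_matrix(n_actions, hidden_dim, rng)   # policy head
        self.W_critic = init_matrix(1, hidden_dim, rng)          # value head

    def forward(self, state):
        h = relu(matvec(self.W_in, state))
        for W1, W2 in self.blocks:
            h = residual_block(h, W1, W2)
        policy = softmax(matvec(self.W_actor, h))  # actor: action probabilities
        value = matvec(self.W_critic, h)[0]        # critic: state-value estimate
        return policy, value


def advantage(reward, value, next_value, gamma=0.99):
    """One-step advantage estimate: r + gamma * V(s') - V(s)."""
    return reward + gamma * next_value - value


if __name__ == "__main__":
    net = ResidualActorCritic(state_dim=3, hidden_dim=8, n_actions=2)
    policy, value = net.forward([0.1, -0.2, 0.3])
    print("policy:", policy, "value:", value)
```

Because each block adds its input back to its output, a block whose weights are all zero reduces to the identity. This is the property usually cited for why residual connections make deeper networks easier to train.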


Journal: Journal of Physics: Conference Series
Publication Date: Aug 1, 2019
Citations: 1
License type: cc-by


