Abstract

Experience replay is a key component of off-policy reinforcement learning (RL): it allows an agent to reuse past experience and reduces the correlation between training samples. Multi-Actor-Attention-Critic (MAAC) is a successful off-policy multi-agent reinforcement learning algorithm, owing to its good scalability. To accelerate convergence, we apply prioritized experience replay (PER) to optimize experience selection in MAAC and propose the PER-MAAC algorithm. In PER-MAAC, the priority of each experience is based on its temporal-difference (TD) error during training. The algorithm is evaluated on the Multi-UAV Cooperative Navigation and Rover-Tower scenarios, and the experimental results show that PER-MAAC improves the speed of convergence.
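For readers unfamiliar with TD-error-based prioritization, the sketch below illustrates the standard prioritized replay buffer idea (Schaul et al.) that the abstract refers to. It is a minimal illustration only: the class and method names, and the hyperparameters alpha (priority exponent) and beta (importance-sampling exponent), are assumptions of this sketch and are not taken from the PER-MAAC paper itself.

```python
import numpy as np

class PrioritizedReplayBuffer:
    """Minimal sketch: sampling probability follows the absolute TD error."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha          # how strongly priority shapes sampling
        self.eps = eps              # keeps priorities strictly positive
        self.storage = []
        self.priorities = np.zeros(capacity, dtype=np.float64)
        self.pos = 0

    def add(self, transition):
        # New transitions receive the current maximum priority so they are
        # sampled at least once before their TD error is known.
        max_prio = self.priorities.max() if self.storage else 1.0
        if len(self.storage) < self.capacity:
            self.storage.append(transition)
        else:
            self.storage[self.pos] = transition
        self.priorities[self.pos] = max_prio
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        prios = self.priorities[:len(self.storage)]
        probs = prios ** self.alpha
        probs /= probs.sum()
        indices = np.random.choice(len(self.storage), batch_size, p=probs)
        # Importance-sampling weights correct the bias from non-uniform sampling.
        weights = (len(self.storage) * probs[indices]) ** (-beta)
        weights /= weights.max()
        batch = [self.storage[i] for i in indices]
        return batch, indices, weights

    def update_priorities(self, indices, td_errors):
        # Priority is set to the absolute TD error computed during the critic update.
        for idx, err in zip(indices, td_errors):
            self.priorities[idx] = abs(err) + self.eps
```

In a PER-style training loop, the critic's TD errors for the sampled batch are fed back through `update_priorities`, so transitions with larger errors are revisited more often.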
