Robust experience replay sampling for multi-agent reinforcement learning

Isack Thomas Nicholaus,Dae-Ki Kang

doi:10.1016/j.patrec.2021.11.006

Abstract

• Propose new algorithms for acquiring suitable experiences from buffer through filtering. • Strengthen exploration strategy by reducing repetitive decisions at a given state. • Improve performance which is higher than or comparable to the baseline algorithms. • Achieve early convergence and improved policy searching compared to the baselines. Learning from the relevant experiences leads to fast convergence if the experiences provide useful information. We present the new and simple yet efficient technique to find suitable samples of experiences to train the agents in a given state of an environment. We intended to increase the number of states visited and unique sequences that efficiently reduce the number of states the agents have to explore or exploit. Our technique implicitly introduces additional strength to the exploration-exploitation trade-off. It filters the samples of experiences that can benefit more than half the number of agents and then utilizes the experiences to extract useful information for decision making. First, we compute the similarities between the observed state and previous states in the experiences to achieve this filtering. Then, we filter the samples using the hyper-parameter, z , to decide which experiences will be suitable. We found out that agents learn quickly and efficiently since sampled experiences provide useful information that speeds up convergence. In every episode, most agents learn or contribute to improve the total expected future return. We further study our approaches’ generalization ability and present different settings to show significant improvements in diverse experiment environments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust experience replay sampling for multi-agent reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Mar 1, 2022
Citations: 6

Similar Papers

Experience Selection in Multi-agent Deep Reinforcement Learning
Yishen Wang ... Zongzhang Zhang
-
Yishen Wang, et. al.Yishen Wang ... Zongzhang Zhang
01 Nov 2019
01 Nov 2019

A Dynamically Adaptive Approach to Reducing Strategic Interference for Multiagent Systems
Wei Pan ... Nanding Wang
IEEE Transactions on Cognitive and Developmental Systems | VOL. 14
Wei Pan, et. al.Wei Pan ... Nanding Wang
01 Dec 2022
IEEE Transactions on Cognitive and Developmental Systems | VOL. 14

Autonomous Motion Decision-making based on Deep Reinforcement Learning for Autonomous Driving
Jie Hu ... Tiankuo Liu
-
Jie Hu, et. al.Jie Hu ... Tiankuo Liu
28 Oct 2022
28 Oct 2022

A Novel Lightweight Deep Convolutional Neural Network Model for Human Emotions Recognition in Diverse Environments
Tehmina Kalsum ... Zahid Mehmood
Journal of Sensors | VOL. 2023
Tehmina Kalsum, et. al.Tehmina Kalsum ... Zahid Mehmood
01 Jan 2023
Journal of Sensors | VOL. 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust experience replay sampling for multi-agent reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters