Multi-Agent Collaborative Target Search Based on the Multi-Agent Deep Deterministic Policy Gradient with Emotional Intrinsic Motivation

Xiaoping Zhang,Yuanpeng Zheng,Fumiya Iida,Arsen Abdulali,Li Wang

doi:10.3390/app132111951

Abstract

Multi-agent collaborative target search is one of the main challenges in the multi-agent field, and deep reinforcement learning (DRL) is a good way to learn such a task. However, DRL always faces the problem of sparse reward, which to some extent reduces its efficiency in task learning. Introducing intrinsic motivation has proved to be a useful way to make the sparse reward in DRL. So, based on the multi-agent deep deterministic policy gradient (MADDPG) structure, a new MADDPG algorithm with the emotional intrinsic motivation name MADDPG-E is proposed in this paper for the multi-agent collaborative target search. In MADDPG-E, a new emotional intrinsic motivation module with three emotions, joy, sadness, and fear, is designed. The three emotions are defined by corresponding psychological knowledge to the multi-agent embodied situations in an environment. An emotional steady-state variable function H is then designed to help judge the goodness of the emotions. Based on H, an emotion-based intrinsic reward function is finally proposed. With the designed emotional intrinsic motivation module, the multi-agent system always tries to make itself joy, which means it always learns to search the target. To show the effectiveness of the proposed MADDPG-E algorithm, two kinds of simulation experiments with a determined initial position and random initial position, respectively, are carried out, and comparisons are performed with MADDPG as well as MADDPG-ICM (MADDPG with an intrinsic curiosity module). The results show that with the designed emotional intrinsic motivation module, MADDPG-E has a higher learning speed and better learning stability, and the advantage is more obvious when facing complex situations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Agent Collaborative Target Search Based on the Multi-Agent Deep Deterministic Policy Gradient with Emotional Intrinsic Motivation

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Nov 1, 2023
License type: CC BY 4.0

Similar Papers

Computation Offloading and Resource Allocation Based on Multi-agent Federated Learning
Yiming Yao ... Zheyuan Hu
-
Yiming Yao, et. al.Yiming Yao ... Zheyuan Hu
01 Jan 2021
01 Jan 2021

Intrinsic Motivation for Deep Deterministic Policy Gradient in Multi-Agent Environments
Xiaoge Cao ... Tao Lu
-
Xiaoge Cao, et. al.Xiaoge Cao ... Tao Lu
06 Nov 2020
06 Nov 2020

A Friend-or-Foe framework for multi-agent reinforcement learning policy generation in mixing cooperative–competitive scenarios
Yu Sun ... Jun Lai
Transactions of the Institute of Measurement and Control | VOL. 44
Yu Sun, et. al.Yu Sun ... Jun Lai
29 Mar 2022
Transactions of the Institute of Measurement and Control | VOL. 44

Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay
Yi Zhou ... Xiaozhi Gao
Complex & Intelligent Systems | VOL. 9
Yi Zhou, et. al.Yi Zhou ... Xiaozhi Gao
21 Feb 2023
Complex & Intelligent Systems | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Agent Collaborative Target Search Based on the Multi-Agent Deep Deterministic Policy Gradient with Emotional Intrinsic Motivation

Abstract

Talk to us

Similar Papers

More From: Applied Sciences