Abstract
Cooperation and coordination are major issues in studies of multi-agent systems because the overall performance of such systems is greatly affected by these activities. These issues are challenging, however, because appropriate coordinated behaviors depend not only on environmental characteristics but also on other agents’ strategies. Meanwhile, advances in multi-agent deep reinforcement learning (MADRL) have recently attracted attention because MADRL can considerably improve the overall performance of multi-agent systems in certain domains. The characteristics of the learned coordination structures and the agents’ resulting behaviors, however, have not been sufficiently clarified. We therefore focus here on MADRL in which agents have their own deep Q-networks (DQNs), and we analyze their coordinated behaviors and structures in the pickup and floor laying problem, which is an abstraction of our target application. In particular, we analyze behaviors around scarce resources and long narrow passages, where conflicts such as collisions are likely to occur. We then show that different types of inputs to the networks yield similar performance but generate various coordination structures and associated behaviors, such as division of labor and a shared social norm, with no direct communication.
Highlights
Cooperation and coordination are important issues in the study of multi-agent systems because they are essential for achieving the autonomous control needed to improve overall efficiency in sophisticated cooperative tasks.
We focus on multi-agent deep reinforcement learning (MADRL) for a multi-agent pickup and floor laying problem, in which the agents individually learn Q-values and the associated policies with their own deep Q-networks (DQNs) to identify cooperative and coordinated behaviors without direct communication.
We evaluated performance on the pickup and floor laying problem for agents given the proposed types of input information (relative view (RV), historic relative view (HRV), and local view (LV)) and various observable ranges; a minimal sketch of this per-agent DQN setup follows these highlights.
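To make the independent-learning setup concrete, the following is a minimal, illustrative sketch of one agent that owns its DQN and acts from local observations only, with no message exchange between agents. The network size, replay-buffer settings, absence of a separate target network, and the environment interface are simplifying assumptions made here, not details taken from the paper.

# Minimal, illustrative sketch (not the paper's implementation): each agent owns
# its own DQN and selects actions without communicating with other agents.
import random
from collections import deque

import torch
import torch.nn as nn


class IndependentDQNAgent:
    """One agent = one Q-network; agents never exchange messages."""

    def __init__(self, obs_dim, n_actions, gamma=0.99, epsilon=0.1, lr=1e-3):
        self.q_net = nn.Sequential(
            nn.Linear(obs_dim, 128), nn.ReLU(),
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, n_actions),
        )
        self.optimizer = torch.optim.Adam(self.q_net.parameters(), lr=lr)
        self.replay = deque(maxlen=50_000)   # per-agent experience buffer
        self.gamma, self.epsilon, self.n_actions = gamma, epsilon, n_actions

    def act(self, obs):
        # epsilon-greedy over this agent's own Q-values
        if random.random() < self.epsilon:
            return random.randrange(self.n_actions)
        with torch.no_grad():
            q = self.q_net(torch.as_tensor(obs, dtype=torch.float32))
            return int(q.argmax())

    def store(self, obs, action, reward, next_obs, done):
        self.replay.append((obs, action, reward, next_obs, float(done)))

    def update(self, batch_size=32):
        if len(self.replay) < batch_size:
            return
        obs, act, rew, nxt, done = zip(*random.sample(self.replay, batch_size))
        obs = torch.as_tensor(obs, dtype=torch.float32)
        nxt = torch.as_tensor(nxt, dtype=torch.float32)
        act = torch.as_tensor(act).long()
        rew = torch.as_tensor(rew, dtype=torch.float32)
        done = torch.as_tensor(done, dtype=torch.float32)
        q = self.q_net(obs).gather(1, act.unsqueeze(1)).squeeze(1)
        with torch.no_grad():   # bootstrap from the same network, for brevity
            target = rew + self.gamma * self.q_net(nxt).max(1).values * (1 - done)
        loss = nn.functional.mse_loss(q, target)
        self.optimizer.zero_grad()
        loss.backward()
        self.optimizer.step()

In a training loop, each agent would call act, store, and update on its own instance, so learning remains fully decentralized.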
Summary
Cooperation and coordination are important issues in the study of multi-agent systems because they are essential for achieving the autonomous control needed to improve overall efficiency in sophisticated cooperative tasks. It is difficult and complicated to identify appropriate coordination structures and behavioral rules or strategies, such as social norms and the division of work areas. Our experimental results indicate that agents in distributed MADRL could generate a number of coordinated structures for cooperative work, but the structures differed considerably depending on the input information fed to the agents’ own DQNs. For example, when the agents had access to their absolute locations in the environment, they formed divisional cooperation through spatial segmentation. We discuss the relationships between the agents’ coordination structures and their performance in the pickup and floor laying problem.
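As a hypothetical illustration of why the input type matters, the two functions below contrast an observation that exposes an agent’s absolute location with a purely egocentric (relative) window over a grid world; the paper’s exact RV, HRV, and LV encodings are not reproduced here, and all names and layouts are assumptions. Exposing absolute coordinates lets a policy anchor itself to fixed regions of the map, which is consistent with the spatial segmentation described above.

# Hypothetical observation encodings for illustration only.
import numpy as np


def absolute_obs(agent_pos, grid):
    """Observation that includes the agent's absolute (normalized) coordinates,
    so the learned policy can specialize on a fixed region of the map."""
    h, w = grid.shape
    coords = np.asarray(agent_pos, dtype=np.float32) / np.array([h, w], dtype=np.float32)
    return np.concatenate([coords, grid.ravel().astype(np.float32)])


def egocentric_obs(agent_pos, grid, radius=2):
    """Local window centered on the agent; no absolute coordinates are exposed,
    so coordination must rely on relative cues such as nearby walls or agents."""
    padded = np.pad(grid, radius, constant_values=-1.0)   # -1 marks out-of-map cells
    r, c = agent_pos[0] + radius, agent_pos[1] + radius
    window = padded[r - radius:r + radius + 1, c - radius:c + radius + 1]
    return window.ravel().astype(np.float32)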