Learning to Communicate and Act Using Hierarchical Reinforcement Learning

Mohammad Ghavamzadeh, Sridhar Mahadevan

doi:10.1109/aamas.2004.160

Abstract

In this paper, we address the issue of rational communication behavior among autonomous agents. The goal is for agents to learn a policy that optimizes the communication needed for proper coordination, given the communication cost. We extend our previously reported cooperative hierarchical reinforcement learning (HRL) algorithm to include communication decisions and propose a new multiagent HRL algorithm, called COM-Cooperative HRL. In this algorithm, we define cooperative subtasks to be those subtasks in which coordination among agents significantly improves the performance of the overall task. The levels of the hierarchy that include cooperative subtasks are called cooperation levels. Agents learn coordination skills faster by sharing information at the cooperation levels rather than at the level of primitive actions. We add a communication level to the hierarchical decomposition of the problem below each cooperation level. Before making a decision at a cooperative subtask, agents decide whether it is worthwhile to perform a communication action. A communication action has a certain cost and provides each agent at a given cooperation level with the actions selected by the other agents at the same level. Using a multiagent taxi domain, we demonstrate the efficacy of the COM-Cooperative HRL algorithm and the relation between the communication cost and the learned communication policy.
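The trade-off between communication cost and coordination benefit can be illustrated with a toy sketch. This is not the COM-Cooperative HRL algorithm itself, only a minimal one-step example under stated assumptions: a single learning agent, two hypothetical cooperative subtasks, another agent whose choice is modeled as uniformly random, a task reward of 1 when the two agents pick different subtasks, and simple tabular value updates. The function name `learn_communication_policy` and all parameters are illustrative and do not come from the paper.

```python
import random


def learn_communication_policy(comm_cost, episodes=5000, alpha=0.05,
                               epsilon=0.1, seed=0):
    """Toy sketch: learn whether paying comm_cost to observe the other
    agent's subtask choice is worthwhile. All names are hypothetical."""
    rng = random.Random(seed)
    subtasks = [0, 1]  # assumed cooperative subtasks, e.g. two stations

    # Values at the (assumed) communication level.
    q_comm = {"communicate": 0.0, "not_communicate": 0.0}
    # Values at the cooperation level without knowing the other's choice.
    q_blind = {a: 0.0 for a in subtasks}
    # Values at the cooperation level given the other agent's choice.
    q_informed = {(o, a): 0.0 for o in subtasks for a in subtasks}

    def eps_greedy(values):
        # Explore with probability epsilon, otherwise pick the best key.
        if rng.random() < epsilon:
            return rng.choice(list(values))
        return max(values, key=values.get)

    for _ in range(episodes):
        other = rng.choice(subtasks)  # other agent modeled as uniform random
        comm_action = eps_greedy(q_comm)

        if comm_action == "communicate":
            # Pay the cost, observe the other's subtask, then choose.
            options = {a: q_informed[(other, a)] for a in subtasks}
            a = eps_greedy(options)
            task_reward = 1.0 if a != other else 0.0
            q_informed[(other, a)] += alpha * (task_reward - q_informed[(other, a)])
            total = task_reward - comm_cost
        else:
            # Choose a subtask without knowing the other's choice.
            a = eps_greedy(q_blind)
            task_reward = 1.0 if a != other else 0.0
            q_blind[a] += alpha * (task_reward - q_blind[a])
            total = task_reward

        # Update the communication-level value with the net return.
        q_comm[comm_action] += alpha * (total - q_comm[comm_action])

    return q_comm


if __name__ == "__main__":
    for cost in (0.05, 0.9):
        print(cost, learn_communication_policy(cost))
```

In this toy setting, a low cost should leave "communicate" with the higher learned value, while a high cost should favor "not_communicate", which mirrors the dependence of the learned communication policy on the communication cost described in the abstract.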

Similar Papers

Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh ... Rajbala Makar
Autonomous Agents and Multi-Agent Systems | VOL. 13
04 Apr 2006

Hierarchical Reinforcement Learning for UAV-PE Game With Alternative Delay Update Method.
Xiao Ma ... Lei Guo
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP
01 Jan 2024

Layered direct policy search for learning hierarchical skills
Felix End ... Riad Akrour
01 May 2017

Extending Hierarchical Reinforcement Learning to Continuous-Time, Average-Reward, and Multi-Agent Models
Mohammad Ghavamzadeh ... Sridhar Mahadevan
09 Jul 2003
