Abstract

Multi-agent systems in complex, real-time domains require agents to act effectively both autonomously and as part of a team. This dissertation addresses multi-agent systems consisting of teams of autonomous agents acting in real-time, noisy, collaborative, and adversarial environments. Because of the inherent complexity of this type of multi-agent system, this thesis investigates the use of machine learning within multi-agent systems. The dissertation makes four main contributions to the fields of Machine Learning and Multi-Agent Systems.

First, the thesis defines a team member agent architecture within which a flexible team structure is presented, allowing agents to decompose the task space into flexible roles and to switch roles smoothly while acting. Team organization is achieved by the introduction of a locker-room agreement, a collection of conventions followed by all team members that defines agent roles, team formations, and pre-compiled multi-agent plans. In addition, the team member agent architecture includes a communication paradigm for domains with single-channel, low-bandwidth, unreliable communication. The communication paradigm facilitates team coordination while remaining robust to lost messages and active interference from opponents.

Second, the thesis introduces layered learning, a general-purpose machine learning paradigm for complex domains in which learning a mapping directly from agents' sensors to their actuators is intractable. Given a hierarchical task decomposition, layered learning allows for learning at each level of the hierarchy, with learning at each level directly affecting learning at the next higher level.

Third, the thesis introduces a new multi-agent reinforcement learning algorithm, namely team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL is designed for domains in which agents cannot necessarily observe the state changes caused by other team members' actions. It exploits local, action-dependent features to aggressively generalize its input representation for learning, and it partitions the task among the agents, allowing them to learn collaborative policies simultaneously by observing the long-term effects of their actions.

Fourth, the thesis contributes a fully functioning multi-agent system that incorporates learning in a real-time, noisy domain with teammates and adversaries. Detailed algorithmic descriptions of the agents' behaviors, as well as their source code, are included in the thesis.

Empirical results validate all four contributions within the simulated robotic soccer domain. The generality of the contributions is verified by applying them to the real robotic soccer and network routing domains. Ultimately, this dissertation demonstrates that by learning portions of their cognitive processes, selectively communicating, and coordinating their behaviors via common knowledge, a group of independent agents can work towards a common goal in a complex, real-time, noisy, collaborative, and adversarial environment.
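The locker-room agreement described in the first contribution is, at its core, a body of shared conventions fixed before play begins. The following Python sketch shows one plausible way such conventions could be represented; all names here (Role, Formation, LockerRoomAgreement, home_region) are illustrative assumptions for exposition, not the thesis's actual data structures.

```python
from dataclasses import dataclass

@dataclass
class Role:
    """One role in the task decomposition, e.g. a soccer position."""
    name: str
    home_region: tuple  # rough (x, y) area of responsibility

@dataclass
class Formation:
    """A named collection of roles that together cover the whole task."""
    name: str
    roles: list

@dataclass
class LockerRoomAgreement:
    """Conventions fixed before play and known to every teammate."""
    formations: dict         # formation name -> Formation
    current_formation: str   # formation the team starts in
    role_assignment: dict    # agent id -> role name
    multiagent_plans: dict   # trigger condition -> pre-compiled plan

    def switch_roles(self, agent_a: int, agent_b: int) -> None:
        """Two agents swap roles on the fly without re-planning,
        since both already know every role's responsibilities."""
        self.role_assignment[agent_a], self.role_assignment[agent_b] = (
            self.role_assignment[agent_b],
            self.role_assignment[agent_a],
        )
```

Because every teammate carries the same agreement, a role switch requires no negotiation beyond the swap itself, which is what makes smooth switching feasible over an unreliable channel.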
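The robustness of the communication paradigm hinges on two properties: each message is self-contained, so coordination survives arbitrary message loss, and a team-specific convention lets agents discard traffic injected by opponents. The sketch below illustrates those two properties only; the message encoding, team-identifier check, and staleness rule are assumptions made for this example, not the thesis's actual protocol.

```python
import json
import time

# Team identifier fixed in the locker-room agreement (an assumed convention).
TEAM_ID = "our-team"

def encode(sender_id, world_state):
    """Each message carries a timestamp and the full relevant state,
    so losing any individual message never breaks coordination."""
    return json.dumps({"team": TEAM_ID, "from": sender_id,
                       "t": time.time(), "state": world_state})

def decode(raw, newest_seen):
    """Return (state, timestamp), or None for garbled, foreign, or stale input."""
    try:
        msg = json.loads(raw)
    except (ValueError, TypeError):
        return None                  # garbled: the single channel is noisy
    if msg.get("team") != TEAM_ID:
        return None                  # opponent traffic or active interference
    if msg.get("t", 0) <= newest_seen:
        return None                  # stale: a newer message was already heard
    return msg["state"], msg["t"]
```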
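Layered learning's central mechanism is that the output of a trained lower layer becomes part of the input representation of the layer above it. The sketch below captures that bottom-up control flow under stated assumptions: learners expose fit(inputs, targets) and predict(single_input) methods, and each layer has its own training targets. The function and parameter names are illustrative, not taken from the thesis.

```python
def lower_outputs(example, trained):
    """Run the already-trained lower layers, bottom-up, on one example."""
    outputs = []
    for featurize, learner in trained:
        outputs.append(learner.predict(featurize(example, outputs)))
    return outputs

def train_layered(layers, examples, targets):
    """Train layers in sequence; each layer sees the outputs of those below.

    layers:  list of (featurize, make_learner) pairs, lowest layer first;
             featurize(example, below) builds one input vector from the raw
             example and the predictions of the already-trained layers.
    targets: targets[i] holds layer i's training labels (layered learning
             assumes each subtask has its own training signal).
    """
    trained = []
    for (featurize, make_learner), layer_targets in zip(layers, targets):
        inputs = [featurize(ex, lower_outputs(ex, trained))
                  for ex in examples]
        learner = make_learner()
        learner.fit(inputs, layer_targets)
        trained.append((featurize, learner))
    return trained
```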
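The two moves that define TPOT-RL can be made concrete in a few lines: values are stored over a coarse, action-dependent feature rather than the full state, and updates apply the long-term observed reward directly, since the state transitions that follow an action (teammates and opponents acting) are opaque to the learner. The skeleton below is an illustrative sketch under those assumptions; the feature function, exploration scheme, and update form are stand-ins, not the thesis's implementation.

```python
import random
from collections import defaultdict

class TPOTRLAgent:
    """Sketch of a team-partitioned, opaque-transition RL learner.

    Each agent learns values only for its own partition of the task,
    indexing them by a local, action-dependent feature value e(s, a)
    instead of the full world state.
    """

    def __init__(self, actions, alpha=0.1, epsilon=0.1):
        self.q = defaultdict(float)    # (feature_value, action) -> value
        self.actions = list(actions)
        self.alpha = alpha             # learning rate
        self.epsilon = epsilon         # exploration rate

    def choose_action(self, feature_of):
        """feature_of(a) maps each action to its local feature value."""
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(feature_of(a), a)])

    def update(self, feature_value, action, long_term_reward):
        """Move the stored value toward the delayed reward, observed only
        after control has long since passed to other agents."""
        key = (feature_value, action)
        self.q[key] += self.alpha * (long_term_reward - self.q[key])
```

Because all teammates run this update on their own partitions simultaneously, the team converges toward collaborative policies without any agent needing to observe the others' state transitions.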
