In many real-world decision-making tasks, multiple agents need to learn to collaborate in high-dimensional, complex action spaces rather than in a single discrete action space. Recently, value decomposition learning methods such as QMIX have emerged as a promising approach to collaborative multi-agent tasks. However, most value decomposition algorithms are applicable only to discrete action spaces, which limits their practicality. To address this limitation, we propose a novel algorithm called Multi-Agent Sequential Q-Networks (MASQN), which can be applied to multi-agent domains with continuous, multidiscrete, or hybrid action spaces. The proposed algorithm is built on the centralized training with decentralized execution (CTDE) paradigm. The decentralized actors adapt to different action spaces by combining action space discretization with sequential models, and the centralized critic uses a value decomposition architecture to guide effective updates of each agent's policy parameters. We also establish the convergence of the joint policy from the perspective of policy iteration, combining the CTDE structure with the constraint imposed by the Individual-Global-Max (IGM) condition. Finally, we evaluate MASQN on two benchmark environments: MAMuJoCo and Hybrid Predator–Prey. The empirical results show that MASQN outperforms state-of-the-art methods on three different action spaces.
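For context, the Individual-Global-Max (IGM) condition invoked above is commonly stated as follows; the notation here is the standard one from the value decomposition literature and is not taken from this paper:

\[
\arg\max_{\mathbf{u}} Q_{\mathrm{tot}}(\boldsymbol{\tau}, \mathbf{u})
= \Big( \arg\max_{u^{1}} Q_{1}(\tau^{1}, u^{1}), \ldots, \arg\max_{u^{n}} Q_{n}(\tau^{n}, u^{n}) \Big),
\]

i.e., the joint greedy action under the centralized value function \(Q_{\mathrm{tot}}\) must coincide with the tuple of each agent's individually greedy action under its own utility \(Q_{i}\), which is what allows decentralized greedy execution to remain consistent with centralized training.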