Do Deep Reinforcement Learning Agents Model Intentions?

Tambet Matiisen,Daniel Majoral,Aqeel Labash,Raul Vicente,Jaan Aru

doi:10.3390/stats6010004

Tambet Matiisen, Daniel Majoral + Show 3 more

Open Access

https://doi.org/10.3390/stats6010004

Copy DOI

Journal: Stats	Publication Date: Dec 28, 2022
Citations: 1	License type: CC BY 4.0

Affiliation: University of Tartu

Abstract

Inferring other agents’ mental states, such as their knowledge, beliefs and intentions, is thought to be essential for effective interactions with other agents. Recently, multi-agent systems trained via deep reinforcement learning have been shown to succeed in solving various tasks. Still, how each agent models or represents other agents in their environment remains unclear. In this work, we test whether deep reinforcement learning agents trained with the multi-agent deep deterministic policy gradient (MADDPG) algorithm explicitly represent other agents’ intentions (their specific aims or plans) during a task in which the agents have to coordinate the covering of different spots in a 2D environment. In particular, we tracked over time the performance of a linear decoder trained to predict the final targets of all agents from the hidden-layer activations of each agent’s neural network controller. We observed that the hidden layers of agents represented explicit information about other agents’ intentions, i.e., the target landmark the other agent ended up covering. We also performed a series of experiments in which some agents were replaced by others with fixed targets to test the levels of generalization of the trained agents. We noticed that during the training phase, the agents developed a preference for each landmark, which hindered generalization. To alleviate the above problem, we evaluated simple changes to the MADDPG training algorithm which lead to better generalization against unseen agents. Our method for confirming intention modeling in deep learning agents is simple to implement and can be used to improve the generalization of multi-agent systems in fields such as robotics, autonomous vehicles and smart cities.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Do Deep Reinforcement Learning Agents Model Intentions?

Abstract

Talk to us

Similar Papers

More From: Stats

Lead the way for us

Similar Papers

Machine Learning Agents Augmented by Digital Twinning for Smart Production Scheduling
Kosmas Alexopoulos ... Panagiotis Mavrothalassitis
IFAC PapersOnLine | VOL. 56
Kosmas Alexopoulos, et. al.Kosmas Alexopoulos ... Panagiotis Mavrothalassitis
01 Jan 2023
IFAC PapersOnLine | VOL. 56

Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient
Shihui Li ... Stuart Russell
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Shihui Li, et. al.Shihui Li ... Stuart Russell
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
Chunyang Hu
Symmetry | VOL. 12
Chunyang HuChunyang Hu
16 Apr 2020
Symmetry | VOL. 12

A Collaborative Control Method of Dual-Arm Robots Based on Deep Reinforcement Learning
Luyu Liu ... Qianyuan Liu
Applied Sciences | VOL. 11
Luyu Liu, et. al.Luyu Liu ... Qianyuan Liu
18 Feb 2021
Applied Sciences | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Do Deep Reinforcement Learning Agents Model Intentions?

Abstract

Talk to us

Similar Papers

More From: Stats