Multiagent cooperation and competition with deep reinforcement learning

Ardi Tampuu,Dorian Kodelja,Jaan Aru,Juhan Aru,Ilya Kuzovkin,Kristjan Korjus,Raul Vicente,Tambet Matiisen

doi:10.1371/journal.pone.0172395

Abstract

Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as the state representation. In particular, we extend the Deep Q-Learning framework to multiagent environments to investigate the interaction between two learning agents in the well-known video game Pong. By manipulating the classical rewarding scheme of Pong we show how competitive and collaborative behaviors emerge. We also describe the progression from competitive to collaborative behavior when the incentive to cooperate is increased. Finally we show how learning by playing against another adaptive agent, instead of against a hard-wired algorithm, results in more robust strategies. The present work shows that Deep Q-Networks can become a useful tool for studying decentralized learning of multiagent systems coping with high-dimensional environments.

Highlights

In the ever-changing world biological and engineered agents need to cope with unpredictability
Instead of a single agent playing against a hardcoded algorithm, we explore how multiple agents controlled by autonomous Deep Q-Network (DQN) learn to cooperate and compete while sharing a high-dimensional environment and being fed only raw visual input
In the present work we demonstrated that agents controlled by autonomous Deep Q-Networks (DQNs) are able to learn a two player video game such as Pong from raw sensory data

Summary

Introduction

In the ever-changing world biological and engineered agents need to cope with unpredictability. By learning from trial-and-error an animal, or a robot, can adapt its behavior in a novel or changing environment. This is the main intuition behind reinforcement learning [1, 2]. A reinforcement learning agent modifies its behavior based on the rewards it collects while interacting with the environment. Collective animal behavior [4] and distributed control systems are important examples of multiple autonomous actors in dynamic environments. Phenomena such as cooperation, communication, and competition may emerge in reinforced multiagent systems

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS ONE	Publication Date: Apr 5, 2017
Citations: 610	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multiagent cooperation and competition with deep reinforcement learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

Deep reinforcement learning for automated radiation adaptation in lung cancer.
Huan‐Hsin Tseng ... Randall K Ten Haken
Medical Physics | VOL. 44
Huan‐Hsin Tseng, et. al.Huan‐Hsin Tseng ... Randall K Ten Haken
14 Nov 2017
Medical Physics | VOL. 44

EETS: An energy-efficient task scheduler in cloud computing based on improved DQN algorithm
Huanhuan Hou ... Azlan Ismail
Journal of King Saud University - Computer and Information Sciences | VOL. 36
Huanhuan Hou, et. al.Huanhuan Hou ... Azlan Ismail
31 Aug 2024
Journal of King Saud University - Computer and Information Sciences | VOL. 36

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
Yan Zheng ... Zongzhang Zhang
-
Yan Zheng, et. al.Yan Zheng ... Zongzhang Zhang
01 Jan 2018
01 Jan 2018

DDPG Agent to Swing Up and Balance Cart- Pole System
Buvanesh Pandian V
International Journal of Advanced Research in Science, Communication and Technology | VOL. -
Buvanesh Pandian VBuvanesh Pandian V
09 Apr 2021
International Journal of Advanced Research in Science, Communication and Technology | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiagent cooperation and competition with deep reinforcement learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE