PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning

Jizhou Wu,Tianpei Yang,Jianye Hao,Xiaotian Hao,Weixun Wang,Matthew E Taylor,Yan Zheng

doi:10.1609/aaai.v38i14.29524

Abstract

Despite many breakthroughs in recent years, it is still hard for MultiAgent Reinforcement Learning (MARL) algorithms to directly solve complex tasks in MultiAgent Systems (MASs) from scratch. In this work, we study how to use Automatic Curriculum Learning (ACL) to reduce the number of environmental interactions required to learn a good policy. In order to solve a difficult task, ACL methods automatically select a sequence of tasks (i.e., curricula). The idea is to obtain maximum learning progress towards the final task by continuously learning on tasks that match the current capabilities of the learners. The key question is how to measure the learning progress of the learner for better curriculum selection. We propose a novel ACL framework, PrOgRessive mulTiagent Automatic curricuLum (PORTAL), for MASs. PORTAL selects curricula according to two critera: 1) How difficult is a task, relative to the learners’ current abilities? 2) How similar is a task, relative to the final task? By learning a shared feature space between tasks, PORTAL is able to characterize different tasks based on the distribution of features and select those that are similar to the final task. Also, the shared feature space can effectively facilitate the policy transfer between curricula. Experimental results show that PORTAL can train agents to master extremely hard cooperative tasks, which can not be achieved with previous state-of-the-art MARL algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics
S Abdallah ... V Lesser
Journal of Artificial Intelligence Research | VOL. 33
S Abdallah, et. al.S Abdallah ... V Lesser
17 Dec 2008
Journal of Artificial Intelligence Research | VOL. 33

Lyapunov-Based Reinforcement Learning for Decentralized Multi-agent Control
Qingrui Zhang ... Wei Pan
-
Qingrui Zhang, et. al.Qingrui Zhang ... Wei Pan
01 Jan 2020
01 Jan 2020

A Multiagent Fuzzy Policy Reinforcement Learning Algorithm with Application to Leader-Follower Robotic Systems
Erfu Yang ... Dongbing Gu
-
Erfu Yang, et. al.Erfu Yang ... Dongbing Gu
01 Oct 2006
01 Oct 2006

Rules-PPO-QMIX: Multi-Agent Reinforcement Learning with Mixed Rules for Large Scene Tasks
Zi-Zhen Shen ... Rui Yu
-
Zi-Zhen Shen, et. al.Zi-Zhen Shen ... Rui Yu
22 Oct 2021
22 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence