Introducing Symmetries to Black Box Meta Reinforcement Learning

Louis Kirsch, Sebastian Flennerhag, Yutian Chen, Abram Friesen, Hado Van Hasselt, Junhyuk Oh

doi:10.1609/aaai.v36i7.20681

Abstract

Meta reinforcement learning (RL) attempts to discover new RL algorithms automatically from environment interaction. In so-called black-box approaches, the policy and the learning algorithm are jointly represented by a single neural network. These methods are very flexible, but they tend to generalise worse to new, unseen environments than human-engineered RL algorithms do. In this paper, we explore the role of symmetries in meta-generalisation. We show that a recent successful meta RL approach that meta-learns an objective for backpropagation-based learning exhibits certain symmetries (specifically the reuse of the learning rule, and invariance to input and output permutations) that are not present in typical black-box meta RL systems. We hypothesise that these symmetries can play an important role in meta-generalisation. Building on recent work in black-box supervised meta learning, we develop a black-box meta RL system that exhibits these same symmetries. We show through careful experimentation that incorporating these symmetries can lead to algorithms with a greater ability to generalise to unseen action and observation spaces, tasks, and environments.
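To make the two symmetries named in the abstract concrete, the following is a minimal toy sketch in Python, not the architecture from the paper. It assumes a layer in which one small rule with shared parameters is reused for every input-output connection, and in which each output sums the messages it receives. All names (`shared_rule`, `layer`, `theta`, `states`) are hypothetical and chosen only for illustration. Under these assumptions, reordering the inputs leaves the outputs unchanged, and reordering the outputs only reorders the result.

```python
import math
import random


def shared_rule(state, x, theta):
    """One tiny rule reused for every connection (hypothetical form).

    theta is shared across all connections; state is per-connection.
    """
    a, b = theta
    return math.tanh(a * state + b * x)


def layer(states, inputs, theta):
    """Output j sums the messages from every input i.

    The sum does not depend on the order of the inputs, and the rule
    never looks at the indices i or j themselves.
    """
    n_in, n_out = len(inputs), len(states[0])
    return [
        sum(shared_rule(states[i][j], inputs[i], theta) for i in range(n_in))
        for j in range(n_out)
    ]


def permute_rows(matrix, perm):
    return [matrix[p] for p in perm]


def permute_cols(matrix, perm):
    return [[row[p] for p in perm] for row in matrix]


random.seed(0)
theta = (0.7, -1.3)  # shared parameters (arbitrary illustrative values)
inputs = [random.uniform(-1, 1) for _ in range(4)]
states = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(4)]

baseline = layer(states, inputs, theta)

# Permute the inputs together with the per-connection states attached to them.
in_perm = [2, 0, 3, 1]
permuted_inputs = [inputs[p] for p in in_perm]
out_in = layer(permute_rows(states, in_perm), permuted_inputs, theta)

# Permute the output units (the columns of the per-connection states).
out_perm = [1, 2, 0]
out_out = layer(permute_cols(states, out_perm), inputs, theta)

# Input permutation: outputs are unchanged up to floating-point rounding.
input_invariant = all(
    math.isclose(u, v, abs_tol=1e-12) for u, v in zip(baseline, out_in)
)
# Output permutation: outputs are reordered in the same way.
output_equivariant = all(
    math.isclose(baseline[p], v, abs_tol=1e-12)
    for p, v in zip(out_perm, out_out)
)
print(input_invariant, output_equivariant)
```

A standard fully connected layer with an independent weight per connection would not pass the same checks unless its weights were permuted to match, which is one way to read the abstract's remark that typical black-box systems lack these symmetries.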

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jun 28, 2022
Citations: 4

Similar Papers

Dynamic Economic Optimization of a Continuously Stirred Tank Reactor Using Reinforcement Learning
Derek Machalek ... Titus Quah
01 Jul 2020

Biped dynamic walking using reinforcement learning
Hamid Benbrahim ... Judy A Franklin
Robotics and Autonomous Systems | VOL. 22
01 Dec 1997

Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation
Yuanpei Chen ... Hao Dong
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 46
01 May 2024

An Exploration Strategy for RL with Considerations of Budget and Risk
Jonathan Serrano Cuevas ... Eduardo Morales Manzanares
01 Jan 2017
