Adversarial Inverse Reinforcement Learning With Self-Attention Dynamics Model

Jiankai Sun, Lantao Yu, Pinqian Dong, Bolei Zhou, Bo Lu

doi:10.1109/lra.2021.3061397

Abstract

In many real-world applications where specifying a proper reward function is difficult, it is desirable to learn policies from expert demonstrations. Adversarial Inverse Reinforcement Learning (AIRL) is one of the most common approaches for learning from demonstrations. However, because the policy is stochastic, the computation graph of AIRL is not end-to-end differentiable in the way that of a Generative Adversarial Network (GAN) is, so training requires high-variance gradient estimation methods and large sample sizes. In this work, we propose Model-based Adversarial Inverse Reinforcement Learning (MAIRL), an end-to-end model-based policy optimization method with self-attention. By adopting a self-attention dynamics model that makes the computation graph end-to-end differentiable, MAIRL achieves low-variance policy optimization. We evaluate our approach thoroughly on various control tasks. The experimental results show that our approach not only learns near-optimal rewards and policies that match expert behavior but also outperforms previous inverse reinforcement learning algorithms in real robot experiments. Code is available at https://decisionforce.github.io/MAIRL/.
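
The variance gap mentioned in the abstract can be illustrated on a toy problem. The sketch below is not the MAIRL algorithm and uses no dynamics model; it assumes a one-dimensional Gaussian "policy" a ~ N(mu, sigma^2) and a hypothetical objective f(a) = a^2, and compares a score-function (REINFORCE-style) gradient estimate with a pathwise estimate obtained by differentiating through the sample. The function name `compare_estimators` and all parameter values are illustrative assumptions.

```python
import random
import statistics


def compare_estimators(mu=1.0, sigma=1.0, n=100_000, seed=0):
    """Estimate d/dmu E[a^2] for a ~ N(mu, sigma^2) with two estimators.

    Toy illustration only; the analytic gradient is 2 * mu.
    """
    rng = random.Random(seed)
    score_fn, pathwise = [], []
    for _ in range(n):
        eps = rng.gauss(0.0, 1.0)
        a = mu + sigma * eps  # reparameterized sample of the action
        # Score-function estimator: f(a) * d/dmu log N(a; mu, sigma^2)
        score_fn.append(a * a * (a - mu) / sigma**2)
        # Pathwise estimator: d/dmu f(mu + sigma * eps) = 2 * a
        pathwise.append(2.0 * a)
    return {
        "score_mean": statistics.fmean(score_fn),
        "score_var": statistics.pvariance(score_fn),
        "path_mean": statistics.fmean(pathwise),
        "path_var": statistics.pvariance(pathwise),
    }


if __name__ == "__main__":
    for name, value in compare_estimators().items():
        print(f"{name}: {value:.3f}")
```

Both estimators target the same gradient, but in this toy setting the per-sample variance of the pathwise estimate is noticeably smaller, which is the kind of benefit a differentiable computation graph is meant to provide.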

Journal: IEEE Robotics and Automation Letters
Publication Date: Apr 1, 2021
Citations: 16

Similar Papers

Adversarial Confidence Learning for Medical Image Segmentation and Synthesis.
Dong Nie ... Dinggang Shen
International Journal of Computer Vision | VOL. 128
21 Mar 2020

Exploring adversarial deep learning for fusion in multi-color channel skin detection applications
Mohammed Chyad ... Vladimir Simic
Information Fusion | VOL. 114
14 Aug 2024

Generative attention adversarial classification network for unsupervised domain adaptation
Wendong Chen ... Haifeng Hu
Pattern Recognition | VOL. 107
05 Jun 2020

An Analysis on Adversarial Machine Learning: Methods and Applications
Ali Dabouei
24 May 2022
