BAGAIL: Multi-modal imitation learning from imbalanced demonstrations

Sijia Gu,Fei Zhu

doi:10.1016/j.neunet.2024.106251

Abstract

Expert demonstrations in imitation learning often contain different behavioral modes, e.g., driving modes such as driving on the left, keeping the lane, and driving on the right in the driving tasks. Although most existing multi-modal imitation learning methods allow learning from demonstrations of multiple modes, they have strict constraints on the data of each mode, generally requiring a near data ratio of all modes. Otherwise, it tends to fall into a mode collapse or only learn the data distribution of the mode that has the largest data volume. To address the problem, an algorithm that balances real-fake loss and classification loss by modifying the output of the discriminator, referred to as BAlanced Generative Adversarial Imitation Learning (BAGAIL), is proposed. With this modification, the generator is only rewarded for generating real trajectories with correct modes. BAGAIL is therefore able to deal with imbalanced expert demonstrations and carry out efficient learning for each mode. The learning process of BAGAIL is divided into a pre-training stage and an imitation learning stage. During the pre-training stage, BAGAIL initializes the generator parameters by means of conditional Behavioral Cloning, laying the foundation for the direction of parameter optimization. During the imitation learning stage, BAGAIL optimizes the parameters by using the adversary between the generator and the modified discriminator so that the finally obtained policy can successfully learn the distribution of imbalanced expert data. The experiments showed that BAGAIL accurately distinguished different behavioral modes with imbalanced demonstrations. What is more, the learning result of each mode is close to the expert standard and more stable than other multi-modal imitation learning methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

BAGAIL: Multi-modal imitation learning from imbalanced demonstrations

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Similar Papers

Multimodal Machine Learning for Stroke Prognosis and Diagnosis: A Systematic Review.
Saeed Shurrab ... Bartlomiej Piechowski-Jozwiak
IEEE journal of biomedical and health informatics | VOL. PP
Saeed Shurrab, et. al.Saeed Shurrab ... Bartlomiej Piechowski-Jozwiak
22 Aug 2024
IEEE journal of biomedical and health informatics | VOL. PP

Learning Multimodal Representations by Symmetrically Transferring Local Structures
Bin Dong ... Kai Lu
Symmetry | VOL. 12
Bin Dong, et. al.Bin Dong ... Kai Lu
13 Sep 2020
Symmetry | VOL. 12

Geometric Correspondence-Based Multimodal Learning for Ophthalmic Image Analysis.
Yan Wang ... Jennifer K Sun
IEEE Transactions on Medical Imaging | VOL. 43
Yan Wang, et. al.Yan Wang ... Jennifer K Sun
01 May 2024
IEEE Transactions on Medical Imaging | VOL. 43

MCL: A Contrastive Learning Method for Multimodal Data Fusion in Violence Detection
Liu Yang ... Zhenjie Wu
IEEE Signal Processing Letters | VOL. 30
Liu Yang, et. al.Liu Yang ... Zhenjie Wu
01 Jan 2023
IEEE Signal Processing Letters | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

BAGAIL: Multi-modal imitation learning from imbalanced demonstrations

Abstract

Talk to us

Similar Papers

More From: Neural Networks