Multimodal information bottleneck for deep reinforcement learning with multiple sensors

Bang You,Huaping Liu

doi:10.1016/j.neunet.2024.106347

Abstract

Reinforcement learning has achieved promising results on robotic control tasks but struggles to leverage information effectively from multiple sensory modalities that differ in many characteristics. Recent works construct auxiliary losses based on reconstruction or mutual information to extract joint representations from multiple sensory inputs to improve the sample efficiency and performance of reinforcement learning algorithms. However, the representations learned by these methods could capture information irrelevant to learning a policy and may degrade the performance. We argue that compressing information in the learned joint representations about raw multimodal observations is helpful, and propose a multimodal information bottleneck model to learn task-relevant joint representations from egocentric images and proprioception. Our model compresses and retains the predictive information in multimodal observations for learning a compressed joint representation, which fuses complementary information from visual and proprioceptive feedback and meanwhile filters out task-irrelevant information in raw multimodal observations. We propose to minimize the upper bound of our multimodal information bottleneck objective for computationally tractable optimization. Experimental evaluations on several challenging locomotion tasks with egocentric images and proprioception show that our method achieves better sample efficiency and zero-shot robustness to unseen white noise than leading baselines. We also empirically demonstrate that leveraging information from egocentric images and proprioception is more helpful for learning policies on locomotion tasks than solely using one single modality.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal information bottleneck for deep reinforcement learning with multiple sensors

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Similar Papers

On the Performance of Reinforcement Learning Algorithms for Dynamic Matching of Renewable Energy with Flexible Loads
Majid Majidi ... Masood Parvania
-
Majid Majidi, et. al.Majid Majidi ... Masood Parvania
06 Dec 2022
06 Dec 2022

A Method for High-Value Driving Demonstration Data Generation Based on One-Dimensional Deep Convolutional Generative Adversarial Networks
Yukun Wu ... Siyuan Qiu
Electronics | VOL. 11
Yukun Wu, et. al.Yukun Wu ... Siyuan Qiu
31 Oct 2022
Electronics | VOL. 11

Parallel Curriculum Experience Replay in Distributed Reinforcement Learning
...
-
, et. al. ...
11 Apr 2021
11 Apr 2021

About the Integration of Learning and Decision-Making Models in Intelligent Systems of Real-Time
Alexander P Eremeev ... Alexander A Kozhukhov
-
Alexander P Eremeev, et. al.Alexander P Eremeev ... Alexander A Kozhukhov
05 Dec 2018
05 Dec 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal information bottleneck for deep reinforcement learning with multiple sensors

Abstract

Talk to us

Similar Papers

More From: Neural Networks