Deep Recurrent Belief Propagation Network for POMDPs

Yuhui Wang, Xiaoyang Tan

doi:10.1609/aaai.v35i11.17227

Abstract

In many real-world sequential decision-making tasks, especially continuous control tasks such as robotic control, observations are rarely perfect: the sensory data may be incomplete, noisy, or even dynamically corrupted because of unexpected malfunctions or the intrinsically low quality of the sensors. Previous methods handle these issues within the framework of POMDPs and are either deterministic, relying on feature memorization, or stochastic, relying on belief inference. In this paper, we present a new method that lies between these two families and combines the strengths of both. In particular, the proposed method, named Deep Recurrent Belief Propagation Network (DRBPN), uses a hybrid belief-updating procedure: an RNN-type feature extraction step followed by an analytical belief inference step. This significantly reduces the computational cost while faithfully capturing the complex dynamics and maintaining the uncertainty necessary for generalization. The effectiveness of the proposed method is verified on a collection of benchmark tasks, where our approach outperforms several state-of-the-art methods under various challenging scenarios.
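The abstract does not give the architecture or the form of the belief, so the following is only an illustrative sketch of what a two-stage "recurrent features, then closed-form belief update" step could look like. Everything in it is an assumption made for illustration and not taken from the paper: the Elman-style tanh cell, the diagonal Gaussian belief, the precision-weighted fusion rule, and all names (`rnn_step`, `gaussian_fuse`, `belief_update`, the `W_*` parameters).

```python
import math


def rnn_step(h, obs, W_h, W_o):
    """Elman-style recurrent cell (assumed): h' = tanh(W_h h + W_o obs)."""
    return [
        math.tanh(
            sum(w * x for w, x in zip(W_h[i], h))
            + sum(w * x for w, x in zip(W_o[i], obs))
        )
        for i in range(len(h))
    ]


def gaussian_fuse(mu_prior, var_prior, mu_evid, var_evid):
    """Closed-form product of two 1-D Gaussians (precision-weighted mean)."""
    precision = 1.0 / var_prior + 1.0 / var_evid
    var_post = 1.0 / precision
    mu_post = var_post * (mu_prior / var_prior + mu_evid / var_evid)
    return mu_post, var_post


def belief_update(belief, h, obs, params):
    """One hybrid step: deterministic feature extraction, then analytical fusion.

    `belief` is a (means, variances) pair for a diagonal Gaussian (assumed form).
    """
    # Stage 1: recurrent feature extraction from the (possibly noisy) observation.
    h_next = rnn_step(h, obs, params["W_h"], params["W_o"])

    # Hypothetical linear read-out heads turn features into Gaussian evidence.
    mu_e = [sum(w * x for w, x in zip(row, h_next)) for row in params["W_mu"]]
    var_e = [
        math.exp(sum(w * x for w, x in zip(row, h_next)))  # exp keeps variance > 0
        for row in params["W_logvar"]
    ]

    # Stage 2: analytical belief update, applied independently per dimension.
    mu, var = belief
    fused = [gaussian_fuse(m, v, me, ve) for m, v, me, ve in zip(mu, var, mu_e, var_e)]
    return ([f[0] for f in fused], [f[1] for f in fused]), h_next


# Toy usage with hand-picked weights: 2 hidden units, 1-D observation, 1-D belief.
params = {
    "W_h": [[0.5, 0.0], [0.0, 0.5]],
    "W_o": [[1.0], [-1.0]],
    "W_mu": [[1.0, 0.0]],
    "W_logvar": [[0.0, 0.1]],
}
belief, h = ([0.0], [1.0]), [0.0, 0.0]
for obs in ([0.3], [0.4], [0.2]):
    belief, h = belief_update(belief, h, obs, params)
```

In this sketch the recurrent state carries deterministic memory, while the belief's variance shrinks each time new evidence is fused in, so some explicit uncertainty is kept without sampling. Whether DRBPN uses this particular update is not stated in the abstract.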

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: May 18, 2021
Citations: 2

Similar Papers

A novel Domain Adaptive Deep Recurrent Network for multivariate time series prediction
Tao Yang ... Hongru Li
Engineering Applications of Artificial Intelligence | VOL. 106
21 Oct 2021

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks
Hakan Erdogan ... Jonathan Le Roux
01 Apr 2015

An Unsupervised Learning Algorithm for Deep Recurrent Spiking Neural Networks
Pangao Du ... Xiaomei Pi
28 Oct 2020

Learning Brain Dynamics With Coupled Low-Dimensional Nonlinear Oscillators and Deep Recurrent Networks.
Germán Abrevaya ... Silvina Ponce Dawson
Neural Computation | VOL. 33
26 Jul 2021
