Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning

Giseung Park,Sungho Choi,Youngchul Sung

doi:10.1609/aaai.v36i7.20764

Abstract

This paper proposes a new sequential model learning architecture to solve partially observable Markov decision problems. Rather than compressing sequential information at every timestep as in conventional recurrent neural network-based methods, the proposed architecture generates a latent variable in each data block with a length of multiple timesteps and passes the most relevant information to the next block for policy optimization. The proposed blockwise sequential model is implemented based on self-attention, making the model capable of detailed sequential learning in partial observable settings. The proposed model builds an additional learning network to efficiently implement gradient estimation by using self-normalized importance sampling, which does not require the complex blockwise input data reconstruction in the model learning. Numerical results show that the proposed method significantly outperforms previous methods in various partially observable environments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 28, 2022
Citations: 2

Similar Papers

On the Evaluation of Sequential Machine Learning for Network Intrusion Detection
Andrea Corsini ... Shanchieh Jay Yang
-
Andrea Corsini, et. al.Andrea Corsini ... Shanchieh Jay Yang
17 Aug 2021
17 Aug 2021

Machine learning with knowledge constraints for process optimization of open-air perovskite solar cell manufacturing
Zhe Liu ... Austin C Flick
Joule | VOL. 6
Zhe Liu, et. al.Zhe Liu ... Austin C Flick
01 Apr 2022
Joule | VOL. 6

A sequential learning method with Kalman filter and extreme learning machine for regression and time series forecasting
Jarley P Nóbrega ... Adriano L.I Oliveira
Neurocomputing | VOL. 337
Jarley P Nóbrega, et. al.Jarley P Nóbrega ... Adriano L.I Oliveira
02 Feb 2019
Neurocomputing | VOL. 337

Sequential Learning by Touch, Vision, and Audition
Christopher M Conway ... Morten H Christiansen
-
Christopher M Conway, et. al.Christopher M Conway ... Morten H Christiansen
24 Apr 2019
24 Apr 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence