Reinforcement Learning in Latent Action Sequence Space

Heecheol Kim,Masanori Yamada,Hiroshi Yamakawa,Tomoharu Iwata,Kosuke Miyoshi

doi:10.1109/iros45743.2020.9341629

Abstract

One problem in real-world applications of reinforcement learning is the high dimensionality of the action search spaces, which comes from the combination of actions over time. To reduce the dimensionality of action sequence search spaces, macro actions have been studied, which are sequences of primitive actions to solve tasks. However, previous studies relied on humans to define macro actions or assumed macro actions to be repetitions of the same primitive actions. We propose encoded action sequence reinforcement learning (EASRL), a reinforcement learning method that learns flexible sequences of actions in a latent space for a high-dimensional action sequence search space. With EASRL, encoder and decoder networks are trained with demonstration data by using variational autoencoders for mapping macro actions into the latent space. Then, we learn a policy network in the latent space, which is a distribution over encoded macro actions given a state. By learning in the latent space, we can reduce the dimensionality of the action sequence search space and handle various patterns of action sequences. We experimentally demonstrate that the proposed method outperforms other reinforcement learning methods on tasks that require an extensive amount of search.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reinforcement Learning in Latent Action Sequence Space

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Reusability and Transferability of Macro Actions for Reinforcement Learning
Yi-Hsiang Chang ... Kuan-Yu Chang
ACM Transactions on Evolutionary Learning and Optimization | VOL. 2
Yi-Hsiang Chang, et. al.Yi-Hsiang Chang ... Kuan-Yu Chang
31 Mar 2022
ACM Transactions on Evolutionary Learning and Optimization | VOL. 2

Author response: Model-based whole-brain perturbational landscape of neurodegenerative diseases
Yonatan Sanz Perl ... Pavel Prado
-
Yonatan Sanz Perl, et. al.Yonatan Sanz Perl ... Pavel Prado
09 Mar 2023
09 Mar 2023

Decision letter: Model-based whole-brain perturbational landscape of neurodegenerative diseases
Jordi A Matias-Guiu ... Timothy E Behrens
-
Jordi A Matias-Guiu, et. al.Jordi A Matias-Guiu ... Timothy E Behrens
13 Jan 2023
13 Jan 2023

Editor's evaluation: Model-based whole-brain perturbational landscape of neurodegenerative diseases
Muireann Irish
-
Muireann IrishMuireann Irish
13 Jan 2023
13 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforcement Learning in Latent Action Sequence Space

Abstract

Talk to us

Similar Papers