Hierarchical Policies of Subgoals for Safe Deep Reinforcement Learning

Fumin Yu,Yao Yuan,Xiaofei Xing,Feng Gao,Yinglong Dai

doi:10.1007/978-981-99-0272-9_15

Abstract

AbstractReinforcement learning is a machine learning method that relies on the agent to learn by trial and error to solve decision optimization problems. It is well known that an agent based on deep reinforcement learning in complex environments is difficult to train. Moreover, the agent will generate unsafe and strange actions due to the lack of sufficient reward feedback from the environment. To make the agent converge to a better policy and make its behavior safer and more controllable under sparse rewards, we propose a subgoal embedding method based on prior knowledge and hierarchical strategy that can make the training process converge faster. The subgoal embedding method can be combined with existing reinforcement learning methods. In this paper, we combine the subgoal embedding method with REINFORCE algorithm and PPO(Proximal Policy Optimization) algorithm to test the method in the MiniGrid-DoorKey game environment of the gym platform. The experiments demonstrate the effectiveness of the subgoal embedding method.KeywordsReinforcement learningDeep reinforcement learningSubgoal embeddingSparse rewardHierarchical strategiesSafe agent

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hierarchical Policies of Subgoals for Safe Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Deep Reinforcement Learning for Robotic Hand Manipulation
Muhammed Saeed ... Mohammed Nagdi
-
Muhammed Saeed, et. al.Muhammed Saeed ... Mohammed Nagdi
26 Feb 2021
26 Feb 2021

A Survey of Multi-Task Deep Reinforcement Learning
Nelson Vithayathil Varghese ... Qusay H Mahmoud
Electronics | VOL. 9
Nelson Vithayathil Varghese, et. al.Nelson Vithayathil Varghese ... Qusay H Mahmoud
22 Aug 2020
Electronics | VOL. 9

Evaluating the Efficacy of Different Neural Network Deep Reinforcement Algorithms in Complex Search-and-Retrieve Virtual Simulations
Ishita Vohra ... Varun Dutt
-
Ishita Vohra, et. al.Ishita Vohra ... Varun Dutt
01 Jan 2021
01 Jan 2021

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain.
Jianye Hao ... Peng Liu
IEEE transactions on neural networks and learning systems | VOL. 35
Jianye Hao, et. al.Jianye Hao ... Peng Liu
01 Jul 2024
IEEE transactions on neural networks and learning systems | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hierarchical Policies of Subgoals for Safe Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers