Abstract

Over the last decades, there has been growing interest in the multiple, interdisciplinary fields of human-AI computing. In particular, approaches that integrate human perspective and design with reinforcement learning (RL) have received increasing attention. However, current RL research has yet to fully explore how it can be enhanced by human-inspired approaches. In this work, we focus on enabling a meta-reinforcement learning (meta-RL) agent to achieve adaptation and generalization by modeling Markov Decision Processes (MDPs) with Bayesian knowledge and analysis. We introduce a novel framework, human-inspired meta-RL (HMRL), in which the agent performs resilient actions by leveraging a dynamic dense reward derived from the knowledge and predictions of a Bayesian analysis. The proposed framework enables the agent to learn to generalize and prevents it from failing catastrophically. Experimental results show that our approach helps the agent reduce computational costs while learning to adapt. Beyond the system design, we also extend the algorithmic side with a deep Q-network (DQN) implementation for more complicated future tasks, comparing replay buffers as a possible way to enhance the optimization process. Finally, we conclude and anticipate that integrating human-inspired meta-RL can support learning formulations related to robustness and scalability, leading to promising directions and more complex AI goals in the future.
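The abstract mentions comparing replay buffers within a DQN implementation but gives no implementation details. As a minimal, illustrative sketch (the class and parameter names below are our own assumptions, not the paper's), a uniform experience replay buffer of the kind typically compared in such studies might look like this:

```python
import random
from collections import deque

class ReplayBuffer:
    """Uniform experience replay: stores (state, action, reward, next_state, done)
    transitions and samples minibatches uniformly at random. A prioritized
    variant (one plausible comparison point) would instead weight samples,
    e.g. by TD error."""

    def __init__(self, capacity):
        # deque with maxlen evicts the oldest transition once capacity is reached
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement from the stored transitions
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)

# Toy usage: fill the buffer with dummy transitions, then draw a minibatch
buf = ReplayBuffer(capacity=100)
for t in range(10):
    buf.push(state=t, action=t % 2, reward=1.0, next_state=t + 1, done=False)
batch = buf.sample(4)
print(len(buf), len(batch))  # 10 4
```

Swapping this uniform buffer for a prioritized or task-conditioned one, while keeping the DQN training loop fixed, is the kind of controlled comparison the abstract alludes to.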
