TAG: Teacher-Advice Mechanism With Gaussian Process for Reinforcement Learning.

Ke Lin, Duantengchuan Li, Yanjie Li, Shiyu Chen, Qi Liu, Jianqi Gao, Yanrui Jin, Liang Gong

doi:10.1109/tnnls.2023.3262956

Abstract

Reinforcement learning (RL) still suffers from sample inefficiency and struggles with exploration, particularly in situations with long-delayed rewards, sparse rewards, and deep local optima. Recently, the learning from demonstration (LfD) paradigm was proposed to tackle this problem. However, LfD methods usually require a large number of demonstrations. In this study, we present a sample-efficient teacher-advice mechanism with Gaussian process (TAG) that leverages a few expert demonstrations. In TAG, a teacher model is built to provide both an advice action and its associated confidence value. Then, a guided policy is formulated to guide the agent in the exploration phase via the defined criteria. Through the TAG mechanism, the agent is capable of exploring the environment more intentionally. Moreover, with the confidence value, the guided policy can guide the agent precisely. Also, due to the strong generalization ability of the Gaussian process, the teacher model can utilize the demonstrations more effectively. Therefore, substantial improvement in performance and sample efficiency can be attained. Extensive experiments on sparse-reward environments demonstrate that the TAG mechanism can help typical RL algorithms achieve significant performance gains. In addition, the TAG mechanism with the soft actor-critic algorithm (TAG-SAC) attains state-of-the-art performance over other LfD counterparts on several delayed-reward and complicated continuous control environments.
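The teacher-and-guided-policy idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the kernel, the confidence measure, or the guidance criteria, so everything below is an assumption chosen for illustration. The names `GPTeacher`, `guided_action`, and `std_threshold` are hypothetical. The teacher is a plain Gaussian process regressor with a squared-exponential kernel fitted to demonstrated state-action pairs, the GP predictive standard deviation stands in for the confidence value, and a simple threshold rule stands in for the guidance criteria.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between row sets a (n, d) and b (m, d)."""
    sq_dist = (np.sum(a**2, axis=1)[:, None]
               + np.sum(b**2, axis=1)[None, :]
               - 2.0 * a @ b.T)
    return np.exp(-0.5 * np.maximum(sq_dist, 0.0) / length_scale**2)


class GPTeacher:
    """Hypothetical teacher: GP regression from demonstrated states to actions."""

    def __init__(self, states, actions, length_scale=1.0, noise=1e-3):
        self.states = np.asarray(states, dtype=float)
        self.length_scale = length_scale
        k = rbf_kernel(self.states, self.states, length_scale)
        k += noise * np.eye(len(self.states))
        self.chol = np.linalg.cholesky(k)
        # alpha = K^{-1} @ actions, solved through the Cholesky factor.
        y = np.asarray(actions, dtype=float)
        self.alpha = np.linalg.solve(self.chol.T, np.linalg.solve(self.chol, y))

    def advise(self, state):
        """Return (advice_action, predictive_std) for a single state."""
        s = np.atleast_2d(np.asarray(state, dtype=float))
        k_star = rbf_kernel(s, self.states, self.length_scale)  # shape (1, n)
        mean = (k_star @ self.alpha)[0]
        v = np.linalg.solve(self.chol, k_star.T)
        var = 1.0 - float(np.sum(v**2))  # prior variance of this kernel is 1
        return mean, np.sqrt(max(var, 0.0))


def guided_action(teacher, agent_action, state, std_threshold=0.3):
    """Assumed criterion: follow the teacher only where it is confident."""
    advice, std = teacher.advise(state)
    if std < std_threshold:
        return advice, True
    return np.asarray(agent_action, dtype=float), False


# Three 1-D demonstrations in which the expert action equals the state.
teacher = GPTeacher([[0.0], [1.0], [2.0]], [[0.0], [1.0], [2.0]])
near_action, near_used = guided_action(teacher, [-5.0], [1.0])
far_action, far_used = guided_action(teacher, [-5.0], [10.0])
```

Near the demonstrations the predictive standard deviation is small, so the sketch returns the teacher's advice. Far from them the standard deviation approaches the prior value of 1, so the agent's own action is kept. In an actual RL loop the returned action would be the one executed during exploration.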

Journal: IEEE Transactions on Neural Networks and Learning Systems
Publication Date: Sep 1, 2024
Citations: 1

Similar Papers

Control of Space Flexible Manipulator Using Soft Actor-Critic and Random Network Distillation
Chen Yang ... Xueqian Wang
01 Dec 2019

Multi-objective Control Strategy for Islanded Microgrid Based on Soft Actor Critic Algorithm
Jingxing Xiao ... Ying Ye
01 Apr 2023

Learning from Demonstration with Gaussian Process Approach for an Omni-directional Mobile Robot
Daniel Garcia ... Emilio Vargas Soto
IEEE Latin America Transactions | VOL. 16
01 Apr 2018

Automatic task decomposition and state abstraction from demonstration
04 Jun 2012
