Imitation Learning with Demonstrations and Shaping Rewards

Kshitij Judah, Robby Goetschalckx, Alan Fern, Prasad Tadepalli

doi:10.1609/aaai.v28i1.9024

Abstract

Imitation Learning (IL) is a popular approach for teaching behavior policies to agents by demonstrating the desired target policy. While the approach has led to many successes, IL often requires a large set of demonstrations to achieve robust learning, which can be expensive for the teacher. In this paper, we consider a novel approach to improve the learning efficiency of IL by providing a shaping reward function in addition to the usual demonstrations. Shaping rewards are numeric functions of states (and possibly actions) that are generally easy to specify and that capture general principles of desired behavior without necessarily specifying the behavior completely. Shaping rewards have been used extensively in reinforcement learning but have seldom been considered for IL. Our main contribution is to propose an IL approach that learns from both shaping rewards and demonstrations. We demonstrate the effectiveness of the approach across several IL problems, even when the shaping reward is not fully consistent with the demonstrations.
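To make the idea of combining the two signals concrete, the following is a minimal sketch under stated assumptions. It is not the authors' algorithm, which the abstract does not specify. The corridor task, the `shaping_reward` and `fit_policy` functions, and the weight `lam` are all hypothetical, chosen only to illustrate how a shaping reward can fill in states that no demonstration covers while demonstrations still win where the two disagree.

```python
from collections import Counter

ACTIONS = ("left", "right")
GOAL = 5  # rightmost cell of a hypothetical 6-cell corridor (assumption)


def shaping_reward(state, action):
    """Hypothetical shaping reward: reduction in distance to GOAL after the step."""
    next_state = state + 1 if action == "right" else state - 1
    return float(abs(GOAL - state) - abs(GOAL - next_state))


def fit_policy(demos, states, lam=0.5):
    """Per state, pick the action maximizing demo agreement + lam * shaping reward.

    This toy objective is an illustrative assumption, not the paper's method.
    """
    counts = Counter(demos)  # (state, action) -> number of demonstrations
    policy = {}
    for s in states:
        policy[s] = max(
            ACTIONS,
            key=lambda a: counts[(s, a)] + lam * shaping_reward(s, a),
        )
    return policy


# Demonstrations cover only states 0 and 3. At state 3 the teacher steps left,
# which is inconsistent with the shaping reward.
demos = [(0, "right"), (3, "left"), (3, "left")]
policy = fit_policy(demos, states=range(GOAL))
print(policy)
```

In this sketch, the states without demonstrations (1, 2, and 4) follow the shaping reward and move right, while state 3 follows the demonstrations and moves left because two demonstrations outweigh the shaping term at `lam=0.5`.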

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: Jun 21, 2014
Citations: 22

Similar Papers

Learning to Touch Objects Through Stage-Wise Deep Reinforcement Learning
Francois De La Bourdonnaye ... Celine Teuliere
21 Jun 2018

ASSESSING TROPHIC STATE AND WATER QUALITY OF SMALL LAKES AND PONDS IN PERAK
Zati Sharip ... Mohd Zaki Mat Amin
Jurnal teknologi | VOL. 86
15 Jan 2024

FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
Mohamad Hafiz Abu Bakar ... Zubair Adil Soomro
Jurnal teknologi | VOL. 86
15 Jan 2024

Reinforcement Learning with Potential Functions Trained to Discriminate Good and Bad States
Yifei Chen ... Hamidreza Kasaei
18 Jul 2021
