Abstract

Inverse reinforcement learning (IRL) addresses the problem of recovering the unknown reward function of a Markov decision problem (MDP), given the corresponding optimal policy or a perturbed version thereof. This paper studies the space of possible solutions to the general IRL problem when the agent is provided with incomplete or imperfect information regarding the optimal policy for the MDP whose reward must be estimated. We focus on scenarios with finite state-action spaces and discuss the constraints imposed on the set of possible solutions when the agent is provided with (i) perturbed policies; (ii) optimal policies; and (iii) incomplete policies. We discuss previous works on IRL in light of our analysis and show that, with our characterization of the solution space, it is possible to determine non-trivial closed-form solutions for the IRL problem. We also discuss several other interesting aspects of the IRL problem that stem from our analysis.
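To make the notion of a "solution space" concrete, the sketch below checks the classical feasibility condition of Ng and Russell for finite MDPs: a state-reward vector R is consistent with an observed policy pi when (P_pi[s] - P_a[s]) · (I - gamma·P_pi)^(-1) R >= 0 for every state s and every non-chosen action a. This is a minimal illustration of the kind of constraints the abstract refers to, not the paper's own characterization; the toy 2-state, 2-action MDP and all names here are hypothetical.

```python
gamma = 0.9

# Hypothetical toy MDP: transition matrices per action, P[a][s][s'].
P = {
    "stay": [[1.0, 0.0], [0.0, 1.0]],
    "move": [[0.0, 1.0], [1.0, 0.0]],
}
# Observed policy: move from state 0 to state 1, then stay.
pi = {0: "move", 1: "stay"}

def inv2(M):
    """Inverse of a 2x2 matrix (enough for this toy example)."""
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def satisfies_irl_constraints(R, eps=1e-9):
    """True iff reward vector R (over states) is a feasible IRL solution
    for policy pi, i.e. (P_pi[s] - P_a[s]) . (I - gamma P_pi)^{-1} R >= 0
    for all states s and all actions a != pi(s)."""
    Ppi = [P[pi[s]][s] for s in range(2)]           # transitions under pi
    A = [[(1.0 if i == j else 0.0) - gamma * Ppi[i][j]
          for j in range(2)] for i in range(2)]     # I - gamma P_pi
    Ainv = inv2(A)
    V = [sum(Ainv[i][j] * R[j] for j in range(2))   # value of pi under R
         for i in range(2)]
    for s in range(2):
        for a in P:
            if a == pi[s]:
                continue
            gain = sum((Ppi[s][j] - P[a][s][j]) * V[j] for j in range(2))
            if gain < -eps:                         # deviating to a would help
                return False
    return True

print(satisfies_irl_constraints([0.0, 1.0]))  # True: rewarding state 1 explains pi
print(satisfies_irl_constraints([1.0, 0.0]))  # False: pi would not leave state 0
print(satisfies_irl_constraints([0.0, 0.0]))  # True: R = 0 is the trivial solution
```

Note that the all-zero reward always satisfies the constraints, which is precisely why the solution set is non-trivial to characterize and why additional criteria are needed to single out informative rewards.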
