What Is It You Really Want of Me? Generalized Reward Learning with Biased Beliefs about Domain Dynamics

Ze Gong,Yu Zhang

doi:10.1609/aaai.v34i03.5630

Abstract

Reward learning as a method for inferring human intent and preferences has been studied extensively. Prior approaches make an implicit assumption that the human maintains a correct belief about the robot's domain dynamics. However, this may not always hold since the human's belief may be biased, which can ultimately lead to a misguided estimation of the human's intent and preferences, which is often derived from human feedback on the robot's behaviors. In this paper, we remove this restrictive assumption by considering that the human may have an inaccurate understanding of the robot. We propose a method called Generalized Reward Learning with biased beliefs about domain dynamics (GeReL) to infer both the reward function and human's belief about the robot in a Bayesian setting based on human ratings. Due to the complex forms of the posteriors, we formulate it as a variational inference problem to infer the posteriors of the parameters that govern the reward function and human's belief about the robot simultaneously. We evaluate our method in a simulated domain and with a user study where the user has a bias based on the robot's appearances. The results show that our method can recover the true human preferences while subject to such biased beliefs, in contrast to prior approaches that could have misinterpreted them completely.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

What Is It You Really Want of Me? Generalized Reward Learning with Biased Beliefs about Domain Dynamics

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 4

Similar Papers

The effect of robot's behavior vs. appearance on communication with humans
Eunil Park ... Jongsik Lee
-
Eunil Park, et. al.Eunil Park ... Jongsik Lee
06 Mar 2011
06 Mar 2011

Active preference-based Gaussian process regression for reward learning and optimization
Erdem Bıyık ... Dorsa Sadigh
The International Journal of Robotics Research | VOL. 43
Erdem Bıyık, et. al.Erdem Bıyık ... Dorsa Sadigh
07 Nov 2023
The International Journal of Robotics Research | VOL. 43

Weak Human Preference Supervision for Deep Reinforcement Learning
Zehong Cao ... Chin-Teng Lin
IEEE Transactions on Neural Networks and Learning Systems | VOL. 32
Zehong Cao, et. al.Zehong Cao ... Chin-Teng Lin
01 Dec 2021
IEEE Transactions on Neural Networks and Learning Systems | VOL. 32

Active Preference-Based Gaussian Process Regression for Reward Learning
Erdem Biyik ... Dorsa Sadigh
-
Erdem Biyik, et. al.Erdem Biyik ... Dorsa Sadigh
12 Jul 2020
12 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

What Is It You Really Want of Me? Generalized Reward Learning with Biased Beliefs about Domain Dynamics

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence