Leveraging human knowledge in tabular reinforcement learning: a study of human subjects

Ariel Rosenfeld,Sarit Kraus,Moshe Cohen,Matthew E Taylor

doi:10.1017/s0269888918000206

Abstract

AbstractReinforcement learning (RL) can be extremely effective in solving complex, real-world problems. However, injecting human knowledge into an RL agent may require extensive effort and expertise on the human designer’s part. To date, human factors are generally not considered in the development and evaluation of possible RL approaches. In this article, we set out to investigate how different methods for injecting human knowledge are applied, in practice, by human designers of varying levels of knowledge and skill. We perform the first empirical evaluation of several methods, including a newly proposed method named State Action Similarity Solutions (SASS) which is based on the notion of similarities in the agent’s state–action space. Through this human study, consisting of 51 human participants, we shed new light on the human factors that play a key role in RL. We find that the classical reward shaping technique seems to be the most natural method for most designers, both expert and non-expert, to speed up RL. However, we further find that our proposed method SASS can be effectively and efficiently combined with reward shaping, and provides a beneficial alternative to using only a single-speedup method with minimal human designer effort overhead.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Leveraging human knowledge in tabular reinforcement learning: a study of human subjects

Abstract

Talk to us

Similar Papers

More From: The Knowledge Engineering Review

Lead the way for us

Journal: The Knowledge Engineering Review	Publication Date: Jan 1, 2018
Citations: 18

Similar Papers

Leveraging Human Knowledge in Tabular Reinforcement Learning: A Study of Human Subjects
Ariel Rosenfeld ... Sarit Kraus
-
Ariel Rosenfeld, et. al.Ariel Rosenfeld ... Sarit Kraus
01 Aug 2017
01 Aug 2017

Subgoal-Based Reward Shaping to Improve Efficiency in Reinforcement Learning
Takato Okudo ... Seiji Yamada
IEEE Access | VOL. 9
Takato Okudo, et. al.Takato Okudo ... Seiji Yamada
01 Jan 2020
IEEE Access | VOL. 9

LORM: a novel reinforcement learning framework for biped gait control.
Weiyi Zhang ... Guangqi Wang
PeerJ. Computer science | VOL. 8
Weiyi Zhang, et. al.Weiyi Zhang ... Guangqi Wang
28 Mar 2022
PeerJ. Computer science | VOL. 8

Reward Shaping with Dynamic Trajectory Aggregation
Takato Okudo ... Seiji Yamada
-
Takato Okudo, et. al.Takato Okudo ... Seiji Yamada
18 Jul 2021
18 Jul 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Leveraging human knowledge in tabular reinforcement learning: a study of human subjects

Abstract

Talk to us

Similar Papers

More From: The Knowledge Engineering Review