Abstract
We propose a new method for recommending solutions that match an end-user's preferences in interactive reinforcement learning. Interactive reinforcement learning differs from ordinary reinforcement learning in that a human gives the reward function to the learner interactively. As a consequence, the reward function may not be fixed from the learner's point of view, because an end-user may change his or her mind or preference. However, most previous reinforcement learning methods assume that the reward function is fixed and that the optimal solution is unique, so they are of little use in interactive reinforcement learning with such an end-user. To address this, the learner must estimate the user's preference and take its changes into account. This paper proposes a new method for matching an end-user's preferred solution with the solution recommended by the learner. Experiments with twenty subjects were performed to evaluate the effectiveness of our method. The experimental results show that a large number of subjects prefer the every-visit-optimal solutions to the optimal solution, whereas a small number of subjects prefer the every-visit-non-optimal solutions. We discuss why the end-users' preferences are divided into these two groups.
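The abstract does not spell out the learning loop, so the following is only a minimal sketch of the interactive setting it describes: tabular Q-learning in which the reward is supplied step by step by the end-user rather than by a fixed reward function. The environment interface (`env.reset`, `env.step`, `env.actions`) and the `query_user_reward` callback are hypothetical stand-ins, not the authors' implementation, and the sketch does not cover the paper's preference-matching or every-visit-optimal recommendation procedure.

```python
import random
from collections import defaultdict

def interactive_q_learning(env, query_user_reward, episodes=50,
                           alpha=0.1, gamma=0.9, epsilon=0.1):
    """Tabular Q-learning where the reward comes from the end-user
    (query_user_reward) instead of a fixed reward function, so the
    learned values track the user's current, possibly changing, preference."""
    q = defaultdict(float)  # (state, action) -> estimated value

    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            # Epsilon-greedy action selection over the environment's actions.
            if random.random() < epsilon:
                action = random.choice(env.actions)
            else:
                action = max(env.actions, key=lambda a: q[(state, a)])

            next_state, done = env.step(action)

            # The human supplies the reward interactively; nothing here
            # assumes the reward function stays the same between episodes.
            reward = query_user_reward(state, action, next_state)

            best_next = max(q[(next_state, a)] for a in env.actions)
            q[(state, action)] += alpha * (
                reward + gamma * best_next - q[(state, action)]
            )
            state = next_state
    return q
```

In this sketch, a change of mind by the user simply shows up as different values returned by `query_user_reward`, which is why methods that assume a fixed reward and a unique optimal solution break down in this setting.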