Learning via human feedback in continuous state and action spaces

Ngo Anh Vien,Tae Choong Chung,Wolfgang Ertel

doi:10.1007/s10489-012-0412-6

Abstract

This paper considers the problem of extending Training an Agent Manually via Evaluative Reinforcement (TAMER) in continuous state and action spaces. Investigative research using the TAMER framework enables a non-technical human to train an agent through a natural form of human feedback (negative or positive). The advantages of TAMER have been shown on tasks of training agents by only human feedback or combining human feedback with environment rewards. However, these methods are originally designed for discrete state-action, or continuous state-discrete action problems. This paper proposes an extension of TAMER to allow both continuous states and actions, called ACTAMER. The new framework utilizes any general function approximation of a human trainer's feedback signal. Moreover, a combined capability of ACTAMER and reinforcement learning is also investigated and evaluated. The combination of human feedback and reinforcement learning is studied in both settings: sequential and simultaneous. Our experimental results demonstrate the proposed method successfully allowing a human to train an agent in two continuous state-action domains: Mountain Car and Cart-pole (balancing).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning via human feedback in continuous state and action spaces

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence

Lead the way for us

Journal: Applied Intelligence	Publication Date: Feb 2, 2013
Citations: 33

Similar Papers

Reinforcement learning combined with human feedback in continuous state and action spaces
Ngo Anh Vien ... Wolfgang Ertel
-
Ngo Anh Vien, et. al. Ngo Anh Vien ... Wolfgang Ertel
01 Nov 2012
01 Nov 2012

Continuous-action reinforcement learning with fast policy search and adaptive basis function selection
Xin Xu ... Dewen Hu
Soft Computing | VOL. 15
Xin Xu, et. al.Xin Xu ... Dewen Hu
28 Mar 2010
Soft Computing | VOL. 15

Learning Agents with Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar ... Gopalakrishnan Srinivasaraghavan
-
Rajesh Mangannavar, et. al.Rajesh Mangannavar ... Gopalakrishnan Srinivasaraghavan
01 Jan 2019
01 Jan 2019

Reinforcement Learning with Particle Swarm Optimization Policy (PSO-P) in Continuous State and Action Spaces
Daniel Hein ... Steffen Udluft
International Journal of Swarm Intelligence Research | VOL. 7
Daniel Hein, et. al.Daniel Hein ... Steffen Udluft
01 Jul 2016
International Journal of Swarm Intelligence Research | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning via human feedback in continuous state and action spaces

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence