An Interactive Framework for Learning Continuous Actions Policies Based on Corrective Feedback

Carlos Celemin,Javier Ruiz-Del-Solar

doi:10.1007/s10846-018-0839-z

Abstract

The main goal of this article is to present COACH (COrrective Advice Communicated by Humans), a new learning framework that allows non-expert humans to advise an agent while it interacts with the environment in continuous action problems. The human feedback is given in the action domain as binary corrective signals (increase/decrease the current action magnitude), and COACH is able to adjust the amount of correction that a given action receives adaptively, taking state-dependent past feedback into consideration. COACH also manages the credit assignment problem that normally arises when actions in continuous time receive delayed corrections. The proposed framework is characterized and validated extensively using four well-known learning problems. The experimental analysis includes comparisons with other interactive learning frameworks, with classical reinforcement learning approaches, and with human teleoperators trying to solve the same learning problems by themselves. In all the reported experiments COACH outperforms the other methods in terms of learning speed and final performance. It is of interest to add that COACH has been applied successfully for addressing a complex real-world learning problem: the dribbling of the ball by humanoid soccer players.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Interactive Framework for Learning Continuous Actions Policies Based on Corrective Feedback

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent & Robotic Systems

Lead the way for us

Journal: Journal of Intelligent & Robotic Systems	Publication Date: May 12, 2018
Citations: 36

Similar Papers

Learning via human feedback in continuous state and action spaces
Ngo Anh Vien ... Wolfgang Ertel
Applied Intelligence | VOL. 39
Ngo Anh Vien, et. al.Ngo Anh Vien ... Wolfgang Ertel
02 Feb 2013
Applied Intelligence | VOL. 39

COACH: Learning continuous actions from COrrective Advice Communicated by Humans
Carlos Celemin ... Javier Ruiz-Del-Solar
-
Carlos Celemin, et. al.Carlos Celemin ... Javier Ruiz-Del-Solar
01 Jul 2015
01 Jul 2015

Interactive Learning of Continuous Actions from Corrective Advice Communicated by Humans
Carlos Celemin ... Javier Ruiz-Del-Solar
-
Carlos Celemin, et. al.Carlos Celemin ... Javier Ruiz-Del-Solar
01 Jan 2015
01 Jan 2015

Reinforcement learning combined with human feedback in continuous state and action spaces
Ngo Anh Vien ... Wolfgang Ertel
-
Ngo Anh Vien, et. al. Ngo Anh Vien ... Wolfgang Ertel
01 Nov 2012
01 Nov 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Interactive Framework for Learning Continuous Actions Policies Based on Corrective Feedback

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent &amp; Robotic Systems

More From: Journal of Intelligent & Robotic Systems