Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.

Dorothea Koert,Vildan Salikutluk,Carlo D'Eramo,Jan Peters,Maximilian Kircher

doi:10.3389/frobt.2020.00097

Abstract

The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users.

Highlights

Future robots are expected to cope with a variety of different tasks which renders manual programming of each task highly difficult
We use human input as a valuable source of information to guide the exploration of a robot in a Reinforcement Learning (RL) setting, speed up the learning, and prevent disasters which could be caused by exploratory actions in real robotic scenarios
In the second robotic task, we investigate how humans react to the concept of self-confidence of the robot and how they respond if a robot starts to make own suggestions, once it recognizes human input might be incorrect

Summary

INTRODUCTION

Future robots are expected to cope with a variety of different tasks which renders manual programming of each task highly difficult. In real robotic scenarios, there might be high costs assigned to taking wrong actions, such as breaking valuable objects, the robot’s hardware, or even cause harm to humans This can further confine the exploration of the agent. In this paper, we present a novel interactive RL framework that integrates multiple channels for human input, namely action suggestion, and prohibition as well as feedback and sub-goal definition. We evaluate this approach for two robotic tasks with non-expert users. The main contribution of this paper, is the evaluation of our multi-channel interactive RL framework, which includes our concept for self-confidence of the robot, on two sequential robotic tasks with 20 inexperienced human subjects.

RELATED WORK

MULTI-CHANNEL INTERACTIVE REINFORCEMENT LEARNING

Human Input Channels

Human Feedback After Action Execution

Human Subgoal Rewards

Human State Modifications

Reinforcement Learning Module

Human-Advice-Module

Action Selection

Self-Confidence

Component Implementation

EXPERIMENTAL EVALUATION

Robotic Kitchen Task

Robotic Sorting Task

Discussion

Findings

CONCLUSION AND FUTURE WORK

ETHICS STATEMENT

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Robotics and AI	Publication Date: Sep 24, 2020
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Robotics and AI

Lead the way for us

Similar Papers

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.
...
Frontiers in Robotics and AI | VOL. 7
, et. al. ...
24 Sep 2020
Frontiers in Robotics and AI | VOL. 7

Interactive Machine Learning for Assistive Robots

-

16 Oct 2020
16 Oct 2020

Keeping Humans in the Loop: Teaching via Feedback in Continuous Action Space Environments
Isaac Sheidlower ... Allison Moore
-
Isaac Sheidlower, et. al.Isaac Sheidlower ... Allison Moore
23 Oct 2022
23 Oct 2022

Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning
Lisa Scherf ... Dorothea Koert
-
Lisa Scherf, et. al.Lisa Scherf ... Dorothea Koert
28 Nov 2022
28 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Robotics and AI