Abstract

The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users.

Highlights

  • Future robots are expected to cope with a variety of different tasks which renders manual programming of each task highly difficult

  • We use human input as a valuable source of information to guide the exploration of a robot in a Reinforcement Learning (RL) setting, speed up the learning, and prevent disasters which could be caused by exploratory actions in real robotic scenarios

  • In the second robotic task, we investigate how humans react to the concept of self-confidence of the robot and how they respond if a robot starts to make own suggestions, once it recognizes human input might be incorrect

Read more

Summary

INTRODUCTION

Future robots are expected to cope with a variety of different tasks which renders manual programming of each task highly difficult. In real robotic scenarios, there might be high costs assigned to taking wrong actions, such as breaking valuable objects, the robot’s hardware, or even cause harm to humans This can further confine the exploration of the agent. In this paper, we present a novel interactive RL framework that integrates multiple channels for human input, namely action suggestion, and prohibition as well as feedback and sub-goal definition. We evaluate this approach for two robotic tasks with non-expert users. The main contribution of this paper, is the evaluation of our multi-channel interactive RL framework, which includes our concept for self-confidence of the robot, on two sequential robotic tasks with 20 inexperienced human subjects.

RELATED WORK
MULTI-CHANNEL INTERACTIVE REINFORCEMENT LEARNING
Human Input Channels
Human Feedback After Action Execution
Human Subgoal Rewards
Human State Modifications
Reinforcement Learning Module
Human-Advice-Module
Action Selection
Self-Confidence
Component Implementation
EXPERIMENTAL EVALUATION
Robotic Kitchen Task
Robotic Sorting Task
Discussion
Findings
CONCLUSION AND FUTURE WORK
ETHICS STATEMENT

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.