TeachMe: Three-phase learning framework for robotic motion imitation based on interactive teaching and reinforcement learning

Taewoo Kim, Joo-Haeng Lee

doi:10.1109/ro-man46459.2019.8956326

Abstract

Motion imitation is a fundamental communication skill for a robot, especially as a form of nonverbal interaction with a human. Owing to kinematic configuration differences between the human and the robot, it is challenging to determine the appropriate mapping between the two pose domains. Moreover, technical limitations in extracting 3D motion details, such as wrist joint movements, from human motion videos pose significant challenges for motion retargeting. Explicitly mapping between different motion domains is a considerably inefficient solution. To solve these problems, we propose a three-phase reinforcement learning scheme to enable a NAO robot to learn motions from human pose skeletons extracted from video inputs. Our learning scheme consists of three phases: (i) phase one for learning preparation, (ii) phase two for simulation-based reinforcement learning, and (iii) phase three for human-in-the-loop reinforcement learning. In phase one, embeddings of the motions of a human skeleton and of the robot are learned by an autoencoder. In phase two, the NAO robot learns a rough imitation skill using reinforcement learning that translates the learned embeddings. In the last phase, the robot learns motion details that were not considered in the previous phases; rewards are set interactively through direct teaching rather than by the method used in phase two. Notably, phase three requires relatively few interactive inputs for motion details, compared to the large volume of training data required for overall imitation in phase two. The experimental results demonstrate that the proposed method efficiently improves the imitation skills for hand waving and saluting motions obtained from NTU-DB.
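The abstract does not state the reward functions, so the following is only an illustrative sketch of how the reward source might change between phase two and phase three. Every name here (`phase_two_reward`, `phase_three_reward`, `reward_for_phase`) and both distance-based formulas are assumptions made for illustration, not the authors' definitions.

```python
import math


def phase_two_reward(human_embedding, robot_embedding):
    """Assumed simulation-phase reward: negative Euclidean distance
    between a human-motion embedding and a robot-motion embedding."""
    return -math.dist(human_embedding, robot_embedding)


def phase_three_reward(robot_joints, taught_joints):
    """Assumed human-in-the-loop reward: negative mean absolute error
    between the robot's joint angles and a pose set by direct teaching."""
    errors = [abs(r - t) for r, t in zip(robot_joints, taught_joints)]
    return -sum(errors) / len(errors)


def reward_for_phase(phase, current, target):
    """Pick the reward source by phase (hypothetical dispatcher)."""
    if phase == 2:
        return phase_two_reward(target, current)
    if phase == 3:
        return phase_three_reward(current, target)
    raise ValueError("phase one trains embeddings and uses no reward")
```

In this sketch, phase two compares poses in an embedding space, while phase three compares the robot's joints directly with a pose a person has demonstrated. That is one plausible reading of why far fewer interactive inputs could suffice for refining details such as wrist movement.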
