Hierarchical Task-Parameterized Learning from Demonstration for Collaborative Object Movement.

Siyao Hu,Katherine J Kuchenbecker

doi:10.1155/2019/9765383

Abstract

Learning from demonstration (LfD) enables a robot to emulate natural human movement instead of merely executing preprogrammed behaviors. This article presents a hierarchical LfD structure of task-parameterized models for object movement tasks, which are ubiquitous in everyday life and could benefit from robotic support. Our approach uses the task-parameterized Gaussian mixture model (TP-GMM) algorithm to encode sets of demonstrations in separate models that each correspond to a different task situation. The robot then maximizes its expected performance in a new situation by either selecting a good existing model or requesting new demonstrations. Compared to a standard implementation that encodes all demonstrations together for all test situations, the proposed approach offers four advantages. First, a simply defined distance function can be used to estimate test performance by calculating the similarity between a test situation and the existing models. Second, the proposed approach can improve generalization, e.g., better satisfying the demonstrated task constraints and speeding up task execution. Third, because the hierarchical structure encodes each demonstrated situation individually, a wider range of task situations can be modeled in the same framework without deteriorating performance. Last, adding or removing demonstrations incurs low computational load, and thus, the robot's skill library can be built incrementally. We first instantiate the proposed approach in a simulated task to validate these advantages. We then show that the advantages transfer to real hardware for a task where naive participants collaborated with a Willow Garage PR2 robot to move a handheld object. For most tested scenarios, our hierarchical method achieved significantly better task performance and subjective ratings than both a passive model with only gravity compensation and a single TP-GMM encoding all demonstrations.

Highlights

Many modern humanoid robots are designed to operate in human environments, like homes and hospitals
We focus on an object movement task, such as the scenario shown in Figure 1, because it is ubiquitous in everyday life and could benefit from robot assistance
Similar to using Dynamic motion primitives (DMPs) on generalized trajectories from a Gaussian Process (GP) to ensure that the prescribed goals are reached [15], we introduce a generalizationenhancing strategy that is specific to the movement task: if the start and goal points become farther or closer to each other, the trajectory can be proportionately stretched or compressed in the start-goal direction to accommodate the change: T

Summary

Introduction

Many modern humanoid robots are designed to operate in human environments, like homes and hospitals. If designed well, such robots could help humans accomplish tasks and lower their physical and/or mental workload. One interesting task type is jointly manipulating an object with a partner [1], as it requires human collaboration, shared physical control, and adapting to new situations. As opposed to having an operator devise control policies and reprogram the robot for every new situation it encounters, learning from demonstration (LfD, known as programming by demonstration (PbD)) provides a direct method for robots to learn and replicate human behaviors [2, 3]. LfD control policies are learned from demonstrations in which a human teacher controls the robot to accomplish the task.

Objectives

Methods

Discussion

Conclusion