Abstract

Humans are capable of complex manipulation interactions with the environment, relying on the intrinsic adaptability and compliance of their hands. Recently, soft robotic manipulation has attempted to reproduce such extraordinary behavior through the design of deformable yet robust end-effectors. To this end, the investigation of human behavior has become crucial to correctly inform the technological development of robotic hands that can successfully exploit environmental constraints as humans actually do. Among the different tools robotics can leverage to achieve this objective, deep learning has emerged as a promising approach for the study, and subsequent implementation, of neuroscientific observations on the artificial side. However, current approaches tend to neglect the dynamic nature of hand pose recognition problems, limiting the effectiveness of these techniques in identifying the sequences of manipulation primitives that underpin action generation, e.g., during purposeful interaction with the environment. In this work, we propose a vision-based supervised Hand Pose Recognition method which, for the first time, takes temporal information into account to identify meaningful sequences of actions in grasping and manipulation tasks. More specifically, we apply Deep Neural Networks to automatically learn features from hand posture images, consisting of frames extracted from videos of grasping and manipulation tasks with objects and external environmental constraints. For training purposes, the videos are divided into intervals, each associated with a specific action by a human supervisor. The proposed algorithm combines a Convolutional Neural Network, which detects the hand within each video frame, with a Recurrent Neural Network, which predicts the hand action in the current frame while taking into account the history of actions performed in the previous frames. Experimental validation has been performed on two datasets of dynamic hand-centric strategies, in which subjects regularly interact with objects and the environment. The proposed architecture achieved very good classification accuracy on both datasets, reaching performance of up to 94% and outperforming state-of-the-art techniques. The outcomes of this study can be successfully applied to robotics, e.g., for the planning and control of soft anthropomorphic manipulators.
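To make the CNN + RNN pipeline concrete, the following is a minimal sketch of such an architecture, assuming PyTorch. A ResNet-18 backbone stands in for the per-frame hand-processing CNN (the paper's dedicated hand-detection stage is replaced here by a plain feature extractor for brevity), and an LSTM plays the role of the recurrent stage that carries the history of previous frames. All names, layer sizes, and the 10-action label set are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch (illustrative, not the authors' code): a CNN extracts one
# feature vector per video frame; an LSTM classifies the action in each
# frame while its hidden state carries the history of previous frames.
import torch
import torch.nn as nn
from torchvision import models


class HandActionRecognizer(nn.Module):
    def __init__(self, num_actions: int, hidden_size: int = 256):
        super().__init__()
        backbone = models.resnet18(weights=None)  # pretrained weights optional
        feat_dim = backbone.fc.in_features        # 512 for ResNet-18
        backbone.fc = nn.Identity()               # keep raw frame features
        self.cnn = backbone                       # stand-in for the hand CNN
        self.rnn = nn.LSTM(feat_dim, hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, num_actions)

    def forward(self, clips: torch.Tensor) -> torch.Tensor:
        # clips: (batch, time, 3, H, W) -> per-frame action logits
        b, t, c, h, w = clips.shape
        feats = self.cnn(clips.view(b * t, c, h, w)).view(b, t, -1)
        hidden, _ = self.rnn(feats)               # history-aware frame states
        return self.head(hidden)                  # (batch, time, num_actions)


if __name__ == "__main__":
    model = HandActionRecognizer(num_actions=10)  # hypothetical label count
    video = torch.randn(2, 16, 3, 224, 224)       # 2 clips of 16 frames
    print(model(video).shape)                     # torch.Size([2, 16, 10])
```

Training such a model on intervals labeled by a human supervisor, as described above, would amount to a per-frame cross-entropy loss over the `(batch, time, num_actions)` logits.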

Highlights

  • The hand is the primary tool humans rely on to successfully interact with the external environment, capitalizing on the intrinsic adaptability of the human “end-effector” to multiply its manipulation and grasping capabilities

  • The dynamic aspects resulting from the interaction between hand, object, and environment are essential for human grasps and may represent an unmatched source of inspiration for devising successful and robust approaches to soft robot grasping. To favor such cross-fertilization between neuroscientific observations and robotics research, our work analyzes videos of human hands during Environmental Constraint Exploitation (ECE)-based object grasping, to identify dynamic strategies for successful “manipulation with the environment” tasks

  • On-line classification is not the goal of this work, since our objective is to deeply characterize human grasping primitives for a possible translation to the robotic side. Nevertheless, the sequence-learning network we propose could in principle be used for real-time predictions, since it requires less than 2 ms to process each hand feature vector (see the latency sketch below)
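A per-frame latency figure like the one above can be checked in isolation with a throwaway benchmark along these lines. This is a hedged sketch using standard PyTorch, not the authors' code; the 512-dimensional feature vector and 256-unit LSTM are assumed sizes.

```python
# Hypothetical latency check for the sequence-learning stage: feed one hand
# feature vector at a time to an LSTM and time the average step, mirroring
# the "< 2 ms per hand feature vector" claim. All sizes are illustrative.
import time
import torch
import torch.nn as nn

rnn = nn.LSTM(input_size=512, hidden_size=256, batch_first=True).eval()
feature = torch.randn(1, 1, 512)   # one hand feature vector per frame
state = None                       # (h, c) carried across frames

with torch.no_grad():
    for _ in range(10):            # warm-up, excludes one-time costs
        _, state = rnn(feature, state)

    steps = 1000
    start = time.perf_counter()
    for _ in range(steps):
        _, state = rnn(feature, state)
    elapsed_ms = (time.perf_counter() - start) / steps * 1e3

print(f"avg per-frame step: {elapsed_ms:.3f} ms")
```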

Introduction

The hand is the primary tool humans rely on to successfully interact with the external environment, capitalizing on the intrinsic adaptability of the human “end-effector” to multiply its manipulation and grasping capabilities. A correct recognition of hand pose and gesture represents an active field of research with applications that are not limited to human-robot interaction and human-inspired robotic grasping (Terlemez et al., 2014) but cross-fertilize several technological and scientific domains, such as neuroscience (Santello et al., 2016), rehabilitation (Dipietro et al., 2008), tele-operation (Fani et al., 2018), and haptics and virtual reality (Bianchi et al., 2013), just to cite a few. Hand pose recognition is usually performed through wearable or remote devices (Ciotti et al., 2016). The former category comprises glove and surface marker-based systems. For a comparative analysis of these techniques, please refer to Rautaray and Agrawal (2015).
