Abstract

This paper aims at investigating the action prediction problem from a pure kinematic perspective. Specifically, we address the problem of recognizing future actions, indeed human intentions, underlying a same initial (and apparently unrelated) motor act. This study is inspired by neuroscientific findings asserting that motor acts at the very onset are embedding information about the intention with which are performed, even when different intentions originate from a same class of movements. To demonstrate this claim in computational and empirical terms, we designed an ad hoc experiment and built a new 3D and 2D dataset where, in both training and testing, we analyze a same class of grasping movements underlying different intentions. We investigate how much the intention discriminants generalize across subjects, discovering that each subject tends to affect the prediction by his/her own bias. Inspired by the domain adaptation problem, we propose to interpret each subject as a domain, leading to a novel subject adversarial paradigm. The proposed approach favorably copes with our new problem, boosting the considered baseline features encoding 2D and 3D information and which do not exploit the subject information.

Highlights

  • Recognizing human actions is an active area of research which is faced under different paradigms in computer vision and pattern recognition

  • In Vondrick et al (2016), a convolutional neural nets (CNN) is trained by jointly considering the present and the future frames of a given scene, while, in our case, only the graspings are exploitable as data

  • We introduce Intention from Motion, a novel problem consisting in predicting the goal i.e., intention) that originates from an human action by using the kinematics only, in a context-free setting

Read more

Summary

Introduction

Recognizing human actions is an active area of research which is faced under different paradigms in computer vision and pattern recognition. As a different paradigm, we introduce a new and more demanding problem, consisting in the anticipation of future, never observed, actions where the only available input data consists of motion segments of a nondiscriminant and apparently unrelated (with respect to the future action) motor act. The problem we would like to address consists in the prediction of human intentions, defined as the overarching goal embedded in an action sequence (see Fig. 1). It would be desirable to predict the intention of a person in his/her car stopped at a police checkpoint whether, while opening the International Journal of Computer Vision (2020) 128:220–239

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.