3D+T dense motion trajectories as kinematics primitives to recognize gestures on depth video sequences

Fabio Martínez Carrillo,Fabián Castillo,Lola Bautista

doi:10.33571/rpolitec.v15n29a7

Fabio Martínez Carrillo, Fabián Castillo + Show 1 more

Open Access

https://doi.org/10.33571/rpolitec.v15n29a7

Copy DOI

Abstract

RGB-D sensors have allowed attacking many classical problems in computer vision such as segmentation, scene representations and human interaction, among many others. Regarding motion characterization, typical RGB-D strategies are limited to namely analyze global shape changes and capture scene flow fields to describe local motions in depth sequences. Nevertheless, such strategies only recover motion information among a couple of frames, limiting the analysis of coherent large displacements along time. This work presents a novel strategy to compute 3D+t dense and long motion trajectories as fundamental kinematic primitives to represent video sequences. Each motion trajectory models kinematic words primitives that together can describe complex gestures developed along videos. Such kinematic words were processed into a bag-of-kinematic-words framework to obtain an occurrence video descriptor. The novel video descriptor based on 3D+t motion trajectories achieved an average accuracy of 80% in a dataset of 5 gestures and 100 videos.

Full Text