Abstract

A novel first-person human activity recognition framework is proposed in this work. The proposed methodology is motivated by the central role that moving objects play in egocentric activity videos. Using a deep Convolutional Neural Network, we detect objects and compute discriminative object-flow histograms that represent fine-grained micro-actions over short temporal windows. Our framework rests on the assumption that large-scale activities are composed of such fine-grained micro-actions. We collect all micro-actions and cluster them with a Gaussian Mixture Model to build a micro-action vocabulary, which is then used in a Fisher encoding scheme. Results show that our method reaches a 60% recognition rate on the benchmark ADL dataset. The capabilities of the proposed framework are further demonstrated through an extensive evaluation over a wide range of hyper-parameters and a comparison with other state-of-the-art methods.
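To make the vocabulary-and-encoding step concrete, the following is a minimal sketch (not the authors' implementation): a diagonal-covariance GMM is fit on micro-action descriptors to form the vocabulary, and each video's descriptors are then encoded as an improved Fisher vector. The array names, component count, and helper functions are hypothetical placeholders.

```python
# Minimal sketch, not the paper's code: GMM micro-action vocabulary + Fisher vector encoding.
# `train_descriptors` / `video_descriptors` stand in for object-flow histogram features
# (one row per micro-action); the component count K=64 is an assumed value.
import numpy as np
from sklearn.mixture import GaussianMixture

def fit_vocabulary(train_descriptors, n_components=64, seed=0):
    """Fit a diagonal-covariance GMM that serves as the micro-action vocabulary."""
    gmm = GaussianMixture(n_components=n_components, covariance_type="diag",
                          random_state=seed)
    gmm.fit(train_descriptors)
    return gmm

def fisher_vector(video_descriptors, gmm):
    """Encode one video's micro-action descriptors as an improved Fisher vector."""
    X = np.atleast_2d(video_descriptors)              # (T, D) descriptors
    T = X.shape[0]
    gamma = gmm.predict_proba(X)                      # (T, K) soft assignments
    w, mu, var = gmm.weights_, gmm.means_, gmm.covariances_
    sigma = np.sqrt(var)                              # (K, D) std deviations

    # Gradients of the log-likelihood w.r.t. the GMM means and standard deviations.
    diff = (X[:, None, :] - mu[None, :, :]) / sigma[None, :, :]          # (T, K, D)
    g_mu = (gamma[:, :, None] * diff).sum(0) / (T * np.sqrt(w)[:, None])
    g_sigma = (gamma[:, :, None] * (diff ** 2 - 1)).sum(0) / (T * np.sqrt(2 * w)[:, None])

    fv = np.hstack([g_mu.ravel(), g_sigma.ravel()])
    fv = np.sign(fv) * np.sqrt(np.abs(fv))            # power normalization
    return fv / (np.linalg.norm(fv) + 1e-12)          # L2 normalization
```

The resulting fixed-length Fisher vectors can then be fed to a standard classifier (e.g. a linear SVM) for activity recognition.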
