A new spatial-temporal histograms of gradients descriptor and HOD-VLAD encoding for human action recognition

Bo Lin,Bin Fang

doi:10.1142/s0219691319400095

Abstract

Automatic human action recognition is a core functionality of systems for video surveillance and human object interaction. In the whole recognition system, feature description and encoding represent two crucial key steps. In order to construct a powerful action recognition framework, it is important that the two steps must provide reliable performance. In this paper, we proposed a new human action feature descriptor which is called spatio-temporal histograms of gradients (SPHOG). SPHOG is based on the spatial and temporal derivation signal, which extracts the gradient changes between consecutive frames. Compared to the traditional descriptors histograms of optical flow, our proposed SPHOG costs less computation resource. In order to incorporate the distribution information of local descriptors into Vector of Locally Aggregated Descriptors (VLAD), which is a popular encoding approach for Bag-of-Feature representation, a Gaussian kernel is implanted to compute the weighted distance histograms of local descriptors. By doing this, the encoding schema for bag-of-feature (BOF) representation is more effective. We validated our proposed algorithm for human action recognition on three public available datasets KTH, UCF Sports and HMDB51. The evaluation experiment results indicate that the proposed descriptor and encoding method can improve the efficiency of human action recognition and the recognition accuracy.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A new spatial-temporal histograms of gradients descriptor and HOD-VLAD encoding for human action recognition

Abstract

Talk to us

Similar Papers

More From: International journal of wavelets, multiresolution and information processing

Lead the way for us

Journal: International journal of wavelets, multiresolution and information processing	Publication Date: Mar 1, 2019
Citations: 6

Similar Papers

Spatial-temporal histograms of gradients and HOD-VLAD encoding for human action recognition
Bo Lin ... Bin Fang
-
Bo Lin, et. al.Bo Lin ... Bin Fang
01 Dec 2017
01 Dec 2017

Human action recognition based on spatio-temporal three-dimensional scattering transform descriptor and an improved VLAD feature encoding algorithm
Bo Lin ... Jiye Qian
Neurocomputing | VOL. 348
Bo Lin, et. al.Bo Lin ... Jiye Qian
05 Nov 2018
Neurocomputing | VOL. 348

Two-stream spatiotemporal feature fusion for human action recognition
Amany Abdelbaky ... Saleh Aly
The Visual Computer | VOL. 37
Amany Abdelbaky, et. al.Amany Abdelbaky ... Saleh Aly
09 Aug 2020
The Visual Computer | VOL. 37

A compact descriptor CHOG3D and its application in human action recognition
Yanli Ji ... Atsushi Shimada
Ieej Transactions on Electrical and Electronic Engineering | VOL. 8
Yanli Ji, et. al.Yanli Ji ... Atsushi Shimada
20 Nov 2012
Ieej Transactions on Electrical and Electronic Engineering | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A new spatial-temporal histograms of gradients descriptor and HOD-VLAD encoding for human action recognition

Abstract

Talk to us

Similar Papers

More From: International journal of wavelets, multiresolution and information processing