Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences.

Chirag I Patel, Dileep Labana, Muhammad Awais, Sharnil Pandya, Kirit Modi, Hemant Ghayvat

doi:10.3390/s20247299

Abstract

Human Action Recognition (HAR) is the classification of an action performed by a human. The goal of this study was to recognize human actions in action video sequences. We present a novel feature descriptor for HAR that combines multiple features using a fusion technique. The major focus of the feature descriptor is to exploit the dissimilarities between actions. The key contribution of the proposed approach is a robust feature descriptor that works across the underlying video sequences and various classification models. To achieve this objective, HAR is performed as follows. First, the moving object is detected and segmented from the background. Features are then calculated from the segmented moving object using the histogram of oriented gradients (HOG). To reduce the size of the feature descriptor, we average the HOG features across non-overlapping video frames. For frequency-domain information, we calculate regional features from the Fourier HOG. We also include the velocity and displacement of the moving object. Finally, we use a fusion technique to combine these features. Once the feature descriptor is prepared, it is provided to the classifier. We use well-known classifiers: artificial neural networks (ANNs), support vector machine (SVM), multiple kernel learning (MKL), Meta-cognitive Neural Network (McNN), and late fusion methods. The main objective of the proposed approach is to prepare a robust feature descriptor and to show the diversity of our feature descriptor. Although we use five different classifiers, our feature descriptor performs relatively well across all of them.
The proposed approach is evaluated and compared with state-of-the-art methods for action recognition on two publicly available benchmark datasets (KTH and Weizmann) and, for cross-validation, on the UCF11, HMDB51, and UCF101 datasets. Results of control experiments, such as a change in the SVM classifier and the effect of a second hidden layer in the ANN, are also reported. The results demonstrate that the proposed method performs reasonably well compared with the majority of existing state-of-the-art methods, including convolutional neural network-based feature extractors.
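
The descriptor-building steps named in the abstract (averaging HOG features over non-overlapping frames, adding velocity and displacement, then fusing) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the window size, the definitions of displacement (straight-line distance between first and last centroid) and velocity (mean speed along the centroid path), and the use of plain concatenation as the fusion step are all assumptions made for illustration. Per-frame HOG vectors and object centroids are assumed to be computed elsewhere.

```python
from math import hypot


def average_over_windows(frame_features, window):
    """Average per-frame feature vectors over non-overlapping windows.

    `window` is a hypothetical parameter; a trailing partial window is dropped.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    averaged = []
    for start in range(0, len(frame_features) - window + 1, window):
        chunk = frame_features[start:start + window]
        # zip(*chunk) groups the same feature dimension across frames.
        averaged.append([sum(col) / window for col in zip(*chunk)])
    return averaged


def motion_features(centroids, fps):
    """Return (displacement, mean speed) from per-frame object centroids.

    Assumed definitions: displacement is the straight-line distance in pixels
    between the first and last centroid; speed is path length over elapsed time.
    """
    if len(centroids) < 2:
        return 0.0, 0.0
    (x0, y0), (x1, y1) = centroids[0], centroids[-1]
    displacement = hypot(x1 - x0, y1 - y0)
    path = sum(
        hypot(bx - ax, by - ay)
        for (ax, ay), (bx, by) in zip(centroids, centroids[1:])
    )
    duration = (len(centroids) - 1) / fps
    return displacement, path / duration


def fuse(hog_windows, displacement, speed):
    """Fuse features by concatenation (an assumed, simple fusion scheme)."""
    fused = [value for window_vec in hog_windows for value in window_vec]
    fused.extend([displacement, speed])
    return fused


# Toy data: five frames of 2-dimensional "HOG" vectors and three centroids.
frames = [[1.0, 3.0], [3.0, 5.0], [2.0, 2.0], [4.0, 6.0], [9.0, 9.0]]
centroids = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]

hog_windows = average_over_windows(frames, window=2)
displacement, speed = motion_features(centroids, fps=2.0)
descriptor = fuse(hog_windows, displacement, speed)
```

With a window of two frames, the five toy frames collapse to two averaged vectors, so the descriptor length grows with the number of windows rather than the number of frames. The fused vector would then be passed to a classifier of choice.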

Highlights

Automatic understanding of video data (e.g., action recognition) remains a difficult but important challenge
In machine vision, automatic understanding of video data remains a difficult but important challenge
The method of recognizing human actions that occur in a video sequence is defined as human action recognition (HAR)

Summary

Introduction

Automatic understanding of video data (e.g., action recognition) remains a difficult but important challenge. The method of recognizing human actions that occur in a video sequence is defined as human action recognition (HAR). It is difficult to differentiate routine life actions, such as running, jogging, and walking, using an executable script. Human action recognition is a challenging multi-class classification problem because of the high intra-class variability within a given class. We propose a scheme to design a feature descriptor that is highly invariant to the fluctuations present within the classes. This paper addresses various challenges in HAR, such as variation in the background (outdoor or indoor), recognizing the gender of the action performer, variation in the clothes worn, and scale variation. We deal with constrained video sequences that involve a moving background and multiple actions in a single video sequence.


Journal: Sensors (Basel, Switzerland)	Publication Date: Dec 18, 2020
Citations: 53	License type: CC BY 4.0


Similar Papers

Traffic sign recognition using hybrid features descriptor and artificial neural network classifier
Md Zainal Abedin ... Kaushik Deb
01 Dec 2016

Integration of moment invariants and uniform local binary patterns for human activity recognition in video sequences
Swati Nigam ... Ashish Khare
Multimedia Tools and Applications | VOL. 75
16 Nov 2015

Pedestrian detection from thermal images with a scattered difference of directional gradients feature descriptor
Bin Qi ... Zheng Liu
01 Oct 2014

Histogram of directional derivative based spatio-temporal descriptor for human action recognition
Sidharth B Bhorge ... Ramachandra R Manthalkar
01 Feb 2017
