Human Action Recognition Based on Motion Feature and Manifold Learning

Jun Wang,Wentao Ma,Limin Xia

doi:10.1109/access.2021.3088155

Abstract

Human action recognition is an important task in the fields of video content analysis and computer vision. Since the performance of most existing action recognition frameworks depends on the representation of features, many researches aim to construct more discriminative features. In this paper, we propose a manifold learning framework based on optical flow for action recognition. First, we calculate the dense optical flow field of the original video sequence, and the attention pooling layer (AP) is adopted to separate target area and background area to eliminate background interference. On this basis, motion features (MF) based on the physical characteristics of dense optical flow are developed to characterize human motion information. After that, manifold learning is introduced to calculate the motion variance features (MVF), which reflect the change rate of motion features and measure the spatial correlation between features in non-Euclidean space. Finally, fusing the MVF obtained by manifold learning and MF, feeding fusion features into two fully connected layers (FC) in series for action classification and recognition. Experiments on several classic datasets show that the proposed method achieves 0.98%, 1.86% and 0.99% performance improvement on UCF 101, HMDB51 and JHMDB.

Highlights

T HE purpose of human action recognition (HAR) is to realize understanding of human behavior by analysing and processing the video containing human behavior
The research of HAR has made significant progress in image segmentation [1]–[4], target detection [5]–[8] and etc., it is still confronted with a great challenge because of the diversity and high non-linearity of human behavior, which is caused by the non-rigid structure of human body and the confusion of background and motion feature, etc
The mainstream action recognition framework is mainly limited by the following three aspects: (1) deep learning framework often needs to be trained with a large number of parameters, which is easy to fall into the disaster of dimensionality; (2) Due to the one-sidedness of manual features, its recognition ability is not enough to characterize motion states; (3) the intense interference caused by complex background confuses the recognition model

Summary

Introduction

T HE purpose of human action recognition (HAR) is to realize understanding of human behavior by analysing and processing the video containing human behavior. The mainstream action recognition framework is mainly limited by the following three aspects: (1) deep learning framework often needs to be trained with a large number of parameters, which is easy to fall into the disaster of dimensionality; (2) Due to the one-sidedness of manual features, its recognition ability is not enough to characterize motion states; (3) the intense interference caused by complex background confuses the recognition model. The attention pooling layer (AP) is inserted into the traditional 3-layer CNN structure to capture the region of interest (ROI) in continuous video frames The purpose of this step is to reduce the interference caused by the background, and reduce the computational burden, effectively. Concatenating MF with the motion variation features, and fed them into two fully connected layers to complete the action recognition tasks

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Human Action Recognition Based on Motion Feature and Manifold Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Visual cues for view-invariant human action recognition

-

17 Feb 2017
17 Feb 2017

Dense Optical Flow and Residual Network-Based Human Activity Recognition
Negar Sultana ... Md Shopon
-
Negar Sultana, et. al.Negar Sultana ... Md Shopon
01 Jan 2021
01 Jan 2021

Video Based Human Activity Detection, Recognition and Classification of actions using SVM
Jagadeesh B ... Chandrashekar M Patil
Transactions on Machine Learning and Artificial Intelligence | VOL. 6
Jagadeesh B, et. al.Jagadeesh B ... Chandrashekar M Patil
31 Dec 2019
Transactions on Machine Learning and Artificial Intelligence | VOL. 6

A Thermal Camera-based Activity Recognition Using Discriminant Skeleton Features and RNN
Md Zia Uddin ... Weria Khaksar
-
Md Zia Uddin, et. al.Md Zia Uddin ... Weria Khaksar
01 Jul 2019
01 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Human Action Recognition Based on Motion Feature and Manifold Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access