Human Action Recognition Based on Integrating Body Pose, Part Shape, and Motion

Hany El-Ghaish, Amin Shoukry, Rikio Onai, Mohamed E Hussien

doi:10.1109/access.2018.2868319

Open Access

Abstract

Human action recognition is a challenging problem, especially in the presence of multiple actors in the scene and/or viewpoint variations. In this paper, three modalities, namely 3-D skeletons, body-part images, and the motion history image (MHI), are integrated into a hybrid deep learning architecture for human action recognition. The three modalities capture the main aspects of an action: body pose, part shape, and body motion. Although the 3-D skeleton modality captures the actor’s pose, it lacks information about the shape of the body parts and of manipulated objects, which is why the body-part images and the MHI are included as additional modalities. The architecture combines convolutional neural networks (CNNs), long short-term memory (LSTM), and a fine-tuned pre-trained network into a hybrid model called MCLP: multi-modal CNN + LSTM + VGG16 pre-trained on ImageNet. MCLP consists of three sub-models, CL1D (CNN1D + LSTM), CL2D (CNN2D + LSTM), and CMHI (CNN2D for MHI), which simultaneously extract the spatial and temporal patterns in the three modalities. The decisions of these three sub-models are fused by a late multiply fusion module, which yielded better accuracy than average or maximum fusion. The proposed combined model and its sub-models have been evaluated both individually and collectively on four public data sets: UTKinect-Action3D, SBU Interaction, Florence3-D Action, and NTU RGB+D. Our recognition rates outperform the state-of-the-art rates on all the evaluated data sets.
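
The late fusion step described in the abstract can be illustrated with a minimal sketch. It assumes each sub-model outputs a per-class probability vector and that "multiply fusion" means an element-wise product of those vectors; the abstract does not spell out the exact formulation, so the function names (`fuse`, `predict`) and the score values below are hypothetical and are not taken from the paper.

```python
from math import prod

def fuse(score_sets, method="multiply"):
    """Fuse per-class score vectors from several sub-models into one vector.

    score_sets: list of equal-length lists, one per sub-model.
    method: "multiply", "average", or "max" (assumed names for the three
            late fusion rules mentioned in the abstract).
    """
    per_class = list(zip(*score_sets))  # regroup the scores class by class
    if method == "multiply":
        fused = [prod(scores) for scores in per_class]
    elif method == "average":
        fused = [sum(scores) / len(scores) for scores in per_class]
    elif method == "max":
        fused = [max(scores) for scores in per_class]
    else:
        raise ValueError(f"unknown fusion method: {method}")
    total = sum(fused)
    return [value / total for value in fused]  # renormalize to sum to 1

def predict(score_sets, method="multiply"):
    """Return the index of the class with the highest fused score."""
    fused = fuse(score_sets, method)
    return max(range(len(fused)), key=fused.__getitem__)

# Hypothetical softmax outputs for a 3-class problem (illustrative only).
cl1d_scores = [0.70, 0.25, 0.05]  # skeleton branch
cl2d_scores = [0.05, 0.60, 0.35]  # body-part image branch
cmhi_scores = [0.60, 0.35, 0.05]  # MHI branch
all_scores = [cl1d_scores, cl2d_scores, cmhi_scores]

for method in ("multiply", "average", "max"):
    print(method, predict(all_scores, method))
```

With these made-up scores, the product rule selects class 1 while averaging and maximum both select class 0, because a single sub-model assigning a very low score to a class strongly suppresses it under multiplication. This shows only how the three rules can disagree; it says nothing about which rule is more accurate on real data.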

Journal: IEEE Access
Publication Date: Jan 1, 2018
Citations: 19
License type: cc-by-nc-nd

Similar Papers

A Comparison Between Various Human Detectors and CNN-Based Feature Extractors for Human Activity Recognition via Aerial Captured Video Sequences
Nouar Aldahoul ... Aznul Qalid Md Sabri
IEEE Access | VOL. 10
01 Jan 2021

Prediction of Sufficient Accuracy for Human Activity Recognition using Novel Long Short Term Memory in Compared with Decision Tree
Sai Charan ... R Surendran
23 Mar 2023

Deep learning-based multi-modal approach using RGB and skeleton sequences for human activity recognition
Pratishtha Verma ... Animesh Sah
Multimedia Systems | VOL. 26
25 Jul 2020

Bimodal HAR-An efficient approach to human activity analysis and recognition using bimodal hybrid classifiers
K Venkatachalam ... Weiping Ding
Information Sciences | VOL. 628
01 Feb 2023
