Abstract

In this paper, we present a method (Action-Fusion) for human action recognition from depth maps and posture data using convolutional neural networks (CNNs). Two input descriptors are used for action representation. The first input is a depth motion image (DMI) that accumulates consecutive depth maps of a human action, whilst the second is a proposed moving joints descriptor (MJD) that represents the motion of body joints over time. To maximize feature extraction for accurate action classification, three CNN channels are trained with different inputs: the first channel is trained with DMIs, the second with both DMIs and MJDs together, and the third with MJDs only. The action predictions generated by the three channels are fused for the final classification. We propose several fusion score operations to maximize the score of the correct action. The experiments show that fusing the outputs of all three channels yields better results than using a single channel or fusing only two. Our proposed method was evaluated on three public datasets: 1) the Microsoft action 3-D dataset (MSRAction3D); 2) the University of Texas at Dallas multimodal human action dataset (UTD-MHAD); and 3) the multimodal action dataset (MAD). The testing results indicate that the proposed approach outperforms most existing state-of-the-art methods, such as histogram of oriented 4-D normals and Actionlet on MSRAction3D. Although MAD contains a large number of actions (35) compared with existing RGB-D action datasets, the proposed method surpasses a state-of-the-art method on it by 6.84%.
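To make the pipeline concrete, the sketch below illustrates the two key steps in NumPy: accumulating a depth sequence into a DMI-style image, and late-fusing the per-class softmax scores of the three CNN channels. The exact DMI construction and fusion operators used in the paper may differ; `depth_motion_image` and `fuse_scores` are hypothetical helpers for illustration, not the authors' code.

```python
import numpy as np

def depth_motion_image(depth_frames):
    """Accumulate motion energy over a depth sequence.

    A common DMI-style formulation (an assumption here, not necessarily
    the paper's exact construction): sum the absolute differences between
    consecutive depth maps, then normalize to [0, 1].
    """
    frames = np.asarray(depth_frames, dtype=np.float32)  # shape (T, H, W)
    motion = np.abs(np.diff(frames, axis=0)).sum(axis=0)  # (H, W) energy map
    return motion / (motion.max() + 1e-8)

def fuse_scores(s1, s2, s3, mode="avg"):
    """Late fusion of per-class softmax scores from the three channels.

    'avg', 'prod', and 'max' are illustrative fusion operations; the paper
    proposes several score-fusion rules, which may differ from these.
    """
    stacked = np.stack([s1, s2, s3])  # (3, num_classes)
    if mode == "avg":
        fused = stacked.mean(axis=0)   # average the channel scores
    elif mode == "prod":
        fused = stacked.prod(axis=0)   # multiply the channel scores
    else:
        fused = stacked.max(axis=0)    # element-wise maximum
    return int(np.argmax(fused))       # predicted action label
```

Late fusion of this kind keeps the channels independent at training time, so each can specialize on its own descriptor before their evidence is combined at the score level.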
