A Deep Sequence Learning Framework for Action Recognition in Small-Scale Depth Video Dataset.

Mohammad Farhad Bulbul,Daijin Kim,Amin Ullah,Hazrat Ali

doi:10.3390/s22186841

Abstract

Depth video sequence-based deep models for recognizing human actions are scarce compared to RGB and skeleton video sequences-based models. This scarcity limits the research advancements based on depth data, as training deep models with small-scale data is challenging. In this work, we propose a sequence classification deep model using depth video data for scenarios when the video data are limited. Unlike summarizing the frame contents of each frame into a single class, our method can directly classify a depth video, i.e., a sequence of depth frames. Firstly, the proposed system transforms an input depth video into three sequences of multi-view temporal motion frames. Together with the three temporal motion sequences, the input depth frame sequence offers a four-stream representation of the input depth action video. Next, the DenseNet121 architecture is employed along with ImageNet pre-trained weights to extract the discriminating frame-level action features of depth and temporal motion frames. The extracted four sets of feature vectors about frames of four streams are fed into four bi-directional (BLSTM) networks. The temporal features are further analyzed through multi-head self-attention (MHSA) to capture multi-view sequence correlations. Finally, the concatenated genre of their outputs is processed through dense layers to classify the input depth video. The experimental results on two small-scale benchmark depth datasets, MSRAction3D and DHA, demonstrate that the proposed framework is efficacious even for insufficient training samples and superior to the existing depth data-based action recognition methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Deep Sequence Learning Framework for Action Recognition in Small-Scale Depth Video Dataset.

Abstract

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Journal: Sensors	Publication Date: Sep 9, 2022
License type: CC BY 4.0

Similar Papers

Prototype-based budget maintenance for tracking in depth videos
Sari Awwad ... Massimo Piccardi
Multimedia Tools and Applications | VOL. 76
Sari Awwad, et. al.Sari Awwad ... Massimo Piccardi
22 Oct 2016
Multimedia Tools and Applications | VOL. 76

Spatio-Temporal Pyramid Model based on depth maps for action recognition
Haining Xu ... Enqing Chen
-
Haining Xu, et. al. Haining Xu ... Enqing Chen
01 Oct 2015
01 Oct 2015

Temporal depth video enhancement based on intrinsic static structure
Lu Sheng ... Songnan Li
-
Lu Sheng, et. al.Lu Sheng ... Songnan Li
01 Oct 2014
01 Oct 2014

Phase-based frame rate up-conversion for depth video
Lunan Zhou ... Yaowu Chen
Journal of Electronic Imaging | VOL. 27
Lunan Zhou, et. al.Lunan Zhou ... Yaowu Chen
02 Aug 2018
Journal of Electronic Imaging | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Deep Sequence Learning Framework for Action Recognition in Small-Scale Depth Video Dataset.

Abstract

Talk to us

Similar Papers

More From: Sensors