Multi Modal RGB D Action Recognition with CNN LSTM Ensemble Deep Network

D Srihari,P V

doi:10.14569/ijacsa.2020.0111284

Abstract

Human action recognition has transformed from a video processing problem into multi modal machine learning problem. The objective of this work is to perform multi modal human action recognition on an ensemble hybrid network of CNN and LSTM layers. The proposed CNN - LSTM ensemble network is a 2 - stream framework with one ensemble stream learning RGB sequences and the other depth. This proposed framework can learn both temporal and spatial dynamics in both RGB and depth modal action data. The hybrid network is found to be receptive towards both spatial and temporal fields because of the hierarchical structure of CNNs and LSTMs. Finally, to test our proposed model, we used our own BVCAction3D and three RGB D benchmark action datasets. The experiments were conducted on all the datasets using the proposed framework and was found to be effective when compared to similar deep learning architectures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2020
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Multi Modal RGB D Action Recognition with CNN LSTM Ensemble Deep Network

Abstract

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
Muhammad Bilal Shaikh ... Syed Muhammad Shamsul Islam
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 20
Muhammad Bilal Shaikh, et. al.Muhammad Bilal Shaikh ... Syed Muhammad Shamsul Islam
09 Jul 2024
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 20

Multi-Modal Human Action Recognition With Sub-Action Exploiting and Class-Privacy Preserved Collaborative Representation Learning
Chengwu Liang ... Lin Qi
IEEE Access | VOL. 8
Chengwu Liang, et. al.Chengwu Liang ... Lin Qi
01 Jan 2020
IEEE Access | VOL. 8

Internet-of-Things-Based Suspicious Activity Recognition Using Multimodalities of Computer Vision for Smart City Security
Amjad Rehman ... Robertas Damaševičius
Security and Communication Networks | VOL. 2022
Amjad Rehman, et. al.Amjad Rehman ... Robertas Damaševičius
05 Oct 2022
Security and Communication Networks | VOL. 2022

Review of Literature on Human Activity Detection and Recognition
Pavankumar Naik ... R Srinivasa Rao Kunte
International Journal of Management, Technology, and Social Sciences | VOL. -
Pavankumar Naik, et. al.Pavankumar Naik ... R Srinivasa Rao Kunte
23 Nov 2023
International Journal of Management, Technology, and Social Sciences | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi Modal RGB D Action Recognition with CNN LSTM Ensemble Deep Network

Abstract

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications