Human Action Recognition Using Deep Multilevel Multimodal (${M}^{2}$ ) Fusion of Depth and Inertial Sensors

Zeeshan Ahmad,Naimul Khan

doi:10.1109/jsen.2019.2947446

Abstract

Multimodal fusion frameworks for Human Action Recognition (HAR) using depth and inertial sensor data have been proposed over the years. In most of the existing works, fusion is performed at a single level (feature level or decision level), missing the opportunity to fuse rich mid-level features necessary for better classification. To address this shortcoming, in this paper, we propose three novel deep multilevel multimodal (M2) fusion frameworks to capitalize on different fusion strategies at various stages and to leverage the superiority of multilevel fusion. At input, we transform the depth data into depth images called sequential front view images (SFIs) and inertial sensor data into signal images. Each input modality, depth and inertial, is further made multimodal by taking convolution with the Prewitt filter. Creating “modality within modality” enables further complementary and discriminative feature extraction through Convolutional Neural Networks (CNNs). CNNs are trained on input images of each modality to learn low-level, high-level and complex features. Learned features are extracted and fused at different stages of the proposed frameworks to combine discriminative and complementary information. These highly informative features are served as input to a multi-class Support Vector Machine (SVM). We evaluate the proposed frameworks on three publicly available multimodal HAR datasets, namely, UTD Multimodal Human Action Dataset (MHAD), Berkeley MHAD, and UTD-MHAD Kinect V2. Experimental results show the supremacy of the proposed fusion frameworks over existing methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Human Action Recognition Using Deep Multilevel Multimodal (${M}^{2}$ ) Fusion of Depth and Inertial Sensors

Abstract

Talk to us

Similar Papers

More From: IEEE Sensors Journal

Lead the way for us

Journal: IEEE Sensors Journal	Publication Date: Feb 1, 2020
Citations: 80

Similar Papers

Multimodal Sensor Fusion Frameworks With Application to Human Action Recognition
Zeeshan Ahmad
-
Zeeshan AhmadZeeshan Ahmad
16 Feb 2024
16 Feb 2024

Multimodal Sensor Fusion Frameworks With Application to Human Action Recognition
Zeeshan Ahmad
-
Zeeshan AhmadZeeshan Ahmad
16 Feb 2024
16 Feb 2024

Towards Improved Human Action Recognition Using Convolutional Neural Networks and Multimodal Fusion of Depth and Inertial Sensor Data
Zeeshan Ahmad ... Naimul Khan
-
Zeeshan Ahmad, et. al.Zeeshan Ahmad ... Naimul Khan
01 Dec 2018
01 Dec 2018

CNN-Based Multistage Gated Average Fusion (MGAF) for Human Action Recognition Using Depth and Inertial Sensors
Zeeshan Ahmad ... Naimul Khan
IEEE Sensors Journal | VOL. 21
Zeeshan Ahmad, et. al.Zeeshan Ahmad ... Naimul Khan
06 Oct 2020
IEEE Sensors Journal | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Human Action Recognition Using Deep Multilevel Multimodal (${M}^{2}$ ) Fusion of Depth and Inertial Sensors

Abstract

Talk to us

Similar Papers

More From: IEEE Sensors Journal