Abstract

Automated recognition of human activities or actions has great significance because of its wide-ranging applications, including surveillance, robotics, and personal health monitoring. Over the past few years, many computer vision-based methods have been developed for recognizing human actions from RGB and depth camera videos. These methods include space-time trajectories, motion encoding, key pose extraction, space-time occupancy patterns, depth motion maps, and skeleton joints. However, these camera-based approaches are affected by background clutter and illumination changes and are applicable only to a limited field of view. Wearable inertial sensors provide a viable solution to these challenges but are subject to several limitations of their own, such as sensitivity to location and orientation. Because the data obtained from cameras and inertial sensors are complementary, the use of multiple sensing modalities for accurate recognition of human actions is steadily increasing. This paper presents a multimodal feature-level fusion approach for robust human action recognition that utilizes data from multiple sensors, including an RGB camera, a depth sensor, and wearable inertial sensors. We extract computationally efficient features from the data obtained from the RGB-D video camera and the inertial body sensors. These features include densely extracted histogram of oriented gradient (HOG) features from the RGB/depth videos and statistical signal attributes from the wearable sensor data. The proposed human action recognition (HAR) framework is evaluated on UTD-MHAD, a publicly available multimodal human action dataset consisting of 27 different human actions. K-nearest neighbor and support vector machine classifiers are used for training and testing the proposed fusion model. The experimental results indicate that the proposed scheme achieves better recognition results than the state of the art. The feature-level fusion of RGB and inertial sensor data provides the best overall performance for the proposed system, with an accuracy of 97.6%.
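To make the fusion pipeline concrete, the following is a minimal sketch of the feature-level fusion idea described above: per-frame HOG descriptors from video, statistical attributes from inertial signals, concatenation into a single feature vector, and classification with SVM or k-NN. It assumes the data have already been loaded into arrays; the HOG parameters, statistical attributes, classifier settings, and helper names (video_hog_features, inertial_stat_features, fuse) are illustrative assumptions, not the authors' exact configuration.

```python
# Sketch of feature-level fusion for HAR (illustrative, not the paper's exact setup).
import numpy as np
from skimage.feature import hog
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

def video_hog_features(frames):
    """Densely extracted HOG descriptors per grayscale frame (T, H, W), averaged over the clip."""
    descriptors = [
        hog(f, orientations=9, pixels_per_cell=(8, 8), cells_per_block=(2, 2))
        for f in frames
    ]
    return np.mean(descriptors, axis=0)

def inertial_stat_features(signal):
    """Simple statistical attributes of a wearable-sensor signal of shape (T, channels)."""
    return np.concatenate([
        signal.mean(axis=0),
        signal.std(axis=0),
        signal.min(axis=0),
        signal.max(axis=0),
    ])

def fuse(rgb_frames, inertial_signal):
    """Feature-level fusion: concatenate the modality-specific feature vectors."""
    return np.concatenate([
        video_hog_features(rgb_frames),
        inertial_stat_features(inertial_signal),
    ])

# Training/testing with SVM and k-NN, assuming X_train/X_test were built by
# applying fuse() to each sample and y_train/y_test hold the 27 action labels.
def evaluate(X_train, y_train, X_test, y_test):
    svm = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
    knn = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=3))
    svm.fit(X_train, y_train)
    knn.fit(X_train, y_train)
    return svm.score(X_test, y_test), knn.score(X_test, y_test)
```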
