RGB-D Data-Based Action Recognition: A Review.

Muhammad Bilal Shaikh, Douglas Chai

doi:10.3390/s21124246

Open Access

https://doi.org/10.3390/s21124246

Journal: Sensors (Basel, Switzerland)
Publication Date: Jun 21, 2021
Citations: 38
License type: CC BY 4.0

Affiliation: Edith Cowan University

Abstract

Classification of human actions is an ongoing research problem in computer vision. This review aims to scope the current literature on data fusion and action recognition techniques and to identify gaps and future research directions. Success in producing cost-effective and portable vision-based sensors has dramatically increased the number and size of datasets. This growth in action recognition datasets coincides with advances in deep learning architectures and computational support, both of which offer significant research opportunities. Each action-data modality, such as RGB, depth, skeleton, and infrared (IR), has distinct characteristics; it is therefore important to exploit the value of each modality for better action recognition. In this paper, we focus solely on data fusion and recognition techniques in the context of vision with an RGB-D perspective. We conclude by discussing research challenges, emerging trends, and possible future research directions.

Highlights

Human action recognition (HAR) has recently gained increasing attention from computer vision researchers, with applications in robot vision, multimedia content search, video surveillance, and motion tracking systems.
The following subsections discuss the fundamental variants of neural networks, and later we present some modern deep learning-based approaches used with RGB-D data.
Because performance demands rely on high-end hardware, support for multiple graphics processing units (GPUs) is a must when experimenting with big data-related problems.

Summary

Introduction

Human action recognition (HAR) has recently gained increasing attention from computer vision researchers, with applications in robot vision, multimedia content search, video surveillance, and motion tracking systems. The development of low-cost sensors such as Microsoft Kinect [1], Intel RealSense [2], and Orbbec [3] has sparked further research into action recognition. These sensors collect data in various modalities such as RGB video, depth, skeleton, and IR. Each of these modalities has its own characteristics that can help address challenges related to action data and give computer vision researchers the opportunity to examine vision data from different perspectives. RGB-D data acquisition and the sensors preferred by consumers are discussed in the following subsections.
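
As a rough illustration of how several modalities might be combined, the sketch below contrasts two of the fusion strategies named in the outline of this review: early fusion, shown here as concatenating per-modality feature vectors before classification, and late fusion, shown here as averaging per-modality class scores. The function names, feature values, and scores are hypothetical and are not taken from the review; averaging is only one of several possible late-fusion rules.

```python
from typing import Dict, List


def early_fusion(features: Dict[str, List[float]]) -> List[float]:
    """Concatenate per-modality feature vectors in a fixed (sorted) modality order."""
    fused: List[float] = []
    for modality in sorted(features):
        fused.extend(features[modality])
    return fused


def late_fusion(scores: Dict[str, List[float]]) -> List[float]:
    """Average per-modality class scores element-wise (assumed fusion rule)."""
    if not scores:
        raise ValueError("at least one modality is required")
    n_classes = len(next(iter(scores.values())))
    if any(len(s) != n_classes for s in scores.values()):
        raise ValueError("all modalities must score the same number of classes")
    return [
        sum(s[c] for s in scores.values()) / len(scores)
        for c in range(n_classes)
    ]


def predict(fused_scores: List[float]) -> int:
    """Return the index of the highest fused class score."""
    return max(range(len(fused_scores)), key=fused_scores.__getitem__)


if __name__ == "__main__":
    # Hypothetical feature vectors for one clip, one per modality.
    clip_features = {"rgb": [0.2, 0.7], "depth": [0.5], "skeleton": [0.1, 0.9, 0.3]}
    print(early_fusion(clip_features))

    # Hypothetical class scores from three independent per-modality classifiers.
    clip_scores = {
        "rgb": [0.6, 0.3, 0.1],
        "depth": [0.2, 0.5, 0.3],
        "skeleton": [0.1, 0.7, 0.2],
    }
    print(predict(late_fusion(clip_scores)))
```

In this toy setup the RGB classifier alone would pick class 0, while the fused scores favour class 1, which is the kind of disagreement between modalities that fusion is meant to resolve.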

RGB-D Data Acquisition

RGB-D Sensors

Classical Machine Learning-Based Techniques

Depth Data-Based Techniques

Skeleton Sequence-Based Techniques

RGB-D Data-Based Techniques

Deep Learning

Neural Networks Variants

Deep Learning-Based Techniques Using RGB-D Data

Single Stream

Two Stream

Hybrid Deep Learning-Based Techniques for HAR

Data Fusion Techniques

Early Fusion

Slow Fusion

Late Fusion

Multi-Resolution

Content-Based Video Summarization

Education and Learning

Healthcare Systems

Entertainment Systems

Safety and Surveillance Systems

Sports

Challenges in RGB-D Data Fusion

Combination of Classical Machine Learning and Deep Learning-Based Methods

Assessment in Practical Scenarios

Self-Learning

Interpretation of Online Human Actions

Multimodal Fusion

Conclusions
