Abstract
Human action recognition is a complex problem that attracts growing interest from the scientific community due to its applicability in domains such as security and behavior analysis. At its core, the problem entails classifying an action into a finite set of classes. Neural network based approaches, and especially convolutional neural networks, are a good starting point for solving the problem of human action recognition. Due to their nature, they recognize spatio-temporal features very well, making them well suited to working with sequences of RGB images. This paper proposes three types of convolutional neural network architectures for human action recognition. The first is based on 2D kernels, the second on 3D kernels, and the third on TCN (Temporal Convolutional Network) units. Each architecture is presented with its structure, advantages, and disadvantages, along with metrics that measure its performance. The model based on 2D convolutions is the fastest, but it also has the lowest accuracy. The 3D convolution-based model is a good middle ground, useful in situations that require a fast classifier covering a variety of action classes. Finally, the TCN-based model performs close to some of the best existing models and represents a viable solution to the proposed problem: it can classify many actions in real time, using only RGB images of fairly low resolution. The three models were evaluated on the RGB part of the NTU RGB+D dataset. The 2D convolution-based model obtained an accuracy of 7.43% on the Cross-Subject split and 10.28% on the Cross-View split. The 3D convolution-based model obtained 58.77% on Cross-Subject and 56.11% on Cross-View. Finally, the TCN-based model obtained an accuracy of 80.45% on Cross-Subject and 82.57% on Cross-View.
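To make the strongest of the three architecture families concrete, the sketch below shows a minimal PyTorch-style TCN-based classifier in which per-frame 2D convolutional features feed a stack of dilated causal TCN units. This is an illustrative sketch only: the layer sizes, number of TCN units, pooling choices, and the 60-class output (assuming the original NTU RGB+D label set) are assumptions for demonstration, not the paper's exact configuration.

import torch
import torch.nn as nn
import torch.nn.functional as F

class TCNBlock(nn.Module):
    """One temporal convolutional (TCN) unit: a dilated causal 1D convolution
    over the time axis with a residual connection."""
    def __init__(self, channels: int, kernel_size: int = 3, dilation: int = 1):
        super().__init__()
        # Left-padding keeps the convolution causal (no future frames are used).
        self.pad = (kernel_size - 1) * dilation
        self.conv = nn.Conv1d(channels, channels, kernel_size, dilation=dilation)
        self.relu = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, frames)
        out = self.relu(self.conv(F.pad(x, (self.pad, 0))))
        return out + x  # residual connection

class TCNActionClassifier(nn.Module):
    """Per-frame 2D CNN features followed by stacked TCN units and a
    classification head over the action classes (illustrative sizes)."""
    def __init__(self, num_classes: int = 60, feat_dim: int = 128):
        super().__init__()
        # Illustrative per-frame feature extractor based on 2D convolutions.
        self.frame_cnn = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        # Stacked TCN units with growing dilation widen the temporal receptive field.
        self.tcn = nn.Sequential(*[TCNBlock(feat_dim, dilation=2 ** i) for i in range(3)])
        self.head = nn.Linear(feat_dim, num_classes)

    def forward(self, clip: torch.Tensor) -> torch.Tensor:
        # clip: (batch, frames, 3, H, W) -- a sequence of RGB images
        b, t = clip.shape[:2]
        feats = self.frame_cnn(clip.flatten(0, 1)).view(b, t, -1)  # (b, t, feat_dim)
        feats = self.tcn(feats.transpose(1, 2))                    # (b, feat_dim, t)
        return self.head(feats.mean(dim=2))                        # average over time

# Example: a batch of two 16-frame clips at a fairly low 112x112 resolution.
logits = TCNActionClassifier()(torch.randn(2, 16, 3, 112, 112))
print(logits.shape)  # torch.Size([2, 60])

The growing dilation rates (1, 2, 4) are what let a small stack of TCN units cover a long span of frames without 3D kernels, which is the property the abstract credits for combining high accuracy with real-time operation.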