Abstract

Introduction: Convolutional neural networks (CNNs) have maintained their dominance among deep learning methods for human action recognition (HAR) and other computer vision tasks. However, the need for a large amount of training data restricts the performance of CNNs. Method: This paper is inspired by the two-stream network, in which one CNN is trained on the spatial aspect of an activity and another on its temporal aspect, exploiting the strengths of both streams to achieve better accuracy. Contributions: Our contribution is twofold. First, we deploy an enhanced spatial stream and demonstrate that using a model pre-trained on a larger dataset in the spatial stream yields better performance than training the entire model from scratch. Second, we present a dataset augmentation technique that minimizes overfitting of CNNs by increasing the dataset size through transformations of the images, such as rotation and flipping. Results: Our architecture has been trained and validated on UCF101, a standard benchmark dataset of action videos, and it outperforms other two-stream networks in terms of accuracy.
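To make the two contributions concrete, the following is a minimal PyTorch sketch, not the authors' actual implementation: it assumes an ImageNet pre-trained ResNet-50 as the spatial-stream backbone and illustrative augmentation parameters (the abstract names rotation and flipping but specifies neither the backbone nor the transform settings).

```python
import torch.nn as nn
from torchvision import models, transforms

# Augmentation pipeline mirroring the transforms named in the abstract
# (rotation and horizontal flipping); the degree range and flip
# probability here are illustrative assumptions.
train_transforms = transforms.Compose([
    transforms.RandomRotation(degrees=15),
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

# Spatial stream: a model pre-trained on a larger dataset (ImageNet),
# with only the classifier head replaced for UCF101's 101 action
# classes, rather than training the whole network from scratch.
spatial_stream = models.resnet50(
    weights=models.ResNet50_Weights.IMAGENET1K_V1)
spatial_stream.fc = nn.Linear(spatial_stream.fc.in_features, 101)
```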
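For the two-stream combination itself, a common scheme is late fusion of the per-class scores from the spatial (RGB) and temporal (optical-flow) streams. The sketch below assumes simple score averaging; the paper's actual fusion strategy is not stated in the abstract and may differ.

```python
import torch

def fuse_predictions(spatial_logits: torch.Tensor,
                     temporal_logits: torch.Tensor) -> torch.Tensor:
    """Late fusion: average the per-class softmax scores produced by
    the spatial and temporal streams for the same video clip."""
    spatial_scores = torch.softmax(spatial_logits, dim=1)
    temporal_scores = torch.softmax(temporal_logits, dim=1)
    return (spatial_scores + temporal_scores) / 2
```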
