ATTENTION-BASED LSTM NETWORK FOR ACTION RECOGNITION IN SPORTS

Mohib Ullah,Ahmed Mohammed,Muhammad Mudassar Yamin,Faouzi Alaya Cheikh,Sultan Daud Khan,Habib Ullah

doi:10.2352/issn.2470-1173.2021.6.iriacv-302

Abstract

Understanding human action from the visual data is an important computer vision application for video surveillance, sports player performance analysis, and many IoT applications. The traditional approaches for action recognition used hand-crafted visual and temporal features for classifying specific actions. In this paper, we followed the standard deep learning framework for action recognition but introduced channel and spatial attention module sequentially in the network. In a nutshell, our network consists of four main components. First, the input frames are given to a pre-trained CNN for extracting the visual features and the visual features are passed through the attention module. The transformed features maps are given to the bi-directional LSTM network that exploits the temporal dependency among the frames for the underlying action in the scene. The output of bi-direction LSTM is given to a fully connected layer with a softmax classifier that assigns the probabilities to the actions of the subject in the scene. In addition to cross-entropy loss, the marginal loss function is used that penalizes the network for the inter action classes and complimenting the network for the intra action variations. The network is trained and validated on a tennis dataset and in total six tennis players' actions are focused. The network is evaluated on standard performance metrics (precision, recall) promising results are achieved.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronic Imaging	Publication Date: Jan 18, 2021
Citations: 25	License type: cc-by

R Discovery Prime

R Discovery Prime

ATTENTION-BASED LSTM NETWORK FOR ACTION RECOGNITION IN SPORTS

Abstract

Talk to us

Similar Papers

More From: Electronic Imaging

Lead the way for us

Similar Papers

Implicit Attentional Selection of Bound Visual Features
David Melcher ... Zoltán Vidnyánszky
Neuron | VOL. 46
David Melcher, et. al.David Melcher ... Zoltán Vidnyánszky
01 Jun 2005
Neuron | VOL. 46

Parameter Identification for Bernoulli Serial Production Line Model
Yuting Sun ... Tianyu Zhu
IEEE Transactions on Automation Science and Engineering | VOL. 18
Yuting Sun, et. al.Yuting Sun ... Tianyu Zhu
25 Nov 2020
IEEE Transactions on Automation Science and Engineering | VOL. 18

An Efficient QRS Complex Detection Using Optimally Designed Digital Differentiator
Chandan Nayak ... Rajib Kar
Circuits, Systems, and Signal Processing | VOL. 38
Chandan Nayak, et. al.Chandan Nayak ... Rajib Kar
21 Jun 2018
Circuits, Systems, and Signal Processing | VOL. 38

Locus of Control in College Students with and Without Visual Impairments, and the Visual Characteristics that Affect It
Javad Abbasi Jondani
Journal of Visual Impairment & Blindness | VOL. 115
Javad Abbasi JondaniJavad Abbasi Jondani
01 Jan 2020
Journal of Visual Impairment & Blindness | VOL. 115

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ATTENTION-BASED LSTM NETWORK FOR ACTION RECOGNITION IN SPORTS

Abstract

Talk to us

Similar Papers

More From: Electronic Imaging