Spatial Attention Adapted to a LSTM Architecture with Frame Selection for Human Action Recognition in Videos

Carlos Orozco,Maria Buemi,Julio Berlles

doi:10.52591/lxai2021072417

Spatial Attention Adapted to a LSTM Architecture with Frame Selection for Human Action Recognition in Videos

Carlos Orozco, Maria Buemi + Show 1 more

Open Access

https://doi.org/10.52591/lxai2021072417

Copy DOI

Publication Date: Jul 24, 2021

#Action Recognition In Videos #Recognition In Videos + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Action recognition in videos is currently a topic of interest in the area of computer vision, due to potential applications such as: multimedia indexing, surveillance in public spaces, among others. In this work we propose an attention mechanism adapted to a CNN–LSTM base architecture. To carry out the training and testing phases, we used the HMDB-51 and UCF-101 datasets. We evaluate the performance of our system using accuracy as the evaluation metric, obtaining 57.3% and 90.4% for HMDB-51 and UCF-101 respectively.

Full Text