Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos

Sebastian Agethen,Winston H Hsu

doi:10.1109/tmm.2019.2932564

Abstract

Action recognition greatly benefits motion understanding in video analysis. Recurrent networks such as long short-term memory (LSTM) networks are a popular choice for motion-aware sequence learning tasks. Recently, a convolutional extension of LSTM was proposed, in which input-to-hidden and hidden-to-hidden transitions are modeled through convolution with a single kernel. This implies an unavoidable trade-off between effectiveness and efficiency. Herein, we propose a new enhancement to convolutional LSTM networks that supports accommodation of multiple convolutional kernels and layers. This resembles a Network-in-LSTM approach, which improves upon the aforementioned concern. In addition, we propose an attention-based mechanism that is specifically designed for our multi-kernel extension. We evaluated our proposed extensions in a supervised classification setting on the UCF-101 and Sports-1M datasets, with the findings showing that our enhancements improve accuracy. We also undertook qualitative analysis to reveal the characteristics of our system and the convolutional LSTM baseline.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia

Lead the way for us

Journal: IEEE Transactions on Multimedia	Publication Date: Aug 13, 2019
Citations: 56

Similar Papers

Skeleton-based human activity recognition using ConvLSTM and guided feature learning
Santosh Kumar Yadav ... Hari Mohan Pandey
Soft Computing | VOL. 26
Santosh Kumar Yadav, et. al.Santosh Kumar Yadav ... Hari Mohan Pandey
01 Oct 2021
Soft Computing | VOL. 26

Planning of Service Mobile Robot Based on Convolutional LSTM Network
Shuai Yin ... Arkady Yuschenko
Journal of Physics: Conference Series | VOL. 1828
Shuai Yin, et. al.Shuai Yin ... Arkady Yuschenko
01 Feb 2021
Journal of Physics: Conference Series | VOL. 1828

Short-Term Demand Forecast of E-Commerce Platform Based on ConvLSTM Network.
Zan Li ... Nairen Zhang
Computational intelligence and neuroscience | VOL. 2022
Zan Li, et. al.Zan Li ... Nairen Zhang
14 Jul 2022
Computational intelligence and neuroscience | VOL. 2022

Image Denoising with Deep Convolutional and Multi-directional LSTM Networks under Poisson Noise Environments
Teerawat Piriyatharawet ... Wuttipong Kumwilaisak
-
Teerawat Piriyatharawet, et. al.Teerawat Piriyatharawet ... Wuttipong Kumwilaisak
01 Sep 2018
01 Sep 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia