ECPNet: An Efficient Attention-Based Convolution Network with Pseudo-3D Block for Human Action Recognition

Xiuping Bao,Jiabin Yuan,Bei Chen

doi:10.1109/ictai.2019.00089

Abstract

Human action recognition has became an important task in computer vision and has received a significant amount of research interests in recent years. Convolutional Neural Network (CNN) has shown its power in image recognition task. While in the field of video recognition, it is still a challenge problem. In this paper, we introduce a high-efficient attention-based convolutional network named ECPNet for video understanding. ECPNet adopts the framework that is a consecutive connection of 2D CNN and pseudo-3D CNN. The pseudo-3D means we replace the traditional 3 × 3 × 3 kernel with two 3D convolutional filters shaped 1 × 3 × 3 and 3 × 1 × 1. Our ECPNet combines the advantages of both 2D and 3D CNNs: (1) ECPNet is an end-to-end network and can learn information of appearance from images and motion between frames. (2) ECPNet requires less computing resource and lower memory consumption than many state-of-art models. (3) ECPNet is easy to expand for different demands of runtime and classification accuracy. We evaluate the proposed model on three popular video benchmarks in human action recognition task: Kinetics-mini (split of full Kinetics), UCF101 and HMDB51. Our ECPNet achieves the excellent performance on above datasets with less time cost.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ECPNet: An Efficient Attention-Based Convolution Network with Pseudo-3D Block for Human Action Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Retracted] Visual Sensing Human Motion Detection System for Interactive Music Teaching
Xunyun Chang ... Liangqing Peng
Journal of Sensors | VOL. 2021
Xunyun Chang, et. al.Xunyun Chang ... Liangqing Peng
01 Jan 2020
Journal of Sensors | VOL. 2021

An Attention-based Hybrid 2D/3D CNN-LSTM for Human Action Recognition
Khaled Bayoudh ... Faycal Hamdaoui
-
Khaled Bayoudh, et. al.Khaled Bayoudh ... Faycal Hamdaoui
25 Jan 2022
25 Jan 2022

Multi‐mode neural network for human action recognition
Haohua Zhao ... Liqing Zhang
IET Computer Vision | VOL. 14
Haohua Zhao, et. al.Haohua Zhao ... Liqing Zhang
06 Nov 2020
IET Computer Vision | VOL. 14

Combining Multi-Dimensional Convolutional Neural Network (CNN) With Visualization Method for Detection of Aphis gossypii Glover Infection in Cotton Leaves Using Hyperspectral Imaging.
Tianying Yan ... Pan Gao
Frontiers in Plant Science | VOL. 12
Tianying Yan, et. al.Tianying Yan ... Pan Gao
15 Feb 2021
Frontiers in Plant Science | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ECPNet: An Efficient Attention-Based Convolution Network with Pseudo-3D Block for Human Action Recognition

Abstract

Talk to us

Similar Papers