3D convolutional neural network with multi-model framework for action recognition

Longlong Jing,Yingli Tian,Yuancheng Ye,Xiaodong Yang

doi:10.1109/icip.2017.8296599

3D convolutional neural network with multi-model framework for action recognition

Longlong Jing, Yingli Tian + Show 2 more

https://doi.org/10.1109/icip.2017.8296599

Copy DOI

Publication Date: Sep 1, 2017

Citations: 33

Affiliation: The Graduate Center, CUNY, City University of New York, City College, City College of New York, Nvidia (United States)

#Raw Frame #3D Convolutional Neural Network + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In this paper, we propose an efficient and effective action recognition framework by combining multiple feature models from dynamic image, optical flow and raw frame, with 3D convolutional neural network (CNN). Dynamic image preserves the long-term temporal information, while optical flow captures short-term temporal information, and raw frame represents the appearance information. Experiments demonstrate that dynamic image provides complementary information to raw frame feature and optical flow feature. Furthermore, with the approximate rank pooling, the computation of dynamic images is about 360 times faster than optical flow, and the dynamic image requires far less memory than optical flow and raw frame.

Full Text