Action Recognition with Dynamic Image Networks

Hakan Bilen,Andrea Vedaldi,Efstratios Gavves,Basura Fernando

doi:10.1109/tpami.2017.2769085

Abstract

We introduce the concept of dynamic image, a novel compact representation of videos useful for video analysis, particularly in combination with convolutional neural networks (CNNs). A dynamic image encodes temporal data such as RGB or optical flow videos by using the concept of 'rank pooling'. The idea is to learn a ranking machine that captures the temporal evolution of the data and to use the parameters of the latter as a representation. We call the resulting representation dynamic image because it summarizes the video dynamics in addition to appearance. This powerful idea allows to convert any video to an image so that existing CNN models pre-trained with still images can be immediately extended to videos. We also present an efficient approximate rank pooling operator that runs two orders of magnitude faster than the standard ones with any loss in ranking performance and can be formulated as a CNN layer. To demonstrate the power of the representation, we introduce a novel four stream CNN architecture which can learn from RGB and optical flow frames as well as from their dynamic image representations. We show that the proposed network achieves state-of-the-art performance, 95.5 and 72.5 percent accuracy, in the UCF101 and HMDB51, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Action Recognition with Dynamic Image Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Nov 2, 2017
Citations: 262

Similar Papers

Dynamic Image Networks for Action Recognition
Hakan Bilen ... Andrea Vedaldi
-
Hakan Bilen, et. al.Hakan Bilen ... Andrea Vedaldi
01 Jun 2016
01 Jun 2016

3D convolutional neural network with multi-model framework for action recognition
Longlong Jing ... Yuancheng Ye
-
Longlong Jing, et. al.Longlong Jing ... Yuancheng Ye
01 Sep 2017
01 Sep 2017

Action Recognition in Videos Using Pre-Trained 2D Convolutional Neural Networks
Jun-Hwa Kim ... Chee Sun Won
IEEE Access | VOL. 8
Jun-Hwa Kim, et. al.Jun-Hwa Kim ... Chee Sun Won
01 Jan 2020
IEEE Access | VOL. 8

Discriminatively Learned Hierarchical Rank Pooling Networks
Basura Fernando ... Stephen Gould
International Journal of Computer Vision | VOL. 124
Basura Fernando, et. al.Basura Fernando ... Stephen Gould
24 Jun 2017
International Journal of Computer Vision | VOL. 124

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Action Recognition with Dynamic Image Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence