Memory-Augmented Dense Predictive Coding for Video Representation Learning

Tengda Han,Andrew Zisserman,Weidi Xie

doi:10.1007/978-3-030-58580-8_19

Abstract

The objective of this paper is self-supervised learning from video, in particular for representations for action recognition. We make the following contributions: (i) We propose a new architecture and learning framework Memory-augmented Dense Predictive Coding (MemDPC) for the task. It is trained with a predictive attention mechanism over the set of compressed memories, such that any future states can always be constructed by a convex combination of the condensed representations, allowing to make multiple hypotheses efficiently. (ii) We investigate visual-only self-supervised video representation learning from RGB frames, or from unsupervised optical flow, or both. (iii) We thoroughly evaluate the quality of the learnt representation on four different downstream tasks: action recognition, video retrieval, learning with scarce annotations, and unintentional action classification. In all cases, we demonstrate state-of-the-art or comparable performance over other approaches with orders of magnitude fewer training data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Memory-Augmented Dense Predictive Coding for Video Representation Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Video Representation Learning by Dense Predictive Coding
Tengda Han ... Weidi Xie
-
Tengda Han, et. al.Tengda Han ... Weidi Xie
01 Oct 2019
01 Oct 2019

Study on Various Self-supervised Video Representation Learning Methods
Soohyun Park ... Jongwon Choi
Moving Image & Technology (MINT) | VOL. 2
Soohyun Park, et. al.Soohyun Park ... Jongwon Choi
31 Aug 2022
Moving Image & Technology (MINT) | VOL. 2

Video Representation Learning with Graph Contrastive Augmentation
Jingran Zhang ... Xing Xu
-
Jingran Zhang, et. al.Jingran Zhang ... Xing Xu
17 Oct 2021
17 Oct 2021

A Novel Multi-Task Self-Supervised Representation Learning Paradigm
Yinggang Li ... Qi Zhang
Control theory & applications | VOL. -
Yinggang Li, et. al.Yinggang Li ... Qi Zhang
28 May 2021
Control theory & applications | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Memory-Augmented Dense Predictive Coding for Video Representation Learning

Abstract

Talk to us

Similar Papers