Self-Supervised Video Representation Learning with Motion-Contrastive Perception

Jinyu Liu,Rui-Wei Zhao,Rui Feng,Ying Cheng,Yuejie Zhang

doi:10.1109/icme52920.2022.9859802

Abstract

Visual-only self-supervised learning has achieved significant improvement in video representation learning. Existing related methods encourage models to learn video representations by utilizing contrastive learning or designing specific pretext tasks. However, some models are likely to focus on the background, which is unimportant for learning video representations. To alleviate this problem, we propose a new view called long-range residual frame to obtain more motion-specific information. Based on this, we propose the Motion-Contrastive Perception Network (MCPNet), which consists of two branches, namely, Motion Information Perception (MIP) and Contrastive Instance Perception (CIP), to learn generic video representations by focusing on the changing areas in videos. Specifically, the MIP branch aims to learn fine-grained motion features, and the CIP branch performs contrastive learning to learn overall semantics information for each instance. Experiments on two benchmark datasets UCF-101 and HMDB-51 show that our method outperforms current state-of-the-art visual-only self-supervised approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Self-Supervised Video Representation Learning with Motion-Contrastive Perception

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Study on Various Self-supervised Video Representation Learning Methods
Soohyun Park ... Jongwon Choi
Moving Image & Technology (MINT) | VOL. 2
Soohyun Park, et. al.Soohyun Park ... Jongwon Choi
31 Aug 2022
Moving Image & Technology (MINT) | VOL. 2

Self-Supervised Spatiotemporal Representation Learning by Exploiting Video Continuity
Hanwen Liang ... Yang Wang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Hanwen Liang, et. al.Hanwen Liang ... Yang Wang
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

Video Representation Learning with Graph Contrastive Augmentation
Jingran Zhang ... Fumin Shen
-
Jingran Zhang, et. al.Jingran Zhang ... Fumin Shen
17 Oct 2021
17 Oct 2021

Similarity contrastive estimation for image and video soft contrastive self-supervised learning
Julien Denize ... Romain Hérault
Machine Vision and Applications | VOL. 34
Julien Denize, et. al.Julien Denize ... Romain Hérault
26 Sep 2023
Machine Vision and Applications | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Self-Supervised Video Representation Learning with Motion-Contrastive Perception

Abstract

Talk to us

Similar Papers