UniVIP: A Unified Framework for Self-Supervised Visual Pre-training

Zhaowen Li, Ming Tang, Rui Zhao, Jiahao Xie, Yousong Zhu, Wei Li, Jinqiao Wang, Chaoyang Zhao, Yingying Chen, Fan Yang, Zhiyang Chen, Liwei Wu

doi:10.1109/cvpr52688.2022.01422

Abstract

Self-supervised learning (SSL) holds promise in leveraging large amounts of unlabeled data. However, the success of popular SSL methods has been limited to single-centric-object images like those in ImageNet, and these methods ignore the correlation between the scene and its instances, as well as the semantic differences among instances in the scene. To address these problems, we propose Unified Self-supervised Visual Pre-training (UniVIP), a novel self-supervised framework for learning versatile visual representations on either single-centric-object or non-iconic datasets. The framework considers representation learning at three levels: 1) the similarity of scene-scene, 2) the correlation of scene-instance, 3) the discrimination of instance-instance. During learning, we adopt the optimal transport algorithm to automatically measure the discrimination of instances. Extensive experiments show that UniVIP pre-trained on non-iconic COCO achieves state-of-the-art transfer performance on a variety of downstream tasks, such as image classification, semi-supervised learning, object detection and segmentation. Furthermore, our method can also exploit single-centric-object datasets such as ImageNet: it outperforms BYOL by 2.5% in linear probing with the same number of pre-training epochs and surpasses current self-supervised object detection methods on the COCO dataset, demonstrating its universality and potential.
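
The abstract states that optimal transport is used to measure instance-instance discrimination but gives no implementation details. The following is a minimal sketch of one common way to compute an entropy-regularized transport plan between two sets of instance features, using Sinkhorn iterations. Everything here is an assumption for illustration and is not taken from the paper: the function names (`cosine_similarity`, `sinkhorn_plan`, `instance_transport_plan`), the cosine-distance cost, the uniform marginals, and the values of `epsilon` and `n_iters`.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def sinkhorn_plan(cost, epsilon=0.1, n_iters=200):
    """Entropy-regularized transport plan for a cost matrix (list of rows).

    Assumes uniform marginals over rows and columns (illustrative choice).
    """
    n, m = len(cost), len(cost[0])
    row_marginal = [1.0 / n] * n
    col_marginal = [1.0 / m] * m
    # Gibbs kernel: low cost gives a large kernel value.
    kernel = [[math.exp(-c / epsilon) for c in row] for row in cost]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        # Alternately rescale rows and columns to match the marginals.
        u = [row_marginal[i] / sum(kernel[i][j] * v[j] for j in range(m))
             for i in range(n)]
        v = [col_marginal[j] / sum(kernel[i][j] * u[i] for i in range(n))
             for j in range(m)]
    return [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]


def instance_transport_plan(feats_a, feats_b, epsilon=0.1, n_iters=200):
    """Transport plan between two sets of instance features.

    The cost is 1 - cosine similarity (an assumed choice, not the paper's).
    """
    cost = [[1.0 - cosine_similarity(a, b) for b in feats_b] for a in feats_a]
    return sinkhorn_plan(cost, epsilon, n_iters)


# Hypothetical 2-D features for two instances seen in two augmented views.
view_a = [[1.0, 0.0], [0.0, 1.0]]
view_b = [[1.0, 0.1], [0.1, 1.0]]
plan = instance_transport_plan(view_a, view_b)
for row in plan:
    print(row)
```

In this sketch, a large entry `plan[i][j]` indicates that instance `i` in one view is matched strongly to instance `j` in the other, so the plan can act as a soft measure of which instance pairs are similar and which should be discriminated. How UniVIP actually uses the plan in its loss is not described in the abstract.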
