Dense Semantic Contrast for Self-Supervised Visual Representation Learning

Xiaoni Li,Aoting Zhang,Wei Wang,Weiping Wang,Ning Jiang,Yifei Zhang,Haiying Wu,Yu Zhou

doi:10.1145/3474085.3475551

Abstract

Self-supervised representation learning for visual pre-training has achieved remarkable success with sample (instance or pixel) discrimination and semantics discovery of instance, whereas there still exists a non-negligible gap between pre-trained model and downstream dense prediction tasks. Concretely, these downstream tasks require more accurate representation, in other words, the pixels from the same object must belong to a shared semantic category, which is lacking in the previous methods. In this work, we present Dense Semantic Contrast (DSC) for modeling semantic category decision boundaries at a dense level to meet the requirement of these tasks. Furthermore, we propose a dense cross-image semantic contrastive learning framework for multi-granularity representation learning. Specially, we explicitly explore the semantic structure of the dataset by mining relations among pixels from different perspectives. For intra-image relation modeling, we discover pixel neighbors from multiple views. And for inter-image relations, we enforce pixel representation from the same semantic class to be more similar than the representation from different classes in one mini-batch. Experimental results show that our DSC model outperforms state-of-the-art methods when transferring to downstream dense prediction tasks, including object detection, semantic segmentation, and instance segmentation. Code will be made available.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dense Semantic Contrast for Self-Supervised Visual Representation Learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

CDEST: Class Distinguishability-Enhanced Self-Training Method for Adopting Pre-Trained Models to Downstream Remote Sensing Image Semantic Segmentation
Ming Zhang ... Ji Qi
Remote Sensing | VOL. 16
Ming Zhang, et. al.Ming Zhang ... Ji Qi
06 Apr 2024
Remote Sensing | VOL. 16

A Novel Multi-Task Self-Supervised Representation Learning Paradigm
Yinggang Li ... Qi Zhang
Control theory & applications | VOL. -
Yinggang Li, et. al.Yinggang Li ... Qi Zhang
28 May 2021
Control theory & applications | VOL. -

Conformer-Based Self-Supervised Learning For Non-Speech Audio Tasks
Sangeeta Srivastava ... Chunxi Liu
-
Sangeeta Srivastava, et. al.Sangeeta Srivastava ... Chunxi Liu
23 May 2022
23 May 2022

Video Representation Learning by Dense Predictive Coding
Tengda Han ... Weidi Xie
-
Tengda Han, et. al.Tengda Han ... Weidi Xie
01 Oct 2019
01 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dense Semantic Contrast for Self-Supervised Visual Representation Learning

Abstract

Talk to us

Similar Papers