Distilling Localization for Self-Supervised Representation Learning

Nanxuan Zhao,Zhirong Wu,Rynson W.H Lau,Stephen Lin

doi:10.1609/aaai.v35i12.17312

Abstract

Recent progress in contrastive learning has revolutionized unsupervised representation learning. Concretely, multiple views (augmentations) from the same image are encouraged to map to close embeddings, while views from different images are pulled apart.In this paper, through visualizing and diagnosing classification errors, we observe that current contrastive models are ineffective at localizing the foreground object, limiting their ability to extract discriminative high-level features. This is due to the fact that view generation process considers pixels in an image uniformly.To address this problem, we propose a data-driven approach for learning invariance to backgrounds. It first estimates foreground saliency in images and then creates augmentations by copy-and-pasting the foreground onto a variety of back-grounds. The learning still follows an instance discrimination approach, so that the representation is trained to disregard background content and focus on the foreground. We study a variety of saliency estimation methods, and find that most methods lead to improvements for contrastive learning. With this approach, significant performance is achieved for self-supervised learning on ImageNet classification, and also for object detection on PASCAL VOC and MSCOCO.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Distilling Localization for Self-Supervised Representation Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 12

Similar Papers

When Does Contrastive Visual Representation Learning Work?
Elijah Cole ... Xuan Yang
-
Elijah Cole, et. al.Elijah Cole ... Xuan Yang
01 Jun 2022
01 Jun 2022

Non-parametric Representation Learning with Kernels
Pascal Esser ... Maximilian Fleissner
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
Pascal Esser, et. al.Pascal Esser ... Maximilian Fleissner
24 Mar 2024
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38

A Novel Solution for EEG-based Emotion Recognition
Zhuofan Xie ... Mingzhang Zhou
-
Zhuofan Xie, et. al.Zhuofan Xie ... Mingzhang Zhou
13 Oct 2021
13 Oct 2021

Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He ... Haoqi Fan
-
Kaiming He, et. al.Kaiming He ... Haoqi Fan
01 Jun 2020
01 Jun 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Distilling Localization for Self-Supervised Representation Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence