Abstract

Humans can identify objects across a range of spatial transformations, such as changes in scale and viewpoint. This ability extends to novel objects seen only once at a single pose, a capacity sometimes referred to as online invariance. CNNs have been proposed as a compelling model of human vision, but their ability to identify objects across transformations is typically tested only on held-out samples of trained categories after extensive data augmentation. This paper assesses whether standard CNNs can support human-like online invariance by training models to recognize images of synthetic 3D objects that undergo several transformations: rotation, scaling, translation, brightness, contrast, and viewpoint. Through an analysis of the models’ internal representations, we show that standard supervised CNNs trained on transformed objects can acquire strong invariances on novel classes even when trained with as few as 50 objects taken from 10 classes. This result extended to a different dataset of photographs of real objects. We also show that these invariances can be acquired in a self-supervised way, by solving a same/different task. We suggest that this latter approach may be similar to how humans acquire invariances.
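
To make the training regime described above concrete, the sketch below is a minimal, hypothetical reconstruction (not the authors' code) of two ingredients the abstract mentions: 2D-image transformations (rotation, scaling, translation, brightness, contrast) composed with torchvision, and the construction of pairs for the self-supervised same/different task. Viewpoint changes require re-rendering the 3D objects and cannot be produced by 2D augmentation, so they are omitted here; all parameter values are illustrative assumptions.

```python
# Hypothetical sketch of the transformation regime and same/different pairing
# described in the abstract (not the authors' code). Augmentation ranges are
# illustrative assumptions; viewpoint changes would require 3D re-rendering.
import random
import torch
from torchvision import transforms

# Rotation, translation, scaling, brightness, and contrast augmentations.
augment = transforms.Compose([
    transforms.RandomRotation(degrees=180),                 # rotation
    transforms.RandomAffine(degrees=0,
                            translate=(0.2, 0.2),           # translation
                            scale=(0.5, 1.5)),              # scaling
    transforms.ColorJitter(brightness=0.5, contrast=0.5),   # brightness/contrast
    transforms.ToTensor(),
])

def make_same_different_pair(images):
    """Build one training pair for the same/different task: two transformed
    views of a single object ('same', label 1) or of two distinct objects
    ('different', label 0). `images` is a list of PIL images, one per object."""
    if random.random() < 0.5:
        img = random.choice(images)
        return augment(img), augment(img), torch.tensor(1.0)  # same object
    a, b = random.sample(images, 2)
    return augment(a), augment(b), torch.tensor(0.0)          # different objects
```

A siamese CNN trained with a binary loss on such pairs would implement one plausible version of the same/different objective; the abstract does not specify the exact architecture or loss used.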
