Relative stability toward diffeomorphisms indicates performance in deep nets*

*This article is an updated version of: Petrini L, Favero A, Geiger M and Wyart M 2021 Relative stability toward diffeomorphisms indicates performance in deep nets Advances in Neural Information Processing Systems vol 34 ed M Ranzato, A Beygelzimer, Y Dauphin, P S Liang and J Wortman Vaughan (New York: Curran

Leonardo Petrini, Alessandro Favero, Mario Geiger, Matthieu Wyart

doi:10.1088/1742-5468/ac98ac

Open Access

Abstract

Understanding why deep nets can classify data in large dimensions remains a challenge. It has been proposed that they do so by becoming stable to diffeomorphisms, yet existing empirical measurements suggest that this is often not the case. We revisit this question by defining a maximum-entropy distribution on diffeomorphisms, which allows us to study typical diffeomorphisms of a given norm. We confirm that stability toward diffeomorphisms does not strongly correlate with performance on benchmark data sets of images. By contrast, we find that the stability toward diffeomorphisms relative to that of generic transformations, $R_f$, correlates remarkably with the test error $\epsilon_t$. $R_f$ is of order unity at initialization but decreases by several decades during training for state-of-the-art architectures. For CIFAR10 and 15 known architectures we find $\epsilon_t \approx 0.2\sqrt{R_f}$, suggesting that obtaining a small $R_f$ is important to achieve good performance. We study how $R_f$ depends on the size of the training set and compare it to a simple model of invariant learning.
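To make the quantity $R_f$ concrete, here is a minimal NumPy sketch of how such a ratio could be estimated. It rests on assumptions that the abstract does not spell out: $R_f$ is taken to be the mean squared change of the network output under a small smooth deformation, divided by the mean squared change under isotropic noise of the same norm. The helper names (`smooth_displacement`, `deform`, `relative_stability`), the sine-mode displacement field, and the parameters `cutoff` and `temperature` are illustrative choices, not necessarily the authors' maximum-entropy construction.

```python
import numpy as np

def smooth_displacement(n, cutoff, temperature, rng):
    """Sample a smooth 2D displacement field on an n x n grid.

    Assumed form: a sum of low-frequency sine modes with Gaussian
    amplitudes whose variance decays as temperature / (i^2 + j^2).
    """
    u = (np.arange(n) + 0.5) / n
    field = np.zeros((2, n, n))
    for i in range(1, cutoff + 1):
        for j in range(1, cutoff + 1):
            if i * i + j * j > cutoff * cutoff:
                continue  # keep only modes below the frequency cutoff
            std = np.sqrt(temperature / (i * i + j * j))
            mode = np.outer(np.sin(np.pi * i * u), np.sin(np.pi * j * u))
            field += rng.normal(0.0, std, size=(2, 1, 1)) * mode
    return field  # displacement in units of the image side length

def deform(image, field):
    """Warp a square 2D image by the displacement field (bilinear interpolation)."""
    n = image.shape[0]
    rows, cols = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    r = np.clip(rows + n * field[0], 0, n - 1)
    c = np.clip(cols + n * field[1], 0, n - 1)
    r0, c0 = np.floor(r).astype(int), np.floor(c).astype(int)
    r1, c1 = np.minimum(r0 + 1, n - 1), np.minimum(c0 + 1, n - 1)
    wr, wc = r - r0, c - c0
    top = (1 - wc) * image[r0, c0] + wc * image[r0, c1]
    bottom = (1 - wc) * image[r1, c0] + wc * image[r1, c1]
    return (1 - wr) * top + wr * bottom

def relative_stability(f, images, cutoff=3, temperature=1e-3, seed=0):
    """Estimate R_f as (output change under deformation) / (output change under noise)."""
    rng = np.random.default_rng(seed)
    change_diffeo, change_noise = 0.0, 0.0
    for x in images:
        x_tau = deform(x, smooth_displacement(x.shape[0], cutoff, temperature, rng))
        delta = np.linalg.norm(x_tau - x)
        # Generic perturbation: isotropic noise rescaled to the same norm.
        eta = rng.normal(size=x.shape)
        eta *= delta / np.linalg.norm(eta)
        fx = f(x)
        change_diffeo += np.sum((f(x_tau) - fx) ** 2)
        change_noise += np.sum((f(x + eta) - fx) ** 2)
    return change_diffeo / change_noise
```

Here `f` is any callable that maps a 2D array to a NumPy array of outputs, for example a wrapper around a trained classifier. For the identity map `f = lambda x: x.ravel()` the two perturbations have equal norm by construction, so the ratio is 1; under the assumptions above, a value well below 1 would indicate that `f` is more stable to smooth deformations than to generic perturbations of the same size.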

Journal: Journal of Statistical Mechanics: Theory and Experiment
Publication Date: Nov 1, 2022
Citations: 1
License type: cc-by