Can neural networks benefit from objectives that encourage iterative convergent computations? A case study of ResNets and object classification.

Samuel Lippl, Benjamin Peters, Nikolaus Kriegeskorte

doi:10.1371/journal.pone.0293440

Abstract

Recent work has suggested that feedforward residual neural networks (ResNets) approximate iterative recurrent computations. Iterative computations are useful in many domains, so they might provide good solutions for neural networks to learn. However, principled methods for measuring and manipulating iterative convergence in neural networks remain lacking. Here we address this gap by 1) quantifying the degree to which ResNets learn iterative solutions and 2) introducing a regularization approach that encourages the learning of iterative solutions. Iterative methods are characterized by two properties: iteration and convergence. To quantify these properties, we define three indices of iterative convergence. Consistent with previous work, we show that, even though ResNets can express iterative solutions, they do not learn them when trained conventionally on computer-vision tasks. We then introduce regularizations to encourage iterative convergent computation and test whether this provides a useful inductive bias. To make the networks more iterative, we manipulate the degree of weight sharing across layers using soft gradient coupling. This new method provides a form of recurrence regularization and can interpolate smoothly between an ordinary ResNet and a "recurrent" ResNet (i.e., one that uses identical weights across layers and thus could be physically implemented with a recurrent network computing the successive stages iteratively across time). To make the networks more convergent, we impose a Lipschitz constraint on the residual functions using spectral normalization. The three indices of iterative convergence reveal that the gradient coupling and the Lipschitz constraint succeed at making the networks iterative and convergent, respectively.
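The abstract does not spell out the update rule behind soft gradient coupling. As a minimal sketch of one plausible reading, the code below blends each layer's gradient with the mean gradient across layers. The function name `couple_gradients`, the parameter `coupling`, and the blending formula are assumptions made for illustration and are not taken from the paper.

```python
from typing import List


def couple_gradients(grads: List[List[float]], coupling: float) -> List[List[float]]:
    """Blend each layer's gradient with the mean gradient across layers.

    Assumed rule (not from the paper): g_l <- (1 - c) * g_l + c * mean(g).
    coupling = 0.0 leaves the per-layer gradients untouched (ordinary ResNet).
    coupling = 1.0 gives every layer the same gradient, so layers that start
    with identical weights keep identical weights ("recurrent" ResNet).
    """
    if not 0.0 <= coupling <= 1.0:
        raise ValueError("coupling must lie in [0, 1]")
    n_layers = len(grads)
    n_params = len(grads[0])
    # Mean gradient over layers, computed separately for each parameter.
    mean = [sum(g[i] for g in grads) / n_layers for i in range(n_params)]
    return [
        [(1.0 - coupling) * g[i] + coupling * mean[i] for i in range(n_params)]
        for g in grads
    ]


# Hypothetical gradients for three residual blocks with two parameters each.
layer_grads = [[1.0, 0.0], [3.0, 2.0], [5.0, 4.0]]
uncoupled = couple_gradients(layer_grads, 0.0)
half = couple_gradients(layer_grads, 0.5)
shared = couple_gradients(layer_grads, 1.0)
```

Under this assumed rule, intermediate values of `coupling` pull the layers' updates toward each other without forcing them to be identical, which is one way to obtain the smooth interpolation the abstract describes.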
To showcase the practicality of our approach, we study how iterative convergence impacts generalization on standard visual recognition tasks (MNIST, CIFAR-10, CIFAR-100) and on challenging recognition tasks with partial occlusions (Digitclutter). We find that iterative convergent computation, in these tasks, does not provide a useful inductive bias for ResNets. Importantly, our approach may be useful for investigating other network architectures and tasks as well, and we hope that our study provides a useful starting point for investigating the broader question of whether iterative convergence can help neural networks generalize.
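The Lipschitz constraint mentioned in the abstract relies on spectral normalization. The sketch below illustrates the general idea on a single linear map: estimate the largest singular value by power iteration, then rescale the weights so that it does not exceed a chosen bound. The helper names, the bound of 0.9, and the choice to rescale only when the bound is exceeded are assumptions for illustration. The paper's residual functions also contain nonlinearities, which this sketch ignores.

```python
import math
from typing import List

Matrix = List[List[float]]


def matvec(w: Matrix, v: List[float]) -> List[float]:
    """Multiply matrix w by vector v."""
    return [sum(w_ij * v_j for w_ij, v_j in zip(row, v)) for row in w]


def transpose(w: Matrix) -> Matrix:
    return [list(col) for col in zip(*w)]


def norm(v: List[float]) -> float:
    return math.sqrt(sum(x * x for x in v))


def spectral_norm(w: Matrix, n_iter: int = 100) -> float:
    """Estimate the largest singular value of w by power iteration on w^T w."""
    wt = transpose(w)
    # Deterministic start vector for reproducibility; a random start is the
    # usual choice, since a fixed one can be orthogonal to the top direction.
    v = [1.0] * len(w[0])
    for _ in range(n_iter):
        u = matvec(wt, matvec(w, v))
        scale = norm(u)
        if scale == 0.0:
            return 0.0
        v = [x / scale for x in u]
    return norm(matvec(w, v))


def constrain_lipschitz(w: Matrix, bound: float) -> Matrix:
    """Rescale w so that its spectral norm is at most `bound`."""
    sigma = spectral_norm(w)
    if sigma <= bound:
        return [row[:] for row in w]  # already within the bound; return a copy
    factor = bound / sigma
    return [[factor * x for x in row] for row in w]


# Hypothetical 2x2 weight matrix whose largest singular value is 3.
weights = [[3.0, 0.0], [0.0, 1.0]]
constrained = constrain_lipschitz(weights, 0.9)
```

For a linear map, the spectral norm equals the Lipschitz constant with respect to the Euclidean norm, which is why bounding it bounds how strongly the map can amplify differences between inputs.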

Journal: PLOS ONE	Publication Date: Mar 21, 2024
License type: CC BY 4.0
