Recurrent Neural Networks with Small Weights Implement Definite Memory Machines

Barbara Hammer, Peter Tiňo

doi:10.1162/08997660360675080

Abstract

Recent experimental studies indicate that recurrent neural networks initialized with “small” weights are inherently biased toward definite memory machines (Tiňo, Čerňanský, & Beňušková, 2002a, 2002b). This article establishes a theoretical counterpart: the transition function of a recurrent network with small weights and a squashing activation function is a contraction. We prove that recurrent networks with a contractive transition function can be approximated arbitrarily well on input sequences of unbounded length by a definite memory machine. Conversely, every definite memory machine can be simulated by a recurrent network with a contractive transition function. Hence, initialization with small weights induces an architectural bias into learning with recurrent neural networks. This bias might have benefits from the point of view of statistical learning theory: it emphasizes one possible region of the weight space where generalization ability can be formally proved. It is well known that standard recurrent neural networks are not distribution-independent learnable in the probably approximately correct (PAC) sense if arbitrary precision and inputs are considered. We prove that recurrent networks with a contractive transition function and a fixed contraction parameter fulfill the so-called distribution-independent uniform convergence of empirical distances property and hence, unlike general recurrent networks, are distribution-independent PAC learnable.
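
The link between small weights and definite memory can be illustrated with a minimal numerical sketch. It is not the construction from the article. It assumes a single-layer tanh network, measures distances in the max norm, and uses hypothetical names (`make_weights`, `step`, `contraction_bound`, `final_state`) and an arbitrary weight scale of 0.1. Under these assumptions, the largest absolute row sum `c` of the recurrent weight matrix bounds the Lipschitz constant of the transition with respect to the state, because the derivative of tanh never exceeds 1. If `c < 1`, two input sequences that share their last `k` symbols lead to states that differ by at most `2 * c**k`, so the final state is determined, up to that error, by a window of the `k` most recent inputs.

```python
import math
import random


def make_weights(n_hidden, n_input, scale, rng):
    """Draw recurrent and input weights uniformly from [-scale, scale]."""
    w_rec = [[rng.uniform(-scale, scale) for _ in range(n_hidden)]
             for _ in range(n_hidden)]
    w_in = [[rng.uniform(-scale, scale) for _ in range(n_input)]
            for _ in range(n_hidden)]
    return w_rec, w_in


def step(state, x, w_rec, w_in):
    """One transition: new_state = tanh(W_rec @ state + W_in @ x)."""
    new_state = []
    for row_rec, row_in in zip(w_rec, w_in):
        pre = sum(w * s for w, s in zip(row_rec, state))
        pre += sum(w * xi for w, xi in zip(row_in, x))
        new_state.append(math.tanh(pre))
    return new_state


def contraction_bound(w_rec):
    """Largest absolute row sum of W_rec (assumed max-norm Lipschitz bound)."""
    return max(sum(abs(w) for w in row) for row in w_rec)


def final_state(inputs, w_rec, w_in):
    """Run the network from the zero state over a whole input sequence."""
    state = [0.0] * len(w_rec)
    for x in inputs:
        state = step(state, x, w_rec, w_in)
    return state


def max_abs_diff(a, b):
    """Max-norm distance between two state vectors."""
    return max(abs(p - q) for p, q in zip(a, b))


rng = random.Random(0)  # fixed seed so the sketch is reproducible
w_rec, w_in = make_weights(n_hidden=4, n_input=2, scale=0.1, rng=rng)
c = contraction_bound(w_rec)

# Two histories of different length that agree on their last k inputs.
k = 10
suffix = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(k)]
prefix_a = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(50)]
prefix_b = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(7)]

gap = max_abs_diff(final_state(prefix_a + suffix, w_rec, w_in),
                   final_state(prefix_b + suffix, w_rec, w_in))
print(f"contraction bound c = {c:.3f}")
print(f"state gap = {gap:.2e}, bound 2*c**k = {2 * c**k:.2e}")
```

The factor 2 in the bound comes from the fact that tanh states lie in [-1, 1], so two states can differ by at most 2 in any coordinate before the shared suffix begins. Increasing `scale` until `c` exceeds 1 removes the guarantee, which is consistent with the abstract's point that the bias is tied to small weights.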

Journal: Neural Computation
Publication Date: Aug 1, 2003
Citations: 96

Similar Papers

Large scale recurrent neural network on GPU
Boxun Li ... Bo Huang
01 Jul 2014

Editor's evaluation: Neural population dynamics of computing with synaptic modulations
Gianluigi Mongillo
08 Jan 2023

Author response: Neural population dynamics of computing with synaptic modulations
Kyle Aitken ... Stefan Mihalas
10 Feb 2023

Decision letter: Neural population dynamics of computing with synaptic modulations
Omri Barak ... Joshua I Gold
08 Jan 2023
