Abstract

Neural networks need the right representations of input data to learn. Here we ask how gradient-based learning shapes a fundamental property of representations in recurrent neural networks (RNNs)—their dimensionality. Through simulations and mathematical analysis, we show how gradient descent can lead RNNs to compress the dimensionality of their representations in a way that matches task demands during training while supporting generalization to unseen examples. This can require an expansion of dimensionality in early timesteps and compression in later ones, and strongly chaotic RNNs appear particularly adept at learning this balance. Beyond helping to elucidate the power of appropriately initialized artificial RNNs, this fact has implications for neurobiology as well. Neural circuits in the brain reveal both high variability associated with chaos and low-dimensional dynamical structures. Taken together, our findings show how simple gradient-based learning rules lead neural networks to solve tasks with robust representations that generalize to new cases.
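To make the abstract's central quantity concrete, the sketch below (Python/NumPy, not the authors' code) illustrates one standard way to measure the dimensionality of an RNN's hidden representations, the participation ratio of the state covariance spectrum, together with one standard recipe for a strongly chaotic vanilla RNN: recurrent weights drawn with gain g > 1. The network size, gain, and input setup are illustrative assumptions, not parameters from the paper.

```python
# Minimal sketch, assuming a vanilla tanh RNN and the participation ratio
# as the dimensionality measure; parameter values are illustrative only.
import numpy as np

def participation_ratio(states):
    """Effective dimensionality of a (trials x units) matrix of hidden states."""
    centered = states - states.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / centered.shape[0]
    eig = np.clip(np.linalg.eigvalsh(cov), 0.0, None)
    return eig.sum() ** 2 / (eig ** 2).sum()

rng = np.random.default_rng(0)
n_units, n_trials, n_steps, g = 200, 500, 20, 2.0    # g > 1 -> chaotic regime (assumption)
W = rng.normal(0.0, g / np.sqrt(n_units), size=(n_units, n_units))   # recurrent weights
W_in = rng.normal(0.0, 1.0 / np.sqrt(n_units), size=(n_units, 2))    # input weights

x = rng.normal(size=(n_trials, 2))        # static 2-D input per trial
h = np.zeros((n_trials, n_units))
for t in range(n_steps):
    h = np.tanh(h @ W.T + x @ W_in.T)     # vanilla RNN state update
    print(f"t={t:2d}  participation ratio = {participation_ratio(h):.1f}")
```

Tracking this quantity across timesteps, before and after gradient-based training, is one way to quantify the early-timestep expansion and later-timestep compression described above.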
