The expressivity and training of deep neural networks: Toward the edge of chaos?

Gege Zhang,Gangwei Li,Weining Shen,Weidong Zhang

doi:10.1016/j.neucom.2019.12.044

Gege Zhang, Gangwei Li + Show 2 more

Open Access

https://doi.org/10.1016/j.neucom.2019.12.044

Copy DOI

Abstract

Expressivity is one of the most significant issues in assessing neural networks. In this paper, we provide a quantitative analysis of the expressivity for the deep neural network (DNN) from its dynamic model, where the Hilbert space is employed to analyze the convergence and criticality. We study the feature mapping of several widely used activation functions obtained by Hermite polynomials, and find sharp declines or even saddle points in the feature space, which stagnate the information transfer in DNNs. We then present a new activation function design based on the Hermite polynomials for better utilization of spatial representation. Moreover, we analyze the information transfer of DNNs, emphasizing the convergence problem caused by the mismatch between input and topological structure. We also study the effects of input perturbations and regularization operators on critical expressivity. Our theoretical analysis reveals that DNNs use spatial domains for information representation and evolve to the edge of chaos as depth increases. In actual training, whether a particular network can ultimately arrive the edge of chaos depends on its ability to overcome convergence and pass information to the required network depth. Finally, we demonstrate the empirical performance of the proposed hypothesis via multivariate time series prediction and image classification examples.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Neurocomputing	Publication Date: Dec 26, 2019
Citations: 5	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

The expressivity and training of deep neural networks: Toward the edge of chaos?

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Similar Papers

Neuroevolution in Deep Neural Networks: Current Trends and Future Challenges
Edgar Galvan ... Peter Mooney
IEEE Transactions on Artificial Intelligence | VOL. 2
Edgar Galvan, et. al.Edgar Galvan ... Peter Mooney
04 May 2021
IEEE Transactions on Artificial Intelligence | VOL. 2

A convergence analysis of Nesterov’s accelerated gradient method in training deep linear neural networks
Xin Liu ... Zhisong Pan
Information Sciences | VOL. 612
Xin Liu, et. al.Xin Liu ... Zhisong Pan
05 Sep 2022
Information Sciences | VOL. 612

HNPU: An Adaptive DNN Training Processor Utilizing Stochastic Dynamic Fixed-Point and Active Bit-Precision Searching
Donghyeon Han ... Dongseok Im
IEEE Journal of Solid-State Circuits | VOL. 56
Donghyeon Han, et. al.Donghyeon Han ... Dongseok Im
24 Mar 2021
IEEE Journal of Solid-State Circuits | VOL. 56

A Framework for Distributed Deep Neural Network Training with Heterogeneous Computing Platforms
Bontak Gu ... Young Geun Kim
-
Bontak Gu, et. al.Bontak Gu ... Young Geun Kim
01 Dec 2019
01 Dec 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The expressivity and training of deep neural networks: Toward the edge of chaos?

Abstract

Talk to us

Similar Papers

More From: Neurocomputing