Convergence of deep ReLU networks

Yuesheng Xu, Haizhang Zhang

doi:10.1016/j.neucom.2023.127174

Abstract

We study the convergence of deep neural networks with the popular ReLU activation function as the depth of the networks tends to infinity. To this end, we introduce the notions of activation domains and activation matrices of a ReLU network. By replacing each application of the ReLU activation function with multiplication by an activation matrix on the corresponding activation domain, we obtain an explicit expression for the ReLU network. We then identify convergence of ReLU networks with convergence of a class of infinite products of matrices, and we study sufficient and necessary conditions for these infinite products to converge. As a result, we establish necessary conditions for ReLU networks to converge: the sequence of weight matrices must converge to the identity matrix, and the sequence of bias vectors must converge to zero, as the depth of the networks increases to infinity. Moreover, we obtain sufficient conditions, in terms of the weight matrices and bias vectors at the hidden layers, for pointwise convergence of deep ReLU networks. These results provide mathematical insight into the convergence of deep neural networks. Experiments are conducted to verify the results and to illustrate their potential usefulness in the initialization of deep neural networks.
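The central idea of the abstract, that applying ReLU to a vector is the same as multiplying it by a diagonal 0/1 "activation matrix" determined by the signs of its entries, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names are hypothetical, and the specific choice W_n = I + (eps / n^2) E with b_n = (eps / n^2) 1 (E the all-ones matrix) is only one example of weights tending to the identity and biases tending to zero. It is not claimed to be the paper's sufficient condition.

```python
# Illustrative sketch only: names and the choice of weights are hypothetical.

def relu(v):
    """Componentwise ReLU of a vector given as a list of floats."""
    return [max(0.0, t) for t in v]


def matvec(A, v):
    """Product of a matrix (list of rows) with a vector."""
    return [sum(a * t for a, t in zip(row, v)) for row in A]


def activation_matrix(v):
    """Diagonal 0/1 matrix D with D[i][i] = 1 exactly where v[i] > 0.

    On the region where the sign pattern of v is fixed, relu(v) equals D v.
    """
    n = len(v)
    return [[1.0 if (i == j and v[i] > 0) else 0.0 for j in range(n)]
            for i in range(n)]


def layer(W, b, x):
    """One hidden layer: relu(W x + b)."""
    return relu([s + t for s, t in zip(matvec(W, x), b)])


def deep_network(x, depth, eps=0.1):
    """Compose `depth` layers with the assumed weights and biases.

    Layer n uses W_n = I + (eps / n**2) * E and b_n = (eps / n**2) * 1,
    so W_n tends to the identity and b_n tends to zero as n grows.
    """
    d = len(x)
    for n in range(1, depth + 1):
        s = eps / n ** 2
        W = [[(1.0 if i == j else 0.0) + s for j in range(d)] for i in range(d)]
        b = [s] * d
        x = layer(W, b, x)
    return x


if __name__ == "__main__":
    z = [1.5, -2.0, 0.0, 3.0]
    print(relu(z), matvec(activation_matrix(z), z))
    for depth in (50, 100, 200, 400):
        print(depth, deep_network([1.0, -0.5], depth))
```

Running the script prints the network output at several depths, so one can check by eye whether the values settle for this particular choice of weights and biases. Replacing the `eps / n ** 2` scaling with one that does not tend to zero is a quick way to see the necessary conditions from the abstract fail.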

Journal: Neurocomputing
Publication Date: Dec 28, 2023
Citations: 6
