Analysis of the rate of convergence of fully connected deep neural network regression estimates with smooth activation function

Sophie Langer

doi:10.1016/j.jmva.2020.104695

Abstract

This article contributes to the current statistical theory of deep neural networks (DNNs). It was shown that DNNs are able to circumvent the so-called curse of dimensionality in case that suitable restrictions on the structure of the regression function hold. In most of those results the tuning parameter is the sparsity of the network, which describes the number of non-zero weights in the network. This constraint seemed to be the key factor for the good rate of convergence results. Recently, the assumption was disproved. In particular, it was shown that simple fully connected DNNs can achieve the same rate of convergence. Those fully connected DNNs are based on the unbounded ReLU activation function. In this article we extend the results to smooth activation functions, i.e., to the sigmoid activation function. It is shown that estimators based on fully connected DNNs with sigmoid activation function also achieve the minimax rates of convergence (up to lnn-factors). In our result the number of hidden layers is fixed, the number of neurons per layer tends to infinity for sample size tending to infinity and a bound for the weights in the network is given.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analysis of the rate of convergence of fully connected deep neural network regression estimates with smooth activation function

Abstract

Talk to us

Similar Papers

More From: Journal of Multivariate Analysis

Lead the way for us

Journal: Journal of Multivariate Analysis	Publication Date: Nov 10, 2020
Citations: 11

Similar Papers

Evaluation of Sigmoid and ReLU Activation Functions Using Asymptotic Method
-
Network and Complex Systems | VOL. -
--
01 Jun 2022
Network and Complex Systems | VOL. -

An investigation on deep learning with beta stabilizer
Qi Liu ... Kai Yu
-
Qi Liu, et. al.Qi Liu ... Kai Yu
01 Nov 2016
01 Nov 2016

Fast Activation Function Approach for Deep Learning Based Online Anomaly Intrusion Detection
Khaled Alrawashdeh ... Carla Purdy
-
Khaled Alrawashdeh, et. al.Khaled Alrawashdeh ... Carla Purdy
01 May 2018
01 May 2018

Stable recovery of entangled weights: Towards robust identification of deep neural networks from minimal samples
Christian Fiedler ... Timo Klock
Applied and Computational Harmonic Analysis | VOL. 62
Christian Fiedler, et. al.Christian Fiedler ... Timo Klock
01 Jan 2023
Applied and Computational Harmonic Analysis | VOL. 62

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of the rate of convergence of fully connected deep neural network regression estimates with smooth activation function

Abstract

Talk to us

Similar Papers

More From: Journal of Multivariate Analysis