An analysis of training and generalization errors in shallow and deep networks

H.N Mhaskar,T Poggio

doi:10.1016/j.neunet.2019.08.028

H.N Mhaskar, T Poggio

Open Access

https://doi.org/10.1016/j.neunet.2019.08.028

Copy DOI

Abstract

This paper is motivated by an open problem around deep networks, namely, the apparent absence of over-fitting despite large over-parametrization which allows perfect fitting of the training data. In this paper, we analyze this phenomenon in the case of regression problems when each unit evaluates a periodic activation function. We argue that the minimal expected value of the square loss is inappropriate to measure the generalization error in approximation of compositional functions in order to take full advantage of the compositional structure. Instead, we measure the generalization error in the sense of maximum loss, and sometimes, as a pointwise error. We give estimates on exactly how many parameters ensure both zero training error as well as a good generalization error. We prove that a solution of a regularization problem is guaranteed to yield a good training error as well as a good generalization error and estimate how much error to expect at which test data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Neural Networks	Publication Date: Sep 7, 2019
Citations: 15	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

An analysis of training and generalization errors in shallow and deep networks

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Similar Papers

Going Deeper, Generalizing Better: An Information-Theoretic View for Deep Learning.
Jingwei Zhang ... Tongliang Liu
IEEE transactions on neural networks and learning systems | VOL. PP
Jingwei Zhang, et. al.Jingwei Zhang ... Tongliang Liu
01 Jan 2024
IEEE transactions on neural networks and learning systems | VOL. PP

Machine learning applied to estimating broadband source signature characteristics in a shallow ocean environment
David P Knobles ... Mohsen Badiey
The Journal of the Acoustical Society of America | VOL. 141
David P Knobles, et. al.David P Knobles ... Mohsen Badiey
01 May 2017
The Journal of the Acoustical Society of America | VOL. 141

A formula of equations of states in singular learning machines
Sumio Watanabe
-
Sumio WatanabeSumio Watanabe
01 Jun 2008
01 Jun 2008

Training error, generalization error and learning curves in neural learning
S.-I Amari
-
S.-I AmariS.-I Amari
20 Nov 1995
20 Nov 1995

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An analysis of training and generalization errors in shallow and deep networks

Abstract

Talk to us

Similar Papers

More From: Neural Networks