Abstract

In this paper we study lower bounds on the generalization error of models derived from multi-layer neural networks, in the regime where the size of the layers is commensurate with the number of samples in the training data. We derive explicit generalization lower bounds for general biased estimators in the case of two-layer networks. For a linear activation function the bound is asymptotically tight. In the nonlinear case, we compare our bounds with an empirical study of the stochastic gradient descent algorithm. In addition, we derive bounds for unbiased estimators, which show that the latter have unacceptable performance for truly nonlinear networks. The analysis relies on elements from the theory of large random matrices.
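A minimal sketch (not from the paper) of the setting described above, under assumed details: a teacher-student setup in which a two-layer student network, whose hidden-layer width is of the same order as the number of training samples, is trained by plain stochastic gradient descent on squared loss, and its generalization error is estimated on held-out data. All architectural choices, hyperparameters, and variable names here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Dimensions chosen so that the layer size is commensurate with the sample count.
d_in, width, n_train, n_test = 50, 100, 200, 2000

# Fixed nonlinear two-layer "teacher" generating the labels (assumed data model).
W_t = rng.normal(size=(d_in, width)) / np.sqrt(d_in)
a_t = rng.normal(size=(width, 1)) / np.sqrt(width)

def teacher(X):
    return np.tanh(X @ W_t) @ a_t

X_tr, X_te = rng.normal(size=(n_train, d_in)), rng.normal(size=(n_test, d_in))
y_tr, y_te = teacher(X_tr), teacher(X_te)

# Two-layer student y = tanh(X W) a, trained with mini-batch SGD on squared loss.
W = rng.normal(size=(d_in, width)) / np.sqrt(d_in)
a = rng.normal(size=(width, 1)) / np.sqrt(width)
lr, epochs, batch = 0.05, 200, 20

for _ in range(epochs):
    for idx in np.array_split(rng.permutation(n_train), n_train // batch):
        Xb, yb = X_tr[idx], y_tr[idx]
        H = np.tanh(Xb @ W)                       # hidden-layer activations
        err = H @ a - yb                          # residuals on the mini-batch
        grad_a = H.T @ err / len(idx)             # gradient w.r.t. output weights
        grad_W = Xb.T @ ((err @ a.T) * (1 - H**2)) / len(idx)  # w.r.t. first layer
        a -= lr * grad_a
        W -= lr * grad_W

# Empirical generalization (test) error of the SGD-trained student.
test_err = np.mean((np.tanh(X_te @ W) @ a - y_te) ** 2)
print(f"empirical generalization error: {test_err:.4f}")
```

Varying `n_train` relative to `width` in this sketch gives the kind of empirical SGD curve that the paper compares against its lower bounds in the nonlinear case.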
