Abstract

We establish a scale separation of Kolmogorov width type between subspaces of a given Banach space under the condition that a sequence of linear maps converges much faster on one of the subspaces. The general technique is then applied to show that reproducing kernel Hilbert spaces are poor \(L^{2}\)-approximators for the class of two-layer neural networks in high dimension, and that multi-layer networks with small path norm are poor approximators for certain Lipschitz functions, also in the \(L^{2}\)-topology.
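For reference, the notion underlying the scale-separation statement is the classical Kolmogorov \(n\)-width; the standard definition (the paper's precise variant may differ) is

\[
  d_n(K, X) \;=\; \inf_{\substack{X_n \subset X \\ \dim X_n \le n}} \; \sup_{f \in K} \; \inf_{g \in X_n} \| f - g \|_X ,
\]

i.e. the best worst-case error achievable when approximating a class \(K \subset X\) by an \(n\)-dimensional linear subspace of \(X\).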
