Abstract

We establish $L^{\infty } $ and $L^{2} $ error bounds for functions of many variables that are approximated by linear combinations of rectified linear unit (ReLU) and squared ReLU ridge functions with $\ell ^{1} $ and $\ell ^{0} $ controls on their inner and outer parameters. With the squared ReLU ridge function, we show that the $L^{2} $ approximation error is inversely proportional to the inner layer $\ell ^{0} $ sparsity and it need only be sublinear in the outer layer $\ell ^{0} $ sparsity. Our constructions are obtained using a variant of the Maurey–Jones–Barron probabilistic method, which can be interpreted as either stratified sampling with proportionate allocation or two-stage cluster sampling. We also provide companion error lower bounds that reveal near optimality of our constructions. Despite the sparsity assumptions, we showcase the richness and flexibility of these ridge combinations by defining a large family of functions, in terms of certain spectral conditions, that are particularly well approximated by them.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.