Abstract

We discuss the expressive power of neural networks which use the non-smooth ReLU activation function ϱ(x) = max{0, x} by analyzing the approximation theoretic properties of such networks. The existing results mainly fall into two categories: approximation using ReLU networks with a fixed depth, or using ReLU networks whose depth increases with the approximation accuracy. After reviewing these findings, we show that the results concerning networks with fixed depth—which up to now only consider approximation in Lp(λ) for the Lebesgue measure λ—can be generalized to approximation in Lp(µ), for any finite Borel measure µ. In particular, the generalized results apply in the usual setting of statistical learning theory, where one is interested in approximation in L2(ℙ), with the probability measure ℙ describing the distribution of the data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call