Abstract

Neural networks are popular and useful in many fields, but they have the problem of giving high confidence responses for examples that are away from the training data. This makes the neural networks very confident in their prediction while making gross mistakes, thus limiting their reliability for safety-critical applications such as autonomous driving and space exploration, etc. This paper introduces a novel neuron generalization that has the standard dot-product-based neuron and the radial basis function (RBF) neuron as two extreme cases of a shape parameter. Using a rectified linear unit (ReLU) as the activation function results in a novel neuron that has compact support, which means its output is zero outside a bounded domain. To address the difficulties in training the proposed neural network, it introduces a novel training method that takes a pretrained standard neural network that is fine-tuned while gradually increasing the shape parameter to the desired value. The theoretical findings of the paper are bound on the gradient of the proposed neuron and proof that a neural network with such neurons has the universal approximation property. This means that the network can approximate any continuous and integrable function with an arbitrary degree of accuracy. The experimental findings on standard benchmark datasets show that the proposed approach has smaller test errors than the state-of-the-art competing methods and outperforms the competing methods in detecting out-of-distribution samples on two out of three datasets.

Highlights

  • Neural networks have proven to be extremely useful in many applications, including object detection, speech and handwriting recognition, and medical imaging, etc

  • It is of interest to see whether this Lipschitz condition is satisfied for the proposed compact support neural network (CSNN)

  • CSNN-F methods obtain the best results on MNIST and CIFAR-100, and adversarial confidence enhanced training (ACET) and SNGP obtain the best results on CIFAR-10

Read more

Summary

Introduction

Neural networks have proven to be extremely useful in many applications, including object detection, speech and handwriting recognition, and medical imaging, etc. They have become the state-of-the-art in these applications, and in some cases they even surpass human performance. An explanation of why this is happening for the rectified linear unit (ReLU) based networks has been given in [3]. This issue is very important for safety-critical applications such as space exploration, autonomous driving, and medical diagnosis, etc. In these cases it is important that the system knows when the input data are outside its nominal range to alert the human (e.g., driver for autonomous driving or radiologist for medical diagnostic) to take charge in such cases

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call