Abstract
Deep Neural Networks have become the tool of choice for Machine Learning practitioners today. They have been successfully applied to a large class of learning problems in both industry and academia, with applications in fields such as Computer Vision, Natural Language Processing, Big Data Analytics and Bioinformatics. One important aspect of designing a neural network is the choice of the activation function used at the neurons of the different layers. Activation functions introduce non-linearity into the neural network model so that the network can progressively learn more effective feature representations. Several different activation functions have been used in the literature. However, Linear, Sigmoid, Tanh and ReLU are the most commonly used, and they are often selected empirically during the network design phase rather than through a proper data-driven process. In this work we empirically study the problem of generalizing the single-output ReLU activation by parameterizing it, so that data-driven methods can be used to select among variations of the single-output ReLU. We call this class of activations the Generalized ReLU Activations. Special cases include ReLU itself as well as variations such as the Leaky ReLU that have already been studied in the literature. We report results of extensive experiments on the well-known MNIST handwritten digit dataset.
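The abstract does not spell out the exact parameterization, but a minimal sketch of the idea is a piecewise-linear activation with separate slopes on the negative and positive sides, so that standard ReLU and Leaky ReLU fall out as special settings of the parameters. The parameter names alpha and beta below are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def generalized_relu(x, alpha=0.0, beta=1.0):
    """Illustrative parameterized ReLU: beta * x for x >= 0, alpha * x for x < 0.

    alpha=0.0, beta=1.0 recovers the standard ReLU; a small positive alpha
    (e.g. 0.01) with beta=1.0 recovers the Leaky ReLU. The parameters could
    then be tuned or learned in a data-driven way rather than fixed by hand.
    """
    x = np.asarray(x, dtype=float)
    return np.where(x >= 0, beta * x, alpha * x)

# The same input passed through three members of the family:
x = np.array([-2.0, -0.5, 0.0, 1.5])
print(generalized_relu(x))                      # standard ReLU
print(generalized_relu(x, alpha=0.01))          # Leaky ReLU
print(generalized_relu(x, alpha=0.1, beta=0.8)) # another member of the family
```

Under this reading, selecting an activation "in a data-driven way" amounts to treating alpha and beta as hyperparameters (or learnable parameters) rather than committing to ReLU or Leaky ReLU up front.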