Abstract
Deep Neural Networks (DNNs) have become the tool of choice for machine learning practitioners today. One important aspect of designing a neural network is the choice of the activation function to be used at the neurons of the different layers. In this work, we introduce a four-output activation function called the Reflected Rectified Linear Unit (RReLU), which considers both a feature and its negation during computation. Our activation function is “sparse”, in that only two of the four possible outputs are active at a given time. We test our activation function on the standard MNIST and CIFAR-10 datasets, which are classification problems, as well as on a novel Computational Fluid Dynamics (CFD) dataset posed as a regression problem. On the baseline network for the MNIST dataset, which has two hidden layers, our activation function improves the validation accuracy from 0.09 to 0.97 compared to the well-known ReLU activation. For the CIFAR-10 dataset, we use a deep baseline network that achieves 0.78 validation accuracy in 20 epochs but overfits the data. Using the RReLU activation, we can achieve the same accuracy without overfitting the data. For the CFD dataset, we show that the RReLU activation can reduce the number of epochs from 100 (using ReLU) to 10 while obtaining the same level of performance.
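As a rough illustration only, and not the paper's exact formulation, a four-output unit that considers a feature and its negation, with exactly two of the four outputs active at a time, could be sketched in NumPy as follows; the function name and the specific output ordering here are assumptions for illustration.

import numpy as np

def reflected_relu(x):
    """Illustrative sketch of a four-output activation (not the authors' definition).

    For each input feature x, emit (relu(x), -relu(x), relu(-x), -relu(-x)).
    For any nonzero x, exactly two of the four outputs are nonzero, which matches
    the sparsity described in the abstract: only two of four outputs are active.
    """
    pos = np.maximum(x, 0.0)   # branch active when x > 0
    neg = np.maximum(-x, 0.0)  # "reflected" branch, active when x < 0
    return np.concatenate([pos, -pos, neg, -neg], axis=-1)

# Example usage on a small feature vector:
x = np.array([1.5, -2.0, 0.0])
print(reflected_relu(x))  # 4 outputs per input feature, at most 2 nonzero per feature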
Highlights
Activation functions have been studied for a very long time, since the early days of neural networks.
This paper is organized as follows: In Section 2, we present a detailed study of the various activation functions that have been studied in the context of neural networks; in Section 3, we formally introduce the four-output variation of the Rectified Linear Unit (ReLU) activation and study some of its properties.
If a single neuron does nothing but pass its input on to its output, the output of the neuron is w^T x; the operation performed at the neuron is then a simple aggregation, and if this happens at each node of the hidden layer, the neural network becomes a system that outputs a linear combination of its inputs, which amounts to very little learning.
Summary
Activation functions have been studied for a very long time, since the early days of neural networks. Suppose the input to a shallow (single-hidden-layer) neural network is given by the vector x. If a neuron does nothing but pass this input on to its output, the output of the neuron is w^T x; the operation performed at the neuron is then a simple aggregation, and if this happens at each node of the hidden layer, the neural network becomes a system that outputs a linear combination of its inputs, which amounts to very little learning. Note that if the activation function is the identity mapping, the output of a node is a linear combination of its input values. This is known as a linear activation and, as seen above, it does not add much.
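To make this point concrete, the short NumPy sketch below (not from the paper) shows that two layers with identity activation collapse into a single linear map, so the stacked network learns nothing beyond a linear combination of its inputs; the weight shapes are arbitrary choices for the example.

import numpy as np

rng = np.random.default_rng(0)

# Two "hidden layers" with identity activation, i.e. no nonlinearity.
W1 = rng.standard_normal((4, 3))   # first-layer weights
W2 = rng.standard_normal((2, 4))   # second-layer weights
x = rng.standard_normal(3)         # input vector

# Forward pass with identity activation at each layer.
two_layer_output = W2 @ (W1 @ x)

# The same mapping expressed as a single linear layer with weights W2 @ W1.
single_layer_output = (W2 @ W1) @ x

print(np.allclose(two_layer_output, single_layer_output))  # True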