Abstract

Activation functions are important components of Convolutional Neural Networks (CNNs) that introduce nonlinearity into the model, enabling it to compute complex functions. Different activation functions are used with CNNs across applications, and an effective choice of activation function yields better results and improves model performance. In this study, four widely used activation functions are analyzed and evaluated to determine their efficiency in terms of model accuracy: the sigmoid, hyperbolic tangent (tanh), rectified linear unit (ReLU), and exponential linear unit (ELU), all of which have been used in most successful models. A CNN model has been implemented on the MNIST dataset to perform the analysis, and the experiments have been run on an Nvidia 940MX GPU to accelerate training and testing of the CNN model. It has been observed that ReLU, the most popular activation function, performs better than sigmoid and tanh, and that the more recent ELU performs better than ReLU.
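For context, the four activation functions compared in the study are defined as sigmoid(x) = 1/(1 + e^(-x)), tanh(x) = (e^x - e^(-x))/(e^x + e^(-x)), ReLU(x) = max(0, x), and ELU(x) = x for x > 0 and a(e^x - 1) otherwise (with scale parameter a). The abstract does not give the exact architecture or training setup; the sketch below is only an illustrative Python/Keras setup (assumed, not the authors' code) that swaps the activation of a small CNN on MNIST to reproduce the style of comparison described.

    # Minimal sketch of the comparison described in the abstract.
    # The architecture and hyperparameters here are assumptions, not the paper's.
    import tensorflow as tf
    from tensorflow.keras import layers

    def build_cnn(activation: str) -> tf.keras.Model:
        """Small CNN whose activation is one of 'sigmoid', 'tanh', 'relu', 'elu'."""
        return tf.keras.Sequential([
            layers.Input(shape=(28, 28, 1)),
            layers.Conv2D(32, 3, activation=activation),
            layers.MaxPooling2D(),
            layers.Conv2D(64, 3, activation=activation),
            layers.MaxPooling2D(),
            layers.Flatten(),
            layers.Dense(128, activation=activation),
            layers.Dense(10, activation="softmax"),
        ])

    # Load MNIST and normalise pixel values to [0, 1].
    (x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
    x_train = x_train[..., None] / 255.0
    x_test = x_test[..., None] / 255.0

    # Train and evaluate one model per activation function.
    for act in ["sigmoid", "tanh", "relu", "elu"]:
        model = build_cnn(act)
        model.compile(optimizer="adam",
                      loss="sparse_categorical_crossentropy",
                      metrics=["accuracy"])
        model.fit(x_train, y_train, epochs=5, batch_size=128, verbose=0)
        _, acc = model.evaluate(x_test, y_test, verbose=0)
        print(f"{act}: test accuracy = {acc:.4f}")

Comparing the printed test accuracies mirrors the evaluation reported in the abstract, where ReLU outperforms sigmoid and tanh, and ELU outperforms ReLU.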
