Abstract

Non-linear activation functions are integral parts of deep neural architectures. Given the large and complex datasets that a neural network must process, its computational complexity and approximation capability can differ significantly depending on the activation function used. Parameterizing an activation function by introducing learnable parameters generally improves performance. Herein, a novel activation function called the Sinu-sigmoidal Linear Unit (SinLU) is proposed. SinLU is formulated as SinLU(x) = (x + a sin(bx)) · σ(x), where σ(x) is the sigmoid function. The proposed function incorporates a sine wave, enabling new functionalities over traditional linear unit activations. Two trainable parameters control the participation of the sinusoidal component in the function and help achieve an easily trainable, fast-converging activation. The performance of the proposed SinLU is compared against widely used activation functions such as ReLU, GELU and SiLU. We demonstrate the robustness of the proposed activation function by conducting experiments in a wide array of domains, using multiple types of neural network-based models on standard datasets. The use of a sine wave with trainable parameters results in better performance of SinLU than commonly used activation functions.
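To make the formulation concrete, a minimal PyTorch sketch of the stated formula is given below. The module name SinLU follows the paper, but the initial values of the trainable parameters a and b are an assumption for illustration, not taken from the paper.

```python
import torch
import torch.nn as nn

class SinLU(nn.Module):
    """Sinu-sigmoidal Linear Unit: SinLU(x) = (x + a*sin(b*x)) * sigmoid(x)."""
    def __init__(self, a_init: float = 1.0, b_init: float = 1.0):  # initial values assumed, not from the paper
        super().__init__()
        self.a = nn.Parameter(torch.tensor(a_init))  # trainable amplitude of the sine term
        self.b = nn.Parameter(torch.tensor(b_init))  # trainable frequency of the sine term

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return (x + self.a * torch.sin(self.b * x)) * torch.sigmoid(x)
```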

Highlights

  • In the data-driven realm of deep learning, neural networks (NNs) along with non-linear activation functions have revolutionized multiple domains, from images and videos to natural language

  • We demonstrated its efficiency and robustness, and found that deep learning models with this novel activation function outperform models with other popular activation functions across domains such as image classification and sequential data classification

  • In a recent study by Liu et al. [24], the authors proposed the Tanh exponential activation function (TanhExp), which improves the performance of lightweight or mobile neural networks used for real-time computer vision tasks and contains fewer parameters than usual

Summary

Introduction

In the data-driven realm of deep learning, neural networks (NNs) along with non-linear activation functions have revolutionized multiple domains, from images and videos to natural language. The non-linearity of activation functions allows NNs to capture the complex nature of the data by creating deeper connections among the nodes of NNs. The state-of-the-art architectures, whether the classic convolutional neural network (CNN) or the recent transformer [1], have all evolved from connected layers of artificial neurons, and all of these heavy architectures have activation functions associated with their components. In regard to defining new activation functions, one promising approach is to learn a better approximation of the activation itself, for instance by introducing learnable parameters into an activation function that can be trained individually or together with the model through backpropagation. Keeping the above facts in mind, in this paper we propose a new non-linear trainable activation function, called the Sinu-sigmoidal Linear Unit (SinLU).
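As an illustration of how such learnable parameters are trained jointly with the network, the hedged sketch below reuses the SinLU module from the earlier snippet inside a toy model; the layer sizes, optimizer, and dummy data are arbitrary placeholders, not the paper's experimental setup.

```python
import torch
import torch.nn as nn

# Toy model reusing the SinLU module sketched earlier; sizes and optimizer are illustrative only.
model = nn.Sequential(nn.Linear(8, 16), SinLU(), nn.Linear(16, 2))
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)  # a and b are registered parameters, so they are optimized too

x = torch.randn(4, 8)               # dummy input batch
target = torch.randint(0, 2, (4,))  # dummy class labels
loss = nn.functional.cross_entropy(model(x), target)
loss.backward()                     # backpropagation reaches a and b as well as the layer weights
optimizer.step()

sinlu = model[1]
print(sinlu.a.grad, sinlu.b.grad)   # generally non-zero: the activation's shape is being learned
```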

Related Work
Formulation of SinLU
Properties of SinLU
Experimental Results and Discussion
Lightweight Neural Networks
Performance over Noise
Deeper Models Overfitting
CNN on MNIST-like Datasets
Transfer Learning
With Gradient Activation Function
Sequential Data
Conclusions