Abstract

Lightweight or mobile neural networks used for real-time computer vision tasks contain fewer parameters than standard networks, which constrains their performance. Herein, a novel activation function named the Tanh Exponential Activation Function (TanhExp) is proposed, which can significantly improve the performance of these networks on image classification tasks. TanhExp is defined as f(x) = x · tanh(e^x). The simplicity, efficiency, and robustness of TanhExp are demonstrated on various datasets and network models, and TanhExp outperforms its counterparts in both convergence speed and accuracy. Its behaviour also remains stable when noise is added and the dataset is altered. It is shown that, without increasing the size of the network, TanhExp can enhance the capacity of lightweight neural networks within only a few training epochs and with no extra parameters added.

Highlights

  • Lightweight neural networks, also known as mobile neural networks, are specially designed for real-time visual information processing

  • We propose the Tanh Exponential Activation Function (TanhExp), which combines the advantages of activation functions similar to the Rectified Linear Unit (ReLU) with those of non-piecewise activation functions

  • We introduce the Tanh Exponential Activation Function (TanhExp), defined in Eq. (3) as f(x) = x · tanh(e^x), where tanh denotes the hyperbolic tangent function tanh(x) = (e^x − e^(−x)) / (e^x + e^(−x)); a minimal implementation sketch follows below
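
The definition above translates almost directly into code. The following is a minimal sketch of TanhExp as a drop-in activation layer, assuming PyTorch as the framework (the paper does not prescribe one); the class name, the layer shapes, and the small usage block are illustrative choices, not taken from the original.

```python
import torch
import torch.nn as nn


class TanhExp(nn.Module):
    """TanhExp activation: f(x) = x * tanh(exp(x)) (Eq. (3))."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # For large positive x, tanh(exp(x)) saturates to 1, so f(x) ≈ x there.
        return x * torch.tanh(torch.exp(x))


if __name__ == "__main__":
    # Drop-in use inside a small convolutional block (shapes are illustrative).
    block = nn.Sequential(nn.Conv2d(3, 16, kernel_size=3, padding=1), TanhExp())
    out = block(torch.randn(1, 3, 32, 32))
    print(out.shape)  # torch.Size([1, 16, 32, 32])
```

Because the function is smooth and parameter-free, it can replace ReLU in an existing lightweight network without changing the model size.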

Summary

Introduction

Lightweight neural networks, also known as mobile neural networks, are specially designed for real-time visual information processing. They tune deep neural network architectures to strike an optimal balance between accuracy and performance, tailored for mobile and resource-limited environments [1]. These networks are necessary for computer vision tasks that require real-time computation [2,3,4,5]. Since the ability of a neural network to fit a non-linear function rests on its activation function, we consider that an effective activation function can boost the performance of these networks without sacrificing size or speed.

