A comparative performance analysis of different activation functions in LSTM networks for classification

Amir Farzad,Hoda Mashayekhi,Hamid Hassanpour

doi:10.1007/s00521-017-3210-6

Abstract

In recurrent neural networks such as the long short-term memory (LSTM), the sigmoid and hyperbolic tangent functions are commonly used as activation functions in the network units. Other activation functions developed for the neural networks are not thoroughly analyzed in LSTMs. While many researchers have adopted LSTM networks for classification tasks, no comprehensive study is available on the choice of activation functions for the gates in these networks. In this paper, we compare 23 different kinds of activation functions in a basic LSTM network with a single hidden layer. Performance of different activation functions and different number of LSTM blocks in the hidden layer are analyzed for classification of records in the IMDB, Movie Review, and MNIST data sets. The quantitative results on all data sets demonstrate that the least average error is achieved with the Elliott activation function and its modifications. Specifically, this family of functions exhibits better results than the sigmoid activation function which is popular in LSTM networks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A comparative performance analysis of different activation functions in LSTM networks for classification

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications

Lead the way for us

Journal: Neural Computing and Applications	Publication Date: Oct 19, 2017
Citations: 86

Similar Papers

Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network
Alex Sherstinsky
Physica D: Nonlinear Phenomena | VOL. 404
Alex SherstinskyAlex Sherstinsky
21 Jan 2020
Physica D: Nonlinear Phenomena | VOL. 404

Research and Application of Deformation Prediction Model for Deep Foundation Pit Based on LSTM
Hailin Li ... Xue Du
Wireless Communications and Mobile Computing | VOL. 2022
Hailin Li, et. al.Hailin Li ... Xue Du
06 Jul 2022
Wireless Communications and Mobile Computing | VOL. 2022

Evolving Recurrent Neural Network Controllers by Incremental Fitness Shaping
Kaan Akinci ... Andrew Philippides
-
Kaan Akinci, et. al.Kaan Akinci ... Andrew Philippides
01 Jan 2019
01 Jan 2019

Memristor-based LSTM network with in situ training and its applications
Xiaoyang Liu ... Donald C Wunsch Ii
Neural Networks | VOL. 131
Xiaoyang Liu, et. al.Xiaoyang Liu ... Donald C Wunsch Ii
04 Aug 2020
Neural Networks | VOL. 131

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A comparative performance analysis of different activation functions in LSTM networks for classification

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications