Abstract
Tanh is a sigmoidal activation function that suffers from the vanishing gradient problem, so researchers have proposed alternative functions such as the rectified linear unit (ReLU); however, these vanishing-proof functions introduce other problems, such as the bias shift problem and noise sensitivity. To overcome the vanishing gradient problem without introducing such side effects, we propose a new activation function named Rectified Linear Tanh (ReLTanh), constructed by improving the traditional Tanh. ReLTanh replaces Tanh’s saturated waveforms in the positive and negative inactive regions with two straight lines, whose slopes are given by Tanh’s derivatives at two learnable thresholds. The middle Tanh waveform provides ReLTanh with nonlinear fitting ability, while the linear parts relieve the vanishing gradient problem. Moreover, because the thresholds that determine the slopes of the linear parts are learnable, ReLTanh can adapt to variations in the inputs, helping to minimize the cost function and maximize data fitting performance. Mathematical derivations prove that ReLTanh diminishes the vanishing gradient problem and that its thresholds are trainable. To verify the practical feasibility and effectiveness of ReLTanh, fault diagnosis experiments on planetary gearboxes and rolling bearings are conducted with stacked autoencoder-based deep neural networks (SAE-based DNNs). ReLTanh successfully alleviates the vanishing gradient problem and learns faster, more stably, and more accurately than Tanh, which is consistent with the theoretical analysis. Additionally, ReLTanh surpasses other popular activation functions such as the ReLU family, Hexpo, and Swish, showing that ReLTanh has practical potential and research value.
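
The following is a minimal sketch of the piecewise form described above: ordinary Tanh between two thresholds, continued outside that interval by straight lines whose slopes equal Tanh's derivative at the corresponding threshold. The parameter names lambda_pos and lambda_neg and their default values are illustrative assumptions; in the paper the thresholds are learnable parameters updated during training.

```python
import numpy as np

def reltanh(x, lambda_pos=0.0, lambda_neg=-1.5):
    """Illustrative ReLTanh: tanh inside [lambda_neg, lambda_pos],
    linear extensions with non-zero slope outside (threshold values
    here are placeholders, not the learned ones from the paper)."""
    x = np.asarray(x, dtype=float)
    # tanh'(z) = 1 - tanh(z)**2 gives the slope of each linear extension
    slope_pos = 1.0 - np.tanh(lambda_pos) ** 2
    slope_neg = 1.0 - np.tanh(lambda_neg) ** 2
    upper = np.tanh(lambda_pos) + slope_pos * (x - lambda_pos)  # x > lambda_pos
    lower = np.tanh(lambda_neg) + slope_neg * (x - lambda_neg)  # x < lambda_neg
    middle = np.tanh(x)                                         # nonlinear core
    return np.where(x > lambda_pos, upper,
                    np.where(x < lambda_neg, lower, middle))
```

Because the linear extensions keep a strictly positive slope, the gradient in the former saturation regions no longer collapses toward zero, which is the mechanism the abstract credits for the faster and more stable training.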