Investigation of activation functions in deep belief network

Mian Mian Lau,King Hann Lim

doi:10.1109/iccre.2017.7935070

Abstract

Deep Belief Network (DBN) is made up of stacked Restricted Boltzmann Machine layers associated with global weight fine-tuning for pattern recognition. However, DBN suffers from vanishing gradient problem due to the saturation characteristic of activation function. Therefore, the selection of activation function in DBN is critical to reduce the network complexity and improve performance of pattern recognition. Unsaturated activation functions such as rectified linear unit and leaky rectified linear unit were recently proposed to avoid the effect of vanishing gradient for a deep learning neural network. In this paper, we investigated the network performance with both saturated and unsaturated activation functions. Besides that, the randomization of training samples would significantly improve the performance of DBN. The experimental results showed that hyperbolic tangent activation function achieved the lowest error rate which is 1.99% on MNIST handwritten digit dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Investigation of activation functions in deep belief network

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An improved fault diagnosis method based on deep wavelet neural network
Yibo Liu ... Qingyu Yang
-
Yibo Liu, et. al.Yibo Liu ... Qingyu Yang
01 Jun 2018
01 Jun 2018

Combined deep learning classifiers for stock market prediction: integrating stock price and news sentiments
Shilpa B L ... Shambhavi B R
Kybernetes | VOL. 52
Shilpa B L, et. al.Shilpa B L ... Shambhavi B R
09 Nov 2021
Kybernetes | VOL. 52

Novel deep generative simultaneous recurrent model for efficient representation learning
M Alam ... K.M Iftekharuddin
Neural Networks | VOL. 107
M Alam, et. al.M Alam ... K.M Iftekharuddin
09 Aug 2018
Neural Networks | VOL. 107

Deep Learning Neural Networks Trained with MODIS Satellite-Derived Predictors for Long-Term Global Solar Radiation Prediction
Sujan Ghimire ... Ravinesh C Deo
Energies | VOL. 12
Sujan Ghimire, et. al.Sujan Ghimire ... Ravinesh C Deo
22 Jun 2019
Energies | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Investigation of activation functions in deep belief network

Abstract

Talk to us

Similar Papers