Convergence of Gradient Descent Algorithm for Diagonal Recurrent Neural Networks

Dongpo Xu,Xiaoshuai Ding,Zhengxue Li,Wei Wu,Di Qu Di Qu

doi:10.1109/bicta.2007.4806412

Abstract

Recurrent neural networks have been used for analysis and prediction of time series. This paper is concerned with the convergence of the gradient descent algorithm for training the diagonal recurrent neural networks. The existing convergence results consider the online gradient training algorithm based on the assumption that a very large number of (or infinitely many in theory) training samples of the time series are available, and accordingly the stochastic process theory is used to establish some convergence results of probability nature. In this paper, we consider the case that only a small number of training samples of the time series are available such that the stochastic treatment of the problem is no longer appropriate. Instead, we use the offline gradient descent algorithm for training the diagonal recurrent neural network, and we accordingly prove some convergence results of deterministic nature. The monotonicity of the error function in the iteration is also guaranteed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Convergence of Gradient Descent Algorithm for Diagonal Recurrent Neural Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Effects of transfer learning for handwritten digit classification in a small training sample size situation
Yoshihiro Mitani ... Yoshihiko Hamamoto
-
Yoshihiro Mitani, et. al.Yoshihiro Mitani ... Yoshihiko Hamamoto
17 Dec 2022
17 Dec 2022

A multi-view ensemble model based on semi-supervised feature learning for small sample classification of PolSAR images
Mohsen Darvishnezhad
International Journal of Remote Sensing | VOL. 45
Mohsen DarvishnezhadMohsen Darvishnezhad
28 Jan 2024
International Journal of Remote Sensing | VOL. 45

Dataset artificial augmentation with a small number of training samples for reflectance estimation.
Jingjing Zhang ... Yuke He
Optics Express | VOL. 31
Jingjing Zhang, et. al.Jingjing Zhang ... Yuke He
17 Feb 2023
Optics Express | VOL. 31

Multi-View Feature Construction Using Genetic Programming for Rolling Bearing Fault Diagnosis [Application Notes
Bo Peng ... Mengjie Zhang
IEEE Computational Intelligence Magazine | VOL. 16
Bo Peng, et. al.Bo Peng ... Mengjie Zhang
01 Aug 2021
IEEE Computational Intelligence Magazine | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Convergence of Gradient Descent Algorithm for Diagonal Recurrent Neural Networks

Abstract

Talk to us

Similar Papers