Abstract

Recurrent Neural Networks (RNNs) are applied in areas such as speech recognition, natural language and video processing, and the identification of nonlinear state-space models. Conventional RNNs, e.g. the Elman network, are hard to train. A more recently developed class of recurrent networks, so-called Gated Units, outperforms conventional RNNs on virtually every task. This paper aims to provide additional insight into the differences between conventional RNNs and Gated Units in order to explain the superior performance of the latter. It is argued that Gated Units are easier to optimize not because they solve the vanishing gradient problem, but because they circumvent the emergence of large local gradients.
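
As a rough illustration of the gradient-flow contrast the abstract describes, the sketch below (not part of the paper; it assumes PyTorch's nn.RNNCell and nn.GRUCell as stand-ins for an Elman network and a gated unit, with arbitrary sequence length and hidden size) backpropagates a loss on the final hidden state to the first input of each cell and prints the resulting gradient norms as a crude proxy for how gradients propagate through time.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
T, d = 100, 32  # sequence length and hidden size (arbitrary choices)

def grad_norm_at_first_step(cell):
    """Unroll the cell over T steps, put a loss on the last hidden state,
    and return the gradient norm at the first input time step."""
    x = torch.randn(T, 1, d, requires_grad=True)
    h = torch.zeros(1, d)
    for t in range(T):
        h = cell(x[t], h)
    h.sum().backward()
    return x.grad[0].norm().item()

print("Elman RNN cell:", grad_norm_at_first_step(nn.RNNCell(d, d)))
print("GRU cell:      ", grad_norm_at_first_step(nn.GRUCell(d, d)))
```

Under default initialization the Elman cell's gradient at the first step typically differs from the GRU's by orders of magnitude; the exact numbers depend on the seed and dimensions and should not be read as a result of the paper.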
