Abstract

Recurrent neural networks (RNNs) are effective at modeling sequences for generation and classification, but their training is hampered by vanishing and exploding gradients. In this paper, we reformulate the RNN unit to learn residual functions with reference to the hidden state, instead of relying on conventional gating mechanisms such as the long short-term memory (LSTM) and the gated recurrent unit (GRU). The residual structure has two main benefits: first, it alleviates the vanishing and exploding gradient problems over long time scales; second, it eases optimization during backward updates. In the experiments, we evaluate our layer on language modeling, emotion classification and polyphonic modeling against LSTM and GRU layers. The results show that our layer achieves state-of-the-art performance, outperforms LSTM and GRU layers in terms of speed, and reaches accuracy competitive with the other methods.
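
To make the reformulation concrete, the following is a minimal sketch of a residual recurrent cell, assuming an update of the form h_t = h_{t-1} + F(x_t, h_{t-1}); the class name, layer sizes and variable names are illustrative and are not taken from the paper.

    # Hypothetical sketch of a residual recurrent cell (not the paper's exact
    # Res-RNN equations): the identity shortcut carries the hidden state
    # forward, and the network only learns the residual transition F.
    import torch
    import torch.nn as nn

    class ResidualRNNCell(nn.Module):
        def __init__(self, input_size, hidden_size):
            super().__init__()
            self.input_proj = nn.Linear(input_size, hidden_size)
            self.hidden_proj = nn.Linear(hidden_size, hidden_size)

        def forward(self, x_t, h_prev):
            # Residual branch: a plain tanh RNN transition.
            residual = torch.tanh(self.input_proj(x_t) + self.hidden_proj(h_prev))
            # Shortcut: add to the previous state instead of overwriting it.
            return h_prev + residual

    # Usage on a toy sequence (batch of 2, 5 time steps, 8 input features).
    cell = ResidualRNNCell(input_size=8, hidden_size=16)
    x = torch.randn(2, 5, 8)
    h = torch.zeros(2, 16)
    for t in range(x.size(1)):
        h = cell(x[:, t, :], h)

In this sketch the shortcut is a plain element-wise addition, which requires the input projection and the hidden state to share the same dimensionality.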

Highlights

  • Recurrent neural networks (RNNs) have proved efficient at learning sequential data, such as in acoustic modeling [1,2], natural language processing [3,4], machine translation [5,6], and sentiment analysis [7,8]

  • The fourth section presents the results of our network and compares them with a simple RNN, long short-term memory (LSTM) and the gated recurrent unit (GRU) on datasets from several domains: the airline travel information system (ATIS) database [22], the Internet movie database (IMDB) [23] and a polyphonic music database [24]

  • Our results achieve accuracy competitive with LSTM on this spoken language understanding task, and better than the simple RNN and GRU


Summary

Introduction

Recurrent neural networks (RNNs) have proved efficient at learning sequential data, such as in acoustic modeling [1,2], natural language processing [3,4], machine translation [5,6], and sentiment analysis [7,8]. In gated units such as the long short-term memory (LSTM), the output gate determines the degree to which the memory is exposed. Another gated RNN unit, the gated recurrent unit (GRU) [17], was introduced by Cho et al. in the context of machine translation. In the proposed residual recurrent networks (Res-RNN), we instead use residual learning to address the gradient issues that arise during horizontal (through-time) propagation in training. We propose our Res-RNN unit and analyze how residual learning helps to train RNNs. The fourth section presents the results of our network and compares them with a simple RNN, LSTM and GRU on datasets from several domains: the airline travel information system (ATIS) database [22], the Internet movie database (IMDB) [23] and a polyphonic music database [24]. The experiments show that our novel recurrent unit can provide state-of-the-art performance.
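
The gradient argument behind the residual formulation can be sketched as follows, assuming the update h_t = h_{t-1} + F(h_{t-1}, x_t); the exact Res-RNN equations are given in the full text, and the symbols below are illustrative.

    % Hedged sketch: Jacobian of the assumed residual update and the
    % backpropagated product over time steps k+1..T.
    \[
      \frac{\partial h_t}{\partial h_{t-1}}
        = I + \frac{\partial F(h_{t-1}, x_t)}{\partial h_{t-1}},
      \qquad
      \frac{\partial \mathcal{L}}{\partial h_k}
        = \frac{\partial \mathcal{L}}{\partial h_T}
          \prod_{t=k+1}^{T}
          \left( I + \frac{\partial F(h_{t-1}, x_t)}{\partial h_{t-1}} \right).
    \]

Because every Jacobian factor contains the identity term I, the backpropagated product is less prone to shrinking toward zero (or blowing up) than the product of purely multiplicative Jacobians in a vanilla RNN, which is the intuition behind the claim about gradient flow during through-time propagation.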

Gradient Issues
Residual-Shortcut Structure
Analysis of Res-RNN
Experiments and Discussion
ATIS Database
Findings
Polyphonic
Conclusions