Dynamic temporal residual network for sequence modeling

Ruijie Yan,Liangrui Peng,Shanyu Xiao,Shengjin Wang,Michael T Johnson

doi:10.1007/s10032-019-00328-x

Abstract

The long short-term memory (LSTM) network with gating mechanism has been widely used in sequence modeling tasks including handwriting and speech recognition. As an LSTM network can be unfolded along the temporal dimension and its temporal depth is equal to the length of the input feature sequence, the introduction of gating might not be sufficient to completely model the dynamic temporal dependencies in sequential data. Inspired by the residual learning in ResNet, this paper proposes a dynamic temporal residual network (DTRN) by incorporating residual learning into an LSTM network along the temporal dimension. DTRN involves two networks: Its primary network consists of modified LSTM units with weighted shortcut connections for adjacent temporal outputs, while its secondary network generates dynamic weights for the shortcut connections. To validate the performance of DTRN, we conduct experiments on three commonly used public handwriting recognition datasets (IFN/ENIT, IAM and Rimes) and one speech recognition dataset (TIMIT). The experimental results show that the proposed DTRN has outperformed previously reported methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dynamic temporal residual network for sequence modeling

Abstract

Talk to us

Similar Papers

More From: International Journal on Document Analysis and Recognition (IJDAR)

Lead the way for us

Journal: International Journal on Document Analysis and Recognition (IJDAR)	Publication Date: Jul 2, 2019
Citations: 5

Similar Papers

Dynamic Temporal Residual Learning for Speech Recognition
Jiaqi Xie ... Michael T Johnson
-
Jiaqi Xie, et. al.Jiaqi Xie ... Michael T Johnson
11 Apr 2020
11 Apr 2020

Using MLSTM and Multioutput Convolutional LSTM Algorithms for Detecting Anomalous Patterns in Streamed Data of Unmanned Aerial Vehicles
Ahmad Alos ... Zouhair Dahrouj
IEEE Aerospace and Electronic Systems Magazine | VOL. 37
Ahmad Alos, et. al.Ahmad Alos ... Zouhair Dahrouj
01 Jun 2022
IEEE Aerospace and Electronic Systems Magazine | VOL. 37

A novel machine learning-based framework for the water quality parameters prediction using hybrid long short-term memory and locally weighted scatterplot smoothing methods
Ana Dodig ... Elisa Ricci
Journal of Hydroinformatics | VOL. 26
Ana Dodig, et. al.Ana Dodig ... Elisa Ricci
12 Apr 2024
Journal of Hydroinformatics | VOL. 26

Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network
Alex Sherstinsky
Physica D: Nonlinear Phenomena | VOL. 404
Alex SherstinskyAlex Sherstinsky
21 Jan 2020
Physica D: Nonlinear Phenomena | VOL. 404

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic temporal residual network for sequence modeling

Abstract

Talk to us

Similar Papers

More From: International Journal on Document Analysis and Recognition (IJDAR)