Learning Sequence Representations by Non-local Recurrent Neural Memory

Wenjie Pei,Guangming Lu,Yu-Wing Tai,Xin Feng,Canmiao Fu,Qiong Cao

doi:10.1007/s11263-022-01648-y

Abstract

The key challenge of sequence representation learning is to capture the long-range temporal dependencies. Typical methods for supervised sequence representation learning are built upon recurrent neural networks to capture temporal dependencies. One potential limitation of these methods is that they only model one-order information interactions explicitly between adjacent time steps in a sequence, hence the high-order interactions between nonadjacent time steps are not fully exploited. It greatly limits the capability of modeling the long-range temporal dependencies since the temporal features learned by one-order interactions cannot be maintained for a long term due to temporal information dilution and gradient vanishing. To tackle this limitation, we propose the non-local recurrent neural memory (NRNM) for supervised sequence representation learning, which performs non-local operations by means of self-attention mechanism to learn full-order interactions within a sliding temporal memory block and models global interactions between memory blocks in a gated recurrent manner. Consequently, our model is able to capture long-range dependencies. Besides, the latent high-level features contained in high-order interactions can be distilled by our model. We validate the effectiveness and generalization of our NRNM on three types of sequence applications across different modalities, including sequence classification, step-wise sequential prediction and sequence similarity learning. Our model compares favorably against other state-of-the-art methods specifically designed for each of these sequence applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Sequence Representations by Non-local Recurrent Neural Memory

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision

Lead the way for us

Similar Papers

Non-Local Recurrent Neural Memory for Supervised Sequence Modeling
Canmiao Fu ... Qiong Cao
-
Canmiao Fu, et. al.Canmiao Fu ... Qiong Cao
01 Oct 2019
01 Oct 2019

B2-ViT Net: Broad Vision Transformer Network With Broad Attention for Seizure Prediction.
Shuiling Shi ... Wenqi Liu
IEEE Transactions on Neural Systems and Rehabilitation Engineering | VOL. 32
Shuiling Shi, et. al.Shuiling Shi ... Wenqi Liu
01 Jan 2024
IEEE Transactions on Neural Systems and Rehabilitation Engineering | VOL. 32

Weakly Supervised Temporal Action Detection With Temporal Dependency Learning
Bairong Li ... Yuesheng Zhu
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
Bairong Li, et. al.Bairong Li ... Yuesheng Zhu
01 Jul 2022
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32

FMRI-S4: Learning Short- and Long-Range Dynamic fMRI Dependencies Using 1D Convolutions and State Space Models
Ahmed El-Gazzar ... Guido Van Wingen
-
Ahmed El-Gazzar, et. al.Ahmed El-Gazzar ... Guido Van Wingen
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Sequence Representations by Non-local Recurrent Neural Memory

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision