Temporal Memory Attention for Video Semantic Segmentation

Hao Wang, Weining Wang, Jing Liu

doi:10.1109/icip42928.2021.9506731

Abstract

Video semantic segmentation requires exploiting the complex temporal relations between frames of a video sequence. Previous works usually rely on accurate optical flow to leverage these temporal relations, which incurs a heavy computational cost. In this paper, we propose a Temporal Memory Attention Network (TMANet) that adaptively integrates long-range temporal relations over the video sequence using the self-attention mechanism, without exhaustive optical flow prediction. Specifically, we construct a memory from several past frames to store temporal information for the current frame. We then propose a temporal memory attention module that captures the relation between the current frame and the memory to enhance the representation of the current frame. Our method achieves new state-of-the-art performance on two challenging video semantic segmentation datasets: 80.3% mIoU on Cityscapes and 76.5% mIoU on CamVid with ResNet-50.
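
The attention step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `temporal_memory_attention`, the plain dot-product scoring, and the list-of-vectors feature layout are all assumptions made for illustration. Each position of the current frame acts as a query, and it attends over every position of every past frame stored in the memory.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def temporal_memory_attention(query, memory_keys, memory_values):
    """Hypothetical sketch of attention between a current frame and a memory.

    query:         N feature vectors, one per position of the current frame.
    memory_keys:   T past frames, each a list of key vectors.
    memory_values: T past frames, each a list of value vectors.
    Returns N vectors, each a weighted sum of all memory values.
    """
    # Flatten the T past frames into one pool of memory positions.
    keys = [k for frame in memory_keys for k in frame]
    values = [v for frame in memory_values for v in frame]
    dim = len(values[0])

    attended = []
    for q in query:
        # Similarity of this position to every memory position.
        weights = softmax([dot(q, k) for k in keys])
        # Aggregate memory values using the attention weights.
        attended.append(
            [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
        )
    return attended


if __name__ == "__main__":
    current = [[1.0, 0.0]]                       # one position, two channels
    mem_keys = [[[10.0, 0.0]], [[0.0, 10.0]]]    # two past frames
    mem_vals = [[[1.0, 2.0]], [[3.0, 4.0]]]
    print(temporal_memory_attention(current, mem_keys, mem_vals))
```

In this toy input the query matches the key of the first past frame far more strongly than the second, so the output stays close to that frame's value vector. No optical flow is computed at any point; the correspondence between frames comes entirely from the attention weights.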

Similar Papers

TDSNet: A temporal difference based network for video semantic segmentation
Haochen Yuan ... Zesu Cai
Information Sciences | VOL. 686
01 Aug 2024

Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow
Mingyu Ding ... Bolei Zhou
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
03 Apr 2020

Video Semantic Segmentation via Sparse Temporal Transformer
Jiangtong Li ... Chen Qian
17 Oct 2021

Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video
Samvit Jain ... Xin Wang
01 Jun 2019
