Abstract

Video semantic segmentation involves two main challenges: how to take full advantage of multi-frame context information, and how to improve computational efficiency. To tackle both challenges simultaneously, we present a novel Multi-Granularity Context Network (MGCNet) that aggregates context information at multiple granularities in an effective and efficient way. Our method first converts image features into semantic prototypes, and then conducts a non-local operation to aggregate the per-frame and short-term contexts jointly. An additional long-term context module is introduced to capture video-level semantic information during training. By aggregating both local and global semantic information, a strong feature representation is obtained. The proposed pixel-to-prototype non-local operation incurs lower computational cost than traditional pixel-to-pixel non-local operations, and is video-friendly since it reuses the semantic prototypes of previous frames. Moreover, we propose an uncertainty-aware and structural knowledge distillation strategy to further boost the performance of our method. Experiments on the Cityscapes and CamVid datasets with multiple backbones demonstrate that the proposed MGCNet outperforms other state-of-the-art methods with high speed and low latency.
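To make the pixel-to-prototype idea concrete, below is a minimal sketch (not the authors' implementation) of how such an operation could look. It assumes prototypes are class-wise soft-pooled features obtained from coarse segmentation scores, and that short-term context is modeled by concatenating prototypes cached from previous frames; the function names `compute_prototypes` and `pixel_to_prototype_attention` are hypothetical.

```python
# Hedged sketch of a pixel-to-prototype non-local operation (assumed design,
# not the paper's exact formulation).
import torch
import torch.nn.functional as F

def compute_prototypes(feats, logits):
    """Soft-pool pixel features into K semantic prototypes.

    feats:  (B, C, H, W) backbone features
    logits: (B, K, H, W) coarse per-class scores
    returns (B, K, C) prototypes
    """
    B, C, H, W = feats.shape
    probs = F.softmax(logits, dim=1).flatten(2)                 # (B, K, HW)
    probs = probs / (probs.sum(dim=-1, keepdim=True) + 1e-6)    # normalize per class
    feats = feats.flatten(2).transpose(1, 2)                    # (B, HW, C)
    return torch.bmm(probs, feats)                              # (B, K, C)

def pixel_to_prototype_attention(feats, prototypes):
    """Attend each pixel to K prototypes instead of all HW pixels,
    shrinking the affinity matrix from (HW x HW) to (HW x K).

    feats:      (B, C, H, W)
    prototypes: (B, K, C)  -- may include prototypes reused from earlier frames
    returns     (B, C, H, W) context-enhanced features
    """
    B, C, H, W = feats.shape
    q = feats.flatten(2).transpose(1, 2)                        # (B, HW, C)
    attn = torch.softmax(q @ prototypes.transpose(1, 2) / C ** 0.5, dim=-1)  # (B, HW, K)
    ctx = attn @ prototypes                                     # (B, HW, C)
    return feats + ctx.transpose(1, 2).reshape(B, C, H, W)

# Usage: joint per-frame and short-term context by reusing cached prototypes.
feats = torch.randn(1, 256, 64, 128)
logits = torch.randn(1, 19, 64, 128)        # e.g. 19 Cityscapes classes
prev = torch.randn(1, 19, 256)              # prototypes cached from the previous frame
cur = compute_prototypes(feats, logits)
out = pixel_to_prototype_attention(feats, torch.cat([cur, prev], dim=1))
```

Because the affinity is computed against a handful of prototypes rather than every pixel, the cost scales with the number of classes instead of the image resolution, which is consistent with the efficiency claim in the abstract.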
