GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Zhuoying Wang,Yangyan Li,Weisi Lin,Yongtao Wang,Zhi Tang,Haibin Ling,Ying Chen

doi:10.1109/icpr48806.2021.9412965

Abstract

Existing CNN-based methods for semantic segmentation heavily depend on multi-scale features to meet the requirements of both semantic comprehension and detail preservation. State-of-the-art segmentation networks widely exploit conventional scale-transfer operations, i.e., up-sampling and down-sampling to learn multi-scale features. In this work, we find that these operations lead to scale-confused features and suboptimal performance because they are spatial-invariant and directly transit all feature information cross scales without spatial selection. To address this issue, we propose the Gated Scale-Transfer Operation (GSTO) to properly transit spatial-filtered features to another scale. Specifically, GSTO can work either with or without extra supervision. Unsupervised GSTO is learned from the feature itself while the supervised one is guided by the supervised probability matrix. Both forms of GSTO are lightweight and plug-and-play, which can be flexibly integrated into networks or modules for learning better multi-scale features. In particular, by plugging GSTO into HRNet, we get a more powerful backbone (namely GSTO-HRNet) for pixel labeling, and it achieves new state-of-the-art results on multiple benchmarks for semantic segmentation including Cityscapes, LIP, and Pascal Context, with a negligible extra computational cost. Moreover, experiment results demonstrate that GSTO can also significantly boost the performance of multi-scale feature aggregation modules like PPM and ASPP.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

-

29 Dec 2020
29 Dec 2020

Many heads are better than one: A multiscale neural information feature fusion framework for spatial route selections decoding from multichannel neural recordings of pigeons
Mengmeng Li ... Hong Wan
Brain Research Bulletin | VOL. 184
Mengmeng Li, et. al.Mengmeng Li ... Hong Wan
12 Mar 2022
Brain Research Bulletin | VOL. 184

Multi-scale Adaptive Feature Fusion Network for Semantic Segmentation in Remote Sensing Images
Ronghua Shang ... Jiyu Zhang
Remote Sensing | VOL. 12
Ronghua Shang, et. al.Ronghua Shang ... Jiyu Zhang
09 Mar 2020
Remote Sensing | VOL. 12

Semantic Segmentation of Point Cloud Scene via Multi-Scale Feature Aggregation and Adaptive Fusion
Baoyun Guo ... Na Sun
Photogrammetric Engineering & Remote Sensing | VOL. 90
Baoyun Guo, et. al.Baoyun Guo ... Na Sun
01 Sep 2024
Photogrammetric Engineering & Remote Sensing | VOL. 90

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Abstract

Talk to us

Similar Papers