Two-Stage Cascaded Decoder for Semantic Segmentation of RGB-D Images

Yuchun Yue,Jingsheng Lei,Lu Yu,Wujie Zhou

doi:10.1109/lsp.2021.3084855

Abstract

Exploiting RGB and depth information can boost the performance of semantic segmentation. However, owing to the differences between RGB images and the corresponding depth maps, such multimodal information should be effectively used and combined. Most existing methods use the same fusion strategy to explore multilevel complementary information at various levels, likely ignoring different feature contributions at various levels for segmentation. To address this problem, we propose a network using a two-stage cascaded decoder (TCD), embedding a detail polishing module, to effectively integrate high- and low-level features and suppress noise from low-level details. Additionally, we introduce a depth filter and fusion module to extract informative regions from depth cues with the guidance of RGB images. The proposed TCD network achieves comparable performance to state-of-the-art RGB-D semantic segmentation methods on the benchmark NYUDv2 and SUN RGB-D datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Two-Stage Cascaded Decoder for Semantic Segmentation of RGB-D Images

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Journal: IEEE Signal Processing Letters	Publication Date: Jan 1, 2021
Citations: 25

Similar Papers

AMCFNet: Asymmetric multiscale and crossmodal fusion network for RGB-D semantic segmentation in indoor service robots
Wujie Zhou ... Lu Yu
Journal of Visual Communication and Image Representation | VOL. 97
Wujie Zhou, et. al.Wujie Zhou ... Lu Yu
16 Oct 2023
Journal of Visual Communication and Image Representation | VOL. 97

RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation
Xingchao Yan ... Weikuan Jia
Displays | VOL. 70
Xingchao Yan, et. al.Xingchao Yan ... Weikuan Jia
04 Sep 2021
Displays | VOL. 70

BDR6D: Bidirectional Deep Residual Fusion Network for 6D Pose Estimation
Penglei Liu ... Qieshi Zhang
IEEE Transactions on Automation Science and Engineering | VOL. 21
Penglei Liu, et. al.Penglei Liu ... Qieshi Zhang
01 Apr 2024
IEEE Transactions on Automation Science and Engineering | VOL. 21

Hybrid-Attention Network for RGB-D Salient Object Detection
Yuzhen Chen ... Wujie Zhou
Applied Sciences | VOL. 10
Yuzhen Chen, et. al.Yuzhen Chen ... Wujie Zhou
21 Aug 2020
Applied Sciences | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-Stage Cascaded Decoder for Semantic Segmentation of RGB-D Images

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters