Abstract

Scene parsing is a fundamental task in computer vision. Various RGB-D (color and depth) scene parsing methods based on fully convolutional networks have achieved excellent performance. However, color and depth information differ in nature, and existing methods fail to coordinate high-level and low-level information when aggregating the two modalities, which introduces noise or loses key information in the aggregated features and yields inaccurate segmentation maps. In addition, the features extracted by the depth branch are weak because of the low quality of depth maps, resulting in unsatisfactory feature representations. To address these drawbacks, we propose a progressive guided fusion and depth enhancement network (PGDENet) for RGB-D indoor scene parsing. First, high-quality RGB images are used to improve the depth data through a depth enhancement module, in which the depth maps are strengthened in terms of channel and spatial correlations. Then, we integrate information from the RGB and enhanced depth modalities using a progressive complementary fusion module, in which we start from high-level semantic information and move down layerwise, guiding the fusion of adjacent layers while reducing hierarchy-based differences. Extensive experiments are conducted on two public indoor scene datasets, and the results show that the proposed PGDENet outperforms state-of-the-art methods in RGB-D scene parsing.
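
To make the depth enhancement idea concrete, the following is a minimal PyTorch sketch of a module that reweights depth features along channel and spatial dimensions using cues from the RGB branch, as the abstract describes. The module name, layer choices, and tensor names are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of RGB-guided depth enhancement (channel + spatial gating).
# Names and layer sizes are assumptions; only the overall idea follows the abstract.
import torch
import torch.nn as nn


class DepthEnhancement(nn.Module):
    """Enhance depth features with channel- and spatial-wise cues from RGB features."""

    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        # Channel attention: global pooling of RGB features -> per-channel weights.
        self.channel_gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        # Spatial attention: single-channel importance map from the RGB features.
        self.spatial_gate = nn.Sequential(
            nn.Conv2d(channels, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, rgb_feat: torch.Tensor, depth_feat: torch.Tensor) -> torch.Tensor:
        # Reweight depth channels, then spatial positions; keep a residual
        # connection so the original depth signal is preserved.
        enhanced = depth_feat * self.channel_gate(rgb_feat)
        enhanced = enhanced * self.spatial_gate(rgb_feat)
        return depth_feat + enhanced


if __name__ == "__main__":
    dem = DepthEnhancement(channels=64)
    rgb = torch.randn(2, 64, 60, 80)
    depth = torch.randn(2, 64, 60, 80)
    print(dem(rgb, depth).shape)  # torch.Size([2, 64, 60, 80])
```

The enhanced depth features would then be fused with the RGB features layer by layer, starting from the highest (most semantic) level and propagating guidance downward, in the spirit of the progressive complementary fusion module described above.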
