RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation

Xingchao Yan,Sujuan Hou,Awudu Karim,Weikuan Jia

doi:10.1016/j.displa.2021.102082

Abstract

Semantic segmentation based on the complementary information from RGB and depth images has recently gained great popularity, but due to the difference between RGB and depth maps, how to effectively use RGB-D information is still a problem. In this paper, we propose a novel RGB-D semantic segmentation network named RAFNet, which can selectively gather features from the RGB and depth information. Specifically, we construct an architecture with three parallel branches and propose several complementary attention modules. This structure enables a fusion branch and we add the Bi-directional Multi-step Propagation (BMP) strategy to it, which can not only retain the feature streams of the original RGB and depth branches but also fully utilize the feature flow of the fusion branch. There are three kinds of complementary attention modules that we have constructed. The RGB-D fusion module can effectively extract important features from the RGB and depth branch streams. The refinement module can reduce the loss of semantic information and the context aggregation module can help propagate and integrate information better. We train and evaluate our model on NYUDv2 and SUN-RGBD datasets, and prove that our model achieves state-of-the-art performances.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation

Abstract

Talk to us

Similar Papers

More From: Displays

Lead the way for us

Similar Papers

ACNET: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation
Xinxin Hu ... Kailun Yang
-
Xinxin Hu, et. al.Xinxin Hu ... Kailun Yang
01 Sep 2019
01 Sep 2019

Finger Spelling Recognition from RGB-D Information Using Kernel Descriptor
K Otiniano Rodriguez ... G Camara Chavez
-
K Otiniano Rodriguez, et. al.K Otiniano Rodriguez ... G Camara Chavez
01 Aug 2013
01 Aug 2013

Two-Stage Cascaded Decoder for Semantic Segmentation of RGB-D Images
Yuchun Yue ... Wujie Zhou
IEEE Signal Processing Letters | VOL. 28
Yuchun Yue, et. al.Yuchun Yue ... Wujie Zhou
01 Jan 2020
IEEE Signal Processing Letters | VOL. 28

BDR6D: Bidirectional Deep Residual Fusion Network for 6D Pose Estimation
Penglei Liu ... Qieshi Zhang
IEEE Transactions on Automation Science and Engineering | VOL. 21
Penglei Liu, et. al.Penglei Liu ... Qieshi Zhang
01 Apr 2024
IEEE Transactions on Automation Science and Engineering | VOL. 21

Journal: Displays	Publication Date: Sep 4, 2021
Citations: 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RAFNet: RGB-D attention feature fusion network for indoor semantic segmentation

Abstract

Talk to us

Similar Papers

More From: Displays