Abstract

Scene parsing is a challenging task for complex and diverse scenes. In this study, the authors address semantic segmentation of indoor scenes in red, green, blue‐depth (RGB‐D) images. Most existing works use only the colour or photometric information for this problem. Here, the authors present an approach that fuses feature maps between a colour network branch and a depth network branch, integrating photometric and geometric information and thereby improving semantic segmentation performance. They propose a novel convolutional neural network that uses ResNet as its baseline. The proposed network adopts a spatial pyramid pooling module to make full use of different sub‐region representations, and multiple feature‐map fusion modules to integrate texture and structure information between the colour and depth branches. Moreover, it attaches multiple auxiliary loss branches alongside the main loss function to prevent the gradients of the earlier layers from vanishing and to accelerate training of the fusion part. Comprehensive experimental evaluations show that the proposed network, ‘ResFusion’, greatly improves performance over the baseline network and achieves competitive results compared with other state‐of‐the‐art methods on the challenging SUN RGB‐D benchmark.
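The two core operations described above, fusing feature maps from the colour and depth branches and pooling over sub-regions of different sizes, can be sketched in NumPy. This is a minimal illustration, not the authors' implementation: it assumes element-wise summation as the fusion rule and PSPNet-style bin sizes (1, 2, 3, 6) for the pyramid, neither of which is specified in the abstract.

```python
import numpy as np

def fuse_feature_maps(color_feat, depth_feat):
    """Hypothetical fusion of same-stage colour- and depth-branch feature
    maps, shape (C, H, W), by element-wise summation (an assumption here)."""
    assert color_feat.shape == depth_feat.shape
    return color_feat + depth_feat

def spatial_pyramid_pool(feat, bin_sizes=(1, 2, 3, 6)):
    """Average-pool a (C, H, W) feature map into several sub-region grids,
    one pooled map per pyramid level (bin sizes are assumed, PSPNet-style)."""
    c, h, w = feat.shape
    pooled = []
    for b in bin_sizes:
        out = np.zeros((c, b, b))
        for i in range(b):
            for j in range(b):
                hs, he = i * h // b, (i + 1) * h // b
                ws, we = j * w // b, (j + 1) * w // b
                out[:, i, j] = feat[:, hs:he, ws:we].mean(axis=(1, 2))
        pooled.append(out)
    return pooled

# Toy usage: fuse two 4-channel 12x12 feature maps, then build the pyramid.
rgb_feat = np.ones((4, 12, 12))
depth_feat = np.full((4, 12, 12), 2.0)
fused = fuse_feature_maps(rgb_feat, depth_feat)
levels = spatial_pyramid_pool(fused)
print(fused[0, 0, 0])               # 3.0
print([p.shape for p in levels])    # [(4, 1, 1), (4, 2, 2), (4, 3, 3), (4, 6, 6)]
```

In the full network these pooled sub-region representations would be upsampled and concatenated back onto the fused feature map before the final classification layers.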
