INDOOR SEMANTIC SEGMENTATION FROM RGB-D IMAGES BY INTEGRATING FULLY CONVOLUTIONAL NETWORK WITH HIGHER-ORDER MARKOV RANDOM FIELD

J Yang, Z Kang

doi:10.5194/isprs-archives-xlii-4-717-2018

Abstract

Indoor scenes are characterized by abundant semantic categories, illumination changes, occlusions and overlaps among objects, all of which pose great challenges for indoor semantic segmentation. In this paper, we therefore develop a method based on a higher-order Markov random field (MRF) model for indoor semantic segmentation from RGB-D images. Instead of using the RGB-D images directly, we first train and apply a RefineNet model on the RGB information alone to generate high-level semantic information. Then, the spatial location relationship from the depth channel and the spectral information from the color channels are integrated as a prior for a marker-controlled watershed algorithm to obtain robust and accurate visually homogeneous regions. Finally, the higher-order MRF model encodes the short-range context among adjacent pixels and the long-range context within each visually homogeneous region to refine the semantic segmentation. To evaluate the effectiveness and robustness of the proposed method, experiments were conducted on the public SUN RGB-D dataset. The experimental results indicate that, compared with using RGB information alone, the proposed method remarkably improves the semantic segmentation results, especially at object boundaries.
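
As an illustrative sketch only, a higher-order MRF of the kind described above is commonly written as an energy with three terms. The notation below (the pixel set V, the neighborhood set N, the region set R, the truncation parameter Q and the maximum cost gamma_max) is assumed here for illustration and is not taken from the paper. The region term follows the general Robust P^n Potts form, which may differ from the potential the authors actually use.

```latex
% Assumed notation: x_i is the label of pixel i, \mathbf{x}_r the labels of all pixels in region r.
E(\mathbf{x}) = \sum_{i \in \mathcal{V}} \psi_i(x_i)
              + \sum_{(i,j) \in \mathcal{N}} \psi_{ij}(x_i, x_j)
              + \sum_{r \in \mathcal{R}} \psi_r(\mathbf{x}_r)

% Assumed region potential (Robust P^n Potts form):
% n_k(\mathbf{x}_r) counts the pixels in region r that carry label k.
\psi_r(\mathbf{x}_r) = \min\!\left( \frac{N_r(\mathbf{x}_r)}{Q}\,\gamma_{\max},\; \gamma_{\max} \right),
\qquad
N_r(\mathbf{x}_r) = \min_k \bigl( |r| - n_k(\mathbf{x}_r) \bigr)
```

Under these assumptions, the unary term would come from the RefineNet class scores (for example a negative log-probability), the pairwise term would carry the short-range context among adjacent pixels, and the region term would carry the long-range context by penalizing label disagreement inside each watershed region.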

Highlights

Semantic segmentation is a fundamental problem in computer vision: it decomposes a scene into meaningful parts and assigns a semantic label to each of them (Wolf et al., 2015).
To address the limitations of state-of-the-art semantic segmentation methods for indoor scenes, we develop a method based on a higher-order Markov random field (MRF) model for indoor semantic segmentation from RGB-D images.
The higher-order MRF model combines the high-level semantic information derived from RefineNet with the low-level visual information captured by a marker-controlled watershed algorithm.
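
The idea of combining per-pixel network labels with homogeneous regions can be illustrated with a much simpler stand-in than MRF inference. The sketch below is not the authors' method: it is a hard-assignment heuristic in which each region adopts its dominant label when enough of its pixels agree. The function name `refine_by_regions` and the `min_agreement` threshold are hypothetical and chosen only for this example.

```python
from collections import Counter


def refine_by_regions(pixel_labels, regions, min_agreement=0.7):
    """Enforce label consistency inside each region by majority vote.

    pixel_labels: 2-D list of per-pixel class labels (e.g. from a network).
    regions:      2-D list of region ids with the same shape.
    min_agreement: hypothetical threshold; a region is relabelled only if
                   its dominant label covers at least this fraction of it.
    """
    # Count how often each label occurs inside each region.
    votes = {}
    for label_row, region_row in zip(pixel_labels, regions):
        for label, region in zip(label_row, region_row):
            votes.setdefault(region, Counter())[label] += 1

    # Keep the dominant label only for regions that agree strongly enough.
    dominant = {}
    for region, counts in votes.items():
        label, count = counts.most_common(1)[0]
        if count / sum(counts.values()) >= min_agreement:
            dominant[region] = label

    # Pixels in weakly agreeing regions keep their original label.
    return [
        [dominant.get(region, label) for label, region in zip(label_row, region_row)]
        for label_row, region_row in zip(pixel_labels, regions)
    ]
```

A soft penalty inside an energy minimization, as in a higher-order MRF, lets the unary and pairwise evidence override the region vote; this heuristic does not, which is why it should be read only as an intuition aid.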

Summary

INTRODUCTION

Semantic segmentation is a fundamental problem in computer vision: it decomposes a scene into meaningful parts and assigns a semantic label to each of them (Wolf et al., 2015). Müller and Behnke (2014) applied a conditional random field, into which color, depth and 3D scene features were incorporated, to the semantic annotation of RGB-D images. Such conventional methods usually consist of segmentation, feature extraction and classification stages, and their final results depend on the results of each stage (Husain et al., 2016). To cope with occlusions and overlaps among objects in indoor scenes, the spatial location relationship from the depth channel and the spectral information from the color channels are integrated as prior information for a marker-controlled watershed algorithm to derive robust and accurate visually homogeneous regions. These regions encode the low-level visual features that complement the high-level semantics when reconstructing detailed boundaries.
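
A marker-controlled watershed driven by both color and depth can be sketched as follows. This is a minimal pure-Python illustration under stated assumptions, not the authors' implementation: the relief is assumed to be a weighted sum of intensity and depth gradients, the weight `depth_weight` and the function names are hypothetical, and the markers are assumed to be given (the source does not say how they are chosen).

```python
import heapq


def gradient_magnitude(img):
    """Sum of absolute central differences of a 2-D list (borders clamped)."""
    h, w = len(img), len(img[0])
    grad = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            gx = abs(img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)])
            gy = abs(img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x])
            grad[y][x] = gx + gy
    return grad


def marker_watershed(gray, depth, markers, depth_weight=0.5):
    """Flood a combined color/depth relief from labelled seed pixels.

    gray, depth: 2-D lists of floats with the same shape.
    markers:     2-D list of ints, 0 = unlabelled, >0 = seed region id.
    depth_weight: hypothetical mixing weight between the two gradients.
    """
    h, w = len(gray), len(gray[0])
    g_color = gradient_magnitude(gray)
    g_depth = gradient_magnitude(depth)
    relief = [
        [(1 - depth_weight) * g_color[y][x] + depth_weight * g_depth[y][x]
         for x in range(w)]
        for y in range(h)
    ]

    labels = [row[:] for row in markers]
    heap = []
    order = 0  # tie-breaker so that equal-height pixels pop in insertion order
    for y in range(h):
        for x in range(w):
            if labels[y][x] > 0:
                heapq.heappush(heap, (relief[y][x], order, y, x))
                order += 1

    # Always grow from the lowest pixel on the frontier; each unlabelled
    # 4-neighbour inherits the label of the pixel that reaches it first.
    while heap:
        _, _, y, x = heapq.heappop(heap)
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w and labels[ny][nx] == 0:
                labels[ny][nx] = labels[y][x]
                heapq.heappush(heap, (relief[ny][nx], order, ny, nx))
                order += 1
    return labels
```

Because the depth gradient contributes to the relief, two objects with similar color but different distance from the camera are still separated at the depth discontinuity, which is the motivation the text gives for integrating both channels.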

METHODOLOGY

Initial semantic segmentation using RefineNet

Region-level label consistency based on higher-order MRF model

EXPERIMENTATION AND ANALYSIS

Experimental data and evaluation criteria

Experimental analysis

CONCLUSION

Journal: ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Publication Date: Sep 19, 2018
License type: CC BY 4.0
