Abstract

We consider semantic image segmentation. Our method is inspired by Bayesian deep learning, which improves segmentation accuracy by modeling the uncertainty of the network output. In contrast to modeling uncertainty, our method directly learns to predict the erroneous pixels of a segmentation network, formulated as a binary classification problem. This speeds up training compared with the Monte Carlo integration often used in Bayesian deep learning, and it also allows us to train a branch to correct the labels of erroneous pixels. Our method consists of three stages: (i) predict the pixel-wise error probability of the initial result, (ii) redetermine new labels for pixels with high error probability, and (iii) fuse the initial result and the redetermined result according to the error probability. We formulate error-pixel prediction as a classification task and employ an error-prediction branch in the network to predict pixel-wise error probabilities. We also introduce a detail branch to focus the training process on the erroneous pixels. We have experimentally validated our method on the Cityscapes and ADE20K datasets. Our model can be easily added to various advanced segmentation networks to improve their performance. Taking DeepLabv3+ as an example, our network achieves 82.88% mIoU on the Cityscapes test set and 45.73% on the ADE20K validation set, improving the corresponding DeepLabv3+ results by 0.74% and 0.13%, respectively.
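The fusion in stage (iii) can be sketched as a pixel-wise convex combination of the two predictions, weighted by the predicted error probability. This is a minimal illustration under our own assumptions (the paper's actual fusion rule may differ); the function name `fuse_predictions` and the array shapes are hypothetical.

```python
import numpy as np

def fuse_predictions(initial_scores, refined_scores, error_prob):
    """Stage (iii) sketch: blend the initial result and the redetermined
    result pixel-wise, trusting the refined scores where the predicted
    error probability is high.
    Shapes: scores (C, H, W), error_prob (H, W)."""
    w = error_prob[np.newaxis, ...]                  # broadcast over classes
    fused = (1.0 - w) * initial_scores + w * refined_scores
    return fused.argmax(axis=0)                      # final label map (H, W)
```

Where the error branch outputs probability 0 the initial prediction passes through unchanged, and where it outputs 1 the redetermined labels take over, so the initial result is only revised at pixels flagged as likely errors.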

Highlights

  • The goal of semantic image segmentation is to obtain a high-level representation of an image by assigning each pixel a semantic class label

  • Deep convolutional neural networks (DCNNs) trained on large-scale image segmentation datasets such as PASCAL VOC 2012 [1], Cityscapes [2], and ADE20K [3] have significantly improved the accuracy of image segmentation

  • Our network trained on Cityscapes achieves a mean intersection over union (mIoU) of 82.88% on the test set when using DeepLabv3+ as the semantic branch [12], 0.74% higher than the original network



Introduction

The goal of semantic image segmentation is to obtain a high-level representation of an image by assigning each pixel a semantic class label. Deep convolutional neural networks (DCNNs) trained on large-scale image segmentation datasets such as PASCAL VOC 2012 [1], Cityscapes [2], and ADE20K [3] have significantly improved the accuracy of image segmentation. While end-to-end training of a DCNN can effectively learn multi-scale features for various vision tasks, the down-sampling operations in the encoder, designed to enlarge the receptive field, are likely to lose the detailed information required for pixel-level image segmentation [4]. Even with state-of-the-art image segmentation algorithms, we still see a large number of mislabeled pixels in regions with indistinct RGB information, at object boundaries, and in small-scale objects. Erroneous pixels whose largest label probability in one layer exceeds the acceptance threshold, which we refer to as hard erroneous pixels, are accepted as part of the result and overlooked in subsequent layers.
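To make the notion of hard erroneous pixels concrete, the sketch below (our own illustration; the function name and the `threshold` value are assumptions, not from the paper) flags pixels that the network labels both confidently and wrongly, which is why a fixed confidence threshold alone cannot catch them:

```python
import numpy as np

def hard_erroneous_mask(probs, gt_labels, threshold=0.9):
    """Flag 'hard' erroneous pixels: the predicted label is wrong, yet its
    probability exceeds the acceptance threshold, so later stages would
    accept it without revisiting.
    Shapes: probs (C, H, W), gt_labels (H, W)."""
    pred = probs.argmax(axis=0)          # per-pixel predicted label
    conf = probs.max(axis=0)             # probability of that label
    return (conf > threshold) & (pred != gt_labels)
```

During training, such a mask (computed against ground truth) could serve as the binary target for an error-prediction branch; at test time the branch must predict it from features alone.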
