Abstract
Given their powerful feature representations for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on an analysis of the visual mechanism, we conclude that bottom-up DCNNs alone are not sufficient, because the semantic image segmentation task requires not only recognition but also visual attention capability. In this study, superpixels carrying visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with a conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results on the PASCAL VOC segmentation task qualitatively show that coarse edges and erroneous segmentations are clearly improved. We also quantitatively obtain an improvement of about 2%-3% in intersection over union (IOU) accuracy on the PASCAL VOC 2011 and 2012 test sets.
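To make the top-down refinement idea concrete, the sketch below snaps each superpixel to the majority label the DCNN predicted inside it. This is a minimal sketch of one common superpixel-refinement strategy, not the paper's exact mechanism; the use of SLIC, its parameters, and the majority-voting rule are assumptions for illustration.

```python
# Minimal sketch: refine a DCNN's per-pixel labels with superpixels
# by majority voting inside each superpixel. Illustrates the general
# refinement idea only; SLIC and the voting rule are assumptions,
# not the paper's architecture.
import numpy as np
from skimage.segmentation import slic

def refine_with_superpixels(image, label_map, n_segments=500):
    """image: HxWx3 float array in [0, 1]; label_map: HxW int array
    of DCNN-predicted class labels. Returns a refined HxW label map."""
    # Over-segment the image into superpixels (a top-down grouping cue).
    segments = slic(image, n_segments=n_segments, compactness=10,
                    start_label=0)
    refined = label_map.copy()
    for sp in np.unique(segments):
        mask = segments == sp
        # Assign every pixel in the superpixel the majority DCNN label,
        # which tends to sharpen coarse object boundaries.
        refined[mask] = np.bincount(label_map[mask]).argmax()
    return refined
```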
Highlights
Semantic image segmentation is one of the central tasks in computer vision
The fully convolutional network (FCN)-8s demonstrates impressive performance on the PASCAL Visual Object Class (VOC) benchmark, achieving a 20% relative improvement to 62.2% mean intersection over union (IOU, computed as in the sketch after this list) on the 2012 test set
To compare the mutual promotion of semantic labels and superpixels for segmentation, we test two architectures: one uses superpixels only to improve semantic labels, denoted deep convolutional neural network (DCNN)-Sp, and the other is the overall architecture that performs mutual promotion of semantic labels and superpixels, denoted DCNN-Sp-v2
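For reference, the mean IOU metric quoted above averages, over classes, the ratio of correctly labeled pixels to the union of predicted and ground-truth pixels for that class. A minimal sketch follows; the function name and confusion-matrix formulation are our own, but the metric matches the standard PASCAL VOC definition.

```python
# Minimal sketch of the mean intersection-over-union (IOU) metric,
# following the standard PASCAL VOC definition.
import numpy as np

def mean_iou(pred, gt, n_classes):
    """pred, gt: flat int arrays of pixel labels in [0, n_classes).
    Returns the mean IOU across classes."""
    # Confusion matrix: rows are ground truth, columns are predictions.
    cm = np.bincount(n_classes * gt + pred,
                     minlength=n_classes ** 2).reshape(n_classes, n_classes)
    intersection = np.diag(cm)                    # true positives per class
    union = cm.sum(0) + cm.sum(1) - intersection  # TP + FP + FN per class
    # Ignore classes absent from both prediction and ground truth.
    valid = union > 0
    return (intersection[valid] / union[valid]).mean()
```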
Summary
Semantic image segmentation is one of the central tasks in computer vision. Compared with image classification, which labels at the image level, semantic image segmentation must assign a semantic label to each pixel. Classifying region proposals and refining their labels to obtain the final segmentation is a common technique. Carreira et al. [1] used constrained parametric min-cuts [2] to generate 150 region proposals per image and classified each region using variants of the scale-invariant feature transform and local binary patterns. Jimei et al. [3] presented a scalable scene parsing algorithm based on image retrieval and superpixel matching, and obtained good performance. Tighe et al. [4] combined region-level features with per-exemplar sliding window detectors to interpret a scene. Despite being the focus of considerable attention, the task remains challenging.