Encoder–decoder with double spatial pyramid for semantic segmentation

Huifang Kong,Lei Fan,Jie Hu,Yao Fang,Xiaoxue Zhang

doi:10.1117/1.jei.28.6.063007

Abstract

Semantic segmentation, as a dense pixelwise classification task, is of great significance to scene understanding. Many approaches based on convolutional neural network still suffer from two kinds of challenges: (1) insufficient semantic information results in semantic obfuscation between similar categories, (2) loss of spatial information leads to inaccurate location of inconspicuous objects. To tackle these challenges, we design a network with an encoder–decoder architecture based on two proposed modules: global pyramid attention module (GPAM) and pyramid decoder module (PDM). Specifically, GPAM exploits an attention mechanism as global prior knowledge to adaptively capture discriminative features for enhancing semantic representation, and PDM employs small convolutions connected in parallel to predict adjacent position relationships for refining spatial information. A series of ablation experiments are conducted to demonstrate the effectiveness of our designs, and our network achieves a mean intersection over union score of 83.4% on PASCAL VOC 2012 dataset and 78.5% on Cityscapes dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Encoder–decoder with double spatial pyramid for semantic segmentation

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging

Lead the way for us

Journal: Journal of Electronic Imaging	Publication Date: Dec 3, 2019
Citations: 2

Similar Papers

Image Semantic Space Segmentation Based on Cascaded Feature Fusion and Asymmetric Convolution Module
Xiaojuan Li ... Xingmin Ma
Wireless Communications and Mobile Computing | VOL. 2022
Xiaojuan Li, et. al.Xiaojuan Li ... Xingmin Ma
20 Apr 2022
Wireless Communications and Mobile Computing | VOL. 2022

Real-Time Semantic Understanding and Segmentation of Urban Scenes for Vehicle Visual Sensors by Optimized DCNN Algorithm
Yanyi Li ... Jian Shi
Applied Sciences | VOL. 12
Yanyi Li, et. al.Yanyi Li ... Jian Shi
03 Aug 2022
Applied Sciences | VOL. 12

E-HRNet: Enhanced Semantic Segmentation Using Squeeze and Excitation
Jin-Seong Kim ... Chun-Bo Sim
Electronics | VOL. 12
Jin-Seong Kim, et. al.Jin-Seong Kim ... Chun-Bo Sim
27 Aug 2023
Electronics | VOL. 12

Implementation of a Lightweight Semantic Segmentation Algorithm in Road Obstacle Detection.
Bushi Liu ... Yang Gu
Sensors (Basel, Switzerland) | VOL. 20
Bushi Liu, et. al.Bushi Liu ... Yang Gu
10 Dec 2020
Sensors (Basel, Switzerland) | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Encoder–decoder with double spatial pyramid for semantic segmentation

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging