Abstract
Boundary-pixel blur and class imbalance are common problems in semantic segmentation of urban remote sensing images. Inspired by DenseU-Net, this paper proposes a new end-to-end network, SiameseDenseU-Net. First, the network takes both the true orthophoto (TOP) image and its corresponding normalized digital surface model (nDSM) as inputs. Deep image features are extracted in parallel by downsampling blocks, and shallow texture information and high-level abstract semantic features are fused through the connection channels. The features extracted by the two parallel processing chains are then fused. Finally, a softmax layer performs prediction to generate dense label maps. Experiments on the Vaihingen dataset show that SiameseDenseU-Net improves the F1-score by 8.2% and 7.63% over the Hourglass-ShapeNetwork (HSN) model and the U-Net model, respectively. Regarding boundary pixels, when using the same focal loss function weighted by median frequency balancing, SiameseDenseU-Net improves the F1-score of the small-target "car" category by 0.92% over the original DenseU-Net. The overall accuracy and the average F1-score also improve to varying degrees. The proposed SiameseDenseU-Net is better at identifying small-target categories and boundary pixels, and it is both numerically and visually superior to the comparison models.
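The loss named in the abstract is a focal loss whose per-class weights come from median frequency balancing. The paper provides no code, so the following PyTorch sketch is only an illustrative reading: the helper names are ours and `gamma=2.0` is the common focal-loss default, not a value confirmed by the authors.

```python
import torch
import torch.nn.functional as F

def median_frequency_weights(pixel_counts: torch.Tensor) -> torch.Tensor:
    """Median frequency balancing: w_c = median(freq) / freq_c, where
    freq_c is the fraction of training pixels labelled with class c."""
    freqs = pixel_counts.float() / pixel_counts.sum()
    return freqs.median() / freqs

def balanced_focal_loss(logits, target, class_weights, gamma=2.0):
    """Focal loss with median-frequency class weights.
    logits: (N, C, H, W) raw scores; target: (N, H, W) integer labels."""
    log_p = F.log_softmax(logits, dim=1)
    # per-pixel weighted cross entropy, kept unreduced for the focal term
    ce = F.nll_loss(log_p, target, weight=class_weights, reduction="none")
    p_t = log_p.gather(1, target.unsqueeze(1)).squeeze(1).exp()  # prob. of true class
    return ((1.0 - p_t) ** gamma * ce).mean()
```

In training, `pixel_counts` would be the per-class pixel totals over the training set; a rare class such as "car" receives the largest weight, which is what lets the loss emphasize small targets.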
Highlights
Semantic segmentation is an important problem in the field of computer vision
We further explore the potential of convolutional neural networks (CNNs) for end-to-end semantic segmentation of high-resolution remote sensing images
SiameseDenseU-Net uses two similar parallel DenseU-Nets, each composed of an encoder and a decoder. The encoder consists of five consecutive downsampling blocks that double the number of feature dimensions, while the decoder consists of five consecutive upsampling blocks that halve the number of feature dimensions. The input features pass through the downsampling blocks to capture context and obtain a hierarchical representation, and the upsampling blocks recover the resolution of the extracted features, restoring the spatial position information lost by the encoder (a minimal sketch of this two-branch structure is given below)
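The authors do not release code, so the following PyTorch skeleton is a hedged reading of the highlight above: `base=32`, plain convolution blocks in place of the paper's densely connected blocks, and channel concatenation as the fusion step are all our illustrative assumptions.

```python
import torch
import torch.nn as nn

class DownBlock(nn.Module):
    """Downsampling block: convolution doubles the channels, pooling halves
    the resolution. (Plain convs stand in for the paper's dense blocks.)"""
    def __init__(self, c_in, c_out):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(c_in, c_out, 3, padding=1),
            nn.BatchNorm2d(c_out),
            nn.ReLU(inplace=True))
        self.pool = nn.MaxPool2d(2)

    def forward(self, x):
        f = self.conv(x)
        return f, self.pool(f)          # f is kept for the skip connection

class UpBlock(nn.Module):
    """Upsampling block: transposed conv halves the channels and doubles the
    resolution, then the matching encoder feature is fused via the skip path."""
    def __init__(self, c_in, c_out):
        super().__init__()
        self.up = nn.ConvTranspose2d(c_in, c_out, 2, stride=2)
        self.conv = nn.Sequential(
            nn.Conv2d(c_out * 2, c_out, 3, padding=1),
            nn.ReLU(inplace=True))

    def forward(self, x, skip):
        x = self.up(x)
        return self.conv(torch.cat([x, skip], dim=1))

class Stream(nn.Module):
    """One encoder-decoder branch: five down blocks, five up blocks."""
    def __init__(self, c_in, base=32):
        super().__init__()
        chans = [base * 2 ** i for i in range(5)]      # channels double per stage
        self.downs = nn.ModuleList(
            DownBlock(ci, co) for ci, co in zip([c_in] + chans[:-1], chans))
        self.bottom = nn.Conv2d(chans[-1], chans[-1] * 2, 3, padding=1)
        self.ups = nn.ModuleList(UpBlock(c * 2, c) for c in reversed(chans))

    def forward(self, x):
        skips = []
        for down in self.downs:
            f, x = down(x)
            skips.append(f)
        x = self.bottom(x)
        for up, skip in zip(self.ups, reversed(skips)):
            x = up(x, skip)
        return x                                       # (N, base, H, W)

class SiameseDenseUNet(nn.Module):
    """Two parallel streams (TOP image and nDSM); their outputs are fused by
    concatenation, and a 1x1 convolution produces per-class scores."""
    def __init__(self, n_classes, base=32):
        super().__init__()
        self.top_stream = Stream(c_in=3, base=base)    # orthophoto branch
        self.ndsm_stream = Stream(c_in=1, base=base)   # nDSM branch
        self.classify = nn.Conv2d(base * 2, n_classes, 1)

    def forward(self, top, ndsm):
        fused = torch.cat([self.top_stream(top), self.ndsm_stream(ndsm)], dim=1)
        return self.classify(fused)   # softmax over dim=1 gives the dense label map
```

As a quick shape check, with inputs whose sides are divisible by 32, `SiameseDenseUNet(n_classes=6)(torch.rand(1, 3, 256, 256), torch.rand(1, 1, 256, 256))` returns a `(1, 6, 256, 256)` score map; six classes matches the Vaihingen benchmark.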
Summary
Semantic segmentation is an important problem in the field of computer vision. Image semantic segmentation aims to assign to each pixel in an image the most appropriate class label drawn from a predefined, limited set of labels. In 2012, the AlexNet network proposed by Krizhevsky et al. [1] sparked a new wave of deep learning applications in imaging. Tsogkas and Kokkinos [2] combined a convolutional neural network (CNN) with a fully connected conditional random field (CRF) to learn the lost prior information. Long et al. [5] proposed the fully convolutional network (FCN) to classify images at the pixel level. Unlike a classic CNN, an FCN can accept an input image of any size and restore the output to the same size as the input, as the toy sketch below illustrates.
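To make the FCN idea concrete: replacing fully connected layers with a 1x1 convolutional classifier and a final upsampling step is what lets the network handle arbitrary input sizes. The toy module below is only a minimal illustration of that mechanism, not Long et al.'s actual architecture.

```python
import torch.nn as nn
import torch.nn.functional as F

class TinyFCN(nn.Module):
    """Toy fully convolutional head: with no fully connected layers, any input
    size works, and the score map is upsampled back to the input resolution."""
    def __init__(self, n_classes):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.MaxPool2d(2))
        self.score = nn.Conv2d(32, n_classes, 1)  # 1x1 conv replaces the FC classifier

    def forward(self, x):
        h, w = x.shape[-2:]
        y = self.score(self.features(x))          # (N, C, h/4, w/4)
        return F.interpolate(y, size=(h, w), mode="bilinear", align_corners=False)
```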