Abstract

In this paper, we focus on the problem of contextual aggregation in the semantic segmentation of aerial images. Current contextual aggregation methods aggregate contextual information only within specific regions to improve feature representation, which may yield contextual information that is not robust. To address this problem, we propose a novel multi-level context refinement network (MLCRNet) that aggregates three levels of contextual information effectively and efficiently in an adaptive manner. First, we design a local-level context aggregation module to capture local information around each pixel. Second, we integrate multiple levels of context, namely local-level, image-level, and semantic-level, to dynamically aggregate contextual information from a comprehensive perspective. Third, we propose an efficient multi-level context transform (EMCT) module to reduce feature redundancy and improve the efficiency of our multi-level contexts. Finally, based on the EMCT module and the feature pyramid network (FPN) framework, we propose a multi-level context feature refinement (MLCR) module that enhances feature representation by leveraging multi-level contextual information. Extensive experiments demonstrate that our MLCRNet achieves state-of-the-art performance on the ISPRS Potsdam and Vaihingen datasets.
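
To illustrate the general idea of combining local-level, image-level, and semantic-level context, the following is a minimal PyTorch sketch, not the authors' implementation: it assumes local context via a 3x3 convolution, image context via global average pooling, and semantic context via soft class-region pooling from a coarse classifier. All module and variable names (e.g., `MultiLevelContextSketch`) are illustrative assumptions.

```python
# Minimal sketch (not the paper's code) of aggregating three context levels
# for a feature map: local-level (3x3 convolution), image-level (global
# average pooling), and semantic-level (soft class-region pooling).
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiLevelContextSketch(nn.Module):
    def __init__(self, channels: int, num_classes: int):
        super().__init__()
        # Local-level context: small receptive field around each pixel.
        self.local = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        # Image-level context: one vector describing the whole scene.
        self.image = nn.Conv2d(channels, channels, kernel_size=1)
        # Coarse classifier used to form semantic-level (class-region) context.
        self.classifier = nn.Conv2d(channels, num_classes, kernel_size=1)
        # Fuse the original feature with the three aggregated contexts.
        self.fuse = nn.Conv2d(channels * 4, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape

        # Local-level context around each pixel.
        local_ctx = self.local(x)

        # Image-level context, broadcast back over the spatial grid.
        image_ctx = self.image(F.adaptive_avg_pool2d(x, 1)).expand(-1, -1, h, w)

        # Semantic-level context: soft class regions pool the features,
        # then each pixel gathers the context of the classes it belongs to.
        probs = torch.softmax(self.classifier(x), dim=1)        # (b, k, h, w)
        flat_p = probs.flatten(2)                               # (b, k, hw)
        flat_x = x.flatten(2).transpose(1, 2)                   # (b, hw, c)
        class_ctx = torch.bmm(flat_p, flat_x)                   # (b, k, c)
        class_ctx = class_ctx / (flat_p.sum(dim=2, keepdim=True) + 1e-6)
        semantic_ctx = torch.bmm(flat_p.transpose(1, 2), class_ctx)  # (b, hw, c)
        semantic_ctx = semantic_ctx.transpose(1, 2).reshape(b, c, h, w)

        # Refine the input feature with all three context levels.
        return self.fuse(torch.cat([x, local_ctx, image_ctx, semantic_ctx], dim=1))


if __name__ == "__main__":
    feats = torch.randn(2, 64, 32, 32)            # toy backbone features
    module = MultiLevelContextSketch(64, num_classes=6)
    print(module(feats).shape)                    # torch.Size([2, 64, 32, 32])
```

In the paper's framework, such a refinement block would sit on the decoder levels of an FPN; the sketch above only conveys how the three context levels can be aggregated and fused for a single feature map.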
