Abstract

In the field of remote sensing, the classification of land cover is a pivotal and challenging issue. Standard models fail to capture global and semantic information in remote sensing images despite the fact that a convolutional neural network provides robust support for semantic segmentation. In addition, owing to disparities in semantic levels and spatial resolution, the simple fusion of low-level and high-level features may diminish the efficiency. To address these deficiencies, an attention-guided multi-level feature fusion network (AMFFNet) is proposed in this study. The proposed AMFFNet approach is designed as an encoder–decoder network with the inclusion of a multi-level feature fusion module (MFF) and a dual attention map module (DAM). A DAM models the semantic association of features from a spatial and channel perspective, and an MFF bridges the semantic and resolution gaps between high-level and low-level features. Furthermore, we propose a residual-based boundary refinement upsample module to further optimize the object boundaries. The experimental results indicate that the proposed strategy can considerably enhance the accuracy of land cover classification, achieving a mean intersection over union of 90.39% on the LandCover.ai dataset and 63.14% on the Gaofen Image Dataset with 15 categories (GID-15).

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call