Global Semantics Research Articles

The encoder-decoder structure is the basic structure of most semantic segmentation models and is adopted by a large number of segmentation models. How to effectively extract image features and achieve high-precision mapping through the optimal design of encoder and decoder is the key issue of current research. SegFormer designs an encoder with excellent performance, which fully extracts the feature information of different semantic granularity in the image with a large receptive field. Even if a simple fully connected layer decoder is used, excellent segmentation results can also be achieved. However, this simplified decoder does not make full use of the advantages of the SegFormer encoder. Therefore, a decoder structure with dual-path multi-scale feature fusion is designed in this paper, and the decoder is redesigned according to the characteristics of the SegFormer encoder. The decoder adopts a dual-path structure, one path passes the abstract global information layer by layer to the local detail information through the layer-by-layer upsampling fusion module (LFM), and gradually upsamples the feature maps obtained from the encoder, and then use the channel fusion module to learn the importance of different channels in the deep abstract semantic feature map and the shallow local detail feature map, and perform dynamic fusion to obtain a feature map containing both abstract semantic information and local details. The other path takes advantage of the large receptive field of the feature map output by the SegFormer encoder, and uses the weighted hybrid multi-scale feature extraction module (WMF) to extract multi-scale features containing global semantics from the deep semantic feature map finally output by the encoder. Finally, the Deep Feature Fusion Module (DFM) is used to fuse the outputs of the first two modules, fully mining the multi-scale global information in the encoder, and obtaine the feature maps with rich semantic information, which effectively improves the algorithm model performance.

• Multi-resolution feature extractor for strong-semantic feature encoding. • Dual-attention module for semantic-relevant feature promotion. • Semantic enhancement principle for target-oriented feature representation. • Hierarchical attentive network for high-quality water body extraction. Water is a kind of vital natural resource, which acts as the lifeblood of the ecosystem and the energy source for the living and production activities of humans. Regularly mapping the conditions of water resources and taking effective measures to prevent them from pollutions and shortages are very important and necessary to maintain the sustainability of the ecosystem. As a preliminary step for image-based water resource analysis, the complete recognition and accurate extraction of water bodies are important prerequisites in many applications. Nevertheless, due to the issues of topology diversities, appearance variabilities, and land cover interferences, there is still a large gap to achieve the human-level water bodies interpretation quality. This paper presents a hierarchical attentive high-resolution network, abbreviated as WaterHRNet, for extracting water bodies from remote sensing imagery. First, by building a multibranch high-resolution feature extractor integrated with global feature semantics aggregation, the WaterHRNet behaves laudably to supply high-quality, strong-semantic feature representations. Furthermore, by inlaying an effective feature attention scheme with the comprehensive exploitation of both the spatial and channel feature significances, the WaterHRNet is forced to strengthen the semantic-determinate, task-aware feature encodings. In addition, by designing a hierarchical processing principle with the progressive enhancement of category-attentive feature semantics, the WaterHRNet performs effectively to export semantic-discriminative, target-oriented feature representations for precise water body segmentation. The WaterHRNet is elaborately verified both quantitatively and qualitatively on three remote sensing datasets. Evaluation results show that the WaterHRNet achieves an average precision of 98.44%, average recall of 97.84%, average IoU of 96.35%, and average F 1 -score of 98.14%. Comparative analyses also demonstrate the superior performance and excellent feasibility of the WaterHRNet in segmenting water bodies.

Global Semantics Research Articles

Related Topics

Articles published on Global Semantics

Continuous frame motion sensitive self-supervised collaborative network for video representation learning

Global-and-Local Collaborative Learning for Co-Salient Object Detection.

An Online Hashing Algorithm for Image Retrieval Based on Optical-Sensor Network.

Bishift Networks for Thick Cloud Removal with Multitemporal Remote Sensing Images

Adaptive Text Denoising Network for Image Caption Editing

Human Co-Parsing Guided Alignment for Occluded Person Re-identification.

A Dual-Path Multi-Scale Feature Fusion Decoder for SegFormer

An Unsupervised Domain Adaptation Model Based on Multi-Level Joint Alignment for Multi-Modal Cardiac Image Segmentation

VCGAN: Video Colorization With Hybrid Generative Adversarial Network

Основные языковые источники современной терминологии

WaterHRNet: A multibranch hierarchical attentive network for water body extraction with remote sensing images

Components.js: Semantic dependency injection

Transformer-Based Model with Dynamic Attention Pyramid Head for Semantic Segmentation of VHR Remote Sensing Imagery.

Triplet Contrastive Learning for Aspect Level Sentiment Classification

IAF-LG: An Interactive Attention Fusion Network With Local and Global Perspective for Aspect-Based Sentiment Analysis

Further Exploration of Deep Aggregation for Shadow Detection

Subgraph-based feature fusion models for semantic similarity computation in heterogeneous knowledge graphs

Self-supervised graph representation learning via positive mining

Semantic-aware network embedding via optimized random walk and paragaraph2vec

Global Types and Event Structure Semantics for Asynchronous Multiparty Sessions

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Global Semantics Research Articles

Related Topics

Articles published on Global Semantics

Continuous frame motion sensitive self-supervised collaborative network for video representation learning

Global-and-Local Collaborative Learning for Co-Salient Object Detection.

An Online Hashing Algorithm for Image Retrieval Based on Optical-Sensor Network.

Bishift Networks for Thick Cloud Removal with Multitemporal Remote Sensing Images

Adaptive Text Denoising Network for Image Caption Editing

Human Co-Parsing Guided Alignment for Occluded Person Re-identification.

A Dual-Path Multi-Scale Feature Fusion Decoder for SegFormer

An Unsupervised Domain Adaptation Model Based on Multi-Level Joint Alignment for Multi-Modal Cardiac Image Segmentation

VCGAN: Video Colorization With Hybrid Generative Adversarial Network

Основные языковые источники современной терминологии

WaterHRNet: A multibranch hierarchical attentive network for water body extraction with remote sensing images

Components.js: Semantic dependency injection

Transformer-Based Model with Dynamic Attention Pyramid Head for Semantic Segmentation of VHR Remote Sensing Imagery.

Triplet Contrastive Learning for Aspect Level Sentiment Classification

IAF-LG: An Interactive Attention Fusion Network With Local and Global Perspective for Aspect-Based Sentiment Analysis

Further Exploration of Deep Aggregation for Shadow Detection

Subgraph-based feature fusion models for semantic similarity computation in heterogeneous knowledge graphs

Self-supervised graph representation learning via positive mining

Semantic-aware network embedding via optimized random walk and paragaraph2vec

Global Types and Event Structure Semantics for Asynchronous Multiparty Sessions