Multi-scale Features Research Articles

Winter wheat is one of the major global food crops, and monitoring and managing its planting area is of great significance for agricultural production and food security worldwide. The development of high-resolution remote sensing imaging technology has provided rich sources of data for extracting the visual planting information of winter wheat. However, existing research mostly focuses on extracting planting plots that have a simple terrain structure. In the face of diverse terrain features combining mountainous areas, plains, and saline-alkali land, as well as small-scale but complex planting structures, extracting planting plots from remote sensing imagery faces great challenges in terms of recognition accuracy and model complexity. In this paper, we propose a modified Segformer model for extracting winter wheat planting plots with complex structures in rural areas based on the 0.8 m high-resolution multispectral data obtained from the Gaofen-2 satellite, which significantly improves the extraction accuracy and efficiency under complex conditions. In the encoder and decoder of this method, new modules were developed to optimize the feature extraction and fusion process. Specifically, the improvements of the proposed method include: (1) The MixFFN module in the original Segformer model is replaced with the Multi-Scale Feature Fusion Fully-connected Network (MSF-FFN) module, which enhances the model’s representation ability in handling complex terrain features through multi-scale feature extraction and position embedding convolution; furthermore, the DropPath mechanism is introduced to reduce the possibility of overfitting while improving the model’s generalization ability.
(2) In the decoder, after fusing features at four different scales, a CoordAttention module is added, which uses the coordinate attention mechanism to precisely locate important regions with enhanced features in the images, thereby further improving the model’s extraction accuracy. (3) The model’s input data are augmented with multispectral indices, which also help improve the overall extraction accuracy. The experimental results show that the accuracy of the modified Segformer model in extracting winter wheat planting plots is significantly higher than that of traditional segmentation models, with the mean Intersection over Union (mIoU) and mean Pixel Accuracy (mPA) reaching 89.88% and 94.67%, respectively (an increase of 1.93 and 1.23 percentage points, respectively, compared to the baseline model). Meanwhile, the parameter count and computational complexity are significantly reduced compared to other similar models. Furthermore, when multispectral indices are input into the model, the mIoU and mPA reach 90.97% and 95.16%, respectively (an increase of 3.02 and 1.72 percentage points, respectively, compared to the baseline model).
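
The two metrics reported above, mean Intersection over Union and mean Pixel Accuracy, can be illustrated with a minimal sketch. It assumes the common definitions (per-class IoU and per-class recall averaged over the classes that appear); the abstract does not say how the authors computed them, and the function names `confusion_matrix` and `mean_iou_and_mpa` are hypothetical.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are true labels, columns are predicted labels."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou_and_mpa(cm):
    """Return (mIoU, mPA) from a square confusion matrix.

    Assumed definitions (not taken from the paper):
      IoU_c = TP_c / (TP_c + FP_c + FN_c)
      PA_c  = TP_c / (TP_c + FN_c)
    Classes with no pixels at all are skipped in the averages.
    """
    n = len(cm)
    ious, accs = [], []
    for c in range(n):
        tp = cm[c][c]
        gt = sum(cm[c])                          # pixels whose true label is c
        pred = sum(cm[r][c] for r in range(n))   # pixels predicted as c
        union = gt + pred - tp
        if union > 0:
            ious.append(tp / union)
        if gt > 0:
            accs.append(tp / gt)
    if not ious or not accs:
        raise ValueError("confusion matrix contains no labelled pixels")
    return sum(ious) / len(ious), sum(accs) / len(accs)


# Toy example with flattened label maps (0 = background, 1 = winter wheat).
cm = confusion_matrix([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
miou, mpa = mean_iou_and_mpa(cm)
```

For this toy input the per-class IoUs are 1/2 and 2/3 and the per-class accuracies are 1/2 and 1, so the sketch yields an mIoU of 7/12 and an mPA of 0.75.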

Substantial advancements have been achieved in hyperspectral image (HSI) classification through contemporary deep learning techniques. Nevertheless, the incorporation of an excessive number of irrelevant tokens in large-scale remote sensing data results in inefficient long-range modeling. To overcome this hurdle, this study introduces the Group-Sensitive Selective Perception Transformer (GSAT) framework, which builds upon the Vision Transformer (ViT) to enhance HSI classification outcomes. The innovation of the GSAT architecture is primarily evident in several key aspects. Firstly, the GSAT incorporates a Group-Sensitive Pixel Group Mapping (PGM) module, which organizes pixels into distinct groups. This allows the global self-attention mechanism to function within these groupings, effectively capturing local interdependencies within spectral channels. This grouping tactic not only boosts the model’s spatial awareness but also lessens computational complexity, enhancing overall efficiency. Secondly, the GSAT addresses the detrimental effects of superfluous tokens on model efficacy by introducing the Sensitivity Selection Framework (SSF) module. This module selectively identifies the most pertinent tokens for classification purposes, thereby minimizing distractions from extraneous information and bolstering the model’s representational strength. Furthermore, the SSF refines local representation through multi-scale feature selection, enabling the model to more effectively encapsulate feature data across various scales. Additionally, the GSAT architecture adeptly represents both global and local features of HSI data by merging global self-attention with local feature extraction. This integration strategy not only elevates classification precision but also enhances the model’s versatility in navigating complex scenes, particularly in urban mapping scenarios where it significantly outclasses previous deep learning methods. 
The advent of the GSAT architecture not only rectifies the inefficiencies of traditional deep learning approaches in processing extensive remote sensing imagery but also markedly enhances the performance of HSI classification tasks through the deployment of group-sensitive and selective perception mechanisms. It presents a novel viewpoint within the domain of hyperspectral image classification and is poised to propel further advancements in the field. Empirical testing on six standard HSI datasets confirms the superior performance of the proposed GSAT method in HSI classification, especially within urban mapping contexts, where it exceeds the capabilities of prior deep learning techniques. In essence, the GSAT architecture markedly refines HSI classification by pioneering group-sensitive pixel group mapping and selective perception mechanisms, heralding a significant breakthrough in hyperspectral image processing.
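
The idea of keeping only the most pertinent tokens can be sketched as a generic top-k selection. This is not the SSF module itself: the abstract does not describe how tokens are scored, so the relevance scores here are simply given as input, and the function name `select_top_k_tokens` is hypothetical.

```python
def select_top_k_tokens(tokens, scores, k):
    """Keep the k highest-scoring tokens, preserving their original order.

    `scores` stands in for whatever relevance measure a model would learn;
    how GSAT actually scores tokens is not stated in the abstract.
    """
    if len(tokens) != len(scores):
        raise ValueError("tokens and scores must have the same length")
    if not 0 < k <= len(tokens):
        raise ValueError("k must be between 1 and the number of tokens")
    # Rank token indices by score, highest first, and keep the first k.
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)[:k]
    keep = sorted(ranked)  # restore the original sequence order
    return keep, [tokens[i] for i in keep]


# Toy example: four tokens, keep the two with the highest scores.
keep, kept = select_top_k_tokens(["a", "b", "c", "d"], [0.1, 0.9, 0.3, 0.7], k=2)
```

In this toy example the kept indices are 1 and 3, so later attention layers would see two tokens instead of four, which is where the efficiency gain of token selection comes from.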

Articles published on Multi-scale Features

Enhanced semantic-positional feature fusion network via diverse pre-trained encoders for remote sensing image water-body segmentation

An Improved Multi-Scale Feature Extraction Network for Rice Disease and Pest Recognition

HRTBDA: a network for post-disaster building damage assessment based on remote sensing images

MDAR: A Multiscale Features-Based Network for Remotely Measuring Human Heart Rate Utilizing Dual-Branch Architecture and Alternating Frame Shifts in Facial Videos.

IVA-former: invisible–visible query guided amodal mask measurement network for desktop object via hierarchical transformer

Electrical load forecasting based on the fusion of multi-scale features extracted by using neural ordinary differential equation

GCS-YOLOv8: A Lightweight Face Extractor to Assist Deepfake Detection.

Remote Sensing LiDAR and Hyperspectral Classification with Multi-Scale Graph Encoder–Decoder Network

A novel intelligent fault diagnosis method for gearbox based on multi-dimensional attention denoising convolution.

A multi-scale feature extraction and fusion-based model for retinal vessel segmentation in fundus images.

Cascade contour-enhanced panoptic segmentation for robotic vision perception.

Classification, Localization and Quantization of Eddy Current Detection Defects in CFRP Based on EDC-YOLO.

YOLO-ESL: An Enhanced Pedestrian Recognition Network Based on YOLO

A Student Facial Expression Recognition Model Based on Multi-Scale and Deep Fine-Grained Feature Attention Enhancement.

Extraction of Winter Wheat Planting Plots with Complex Structures from Multispectral Remote Sensing Images Based on the Modified Segformer Model

High-resolution population mapping based on SDGSAT-1 glimmer imagery and deep learning: a case study of the Guangdong-Hong Kong-Macao Greater Bay Area

Multi-scale Adaptive Feature Fusion Hashing for Image Retrieval

Hyperspectral Image Classification Algorithm for Forest Analysis Based on a Group-Sensitive Selective Perceptual Transformer

Aero-engine defect detection by integrating attention and multi-scale features

Multi-scale dual-channel feature embedding decoder for biomedical image segmentation
