In power grid surveying, it is often necessary to fuse panchromatic and multispectral imagery for the design of power lines. Despite the abundance of deep learning networks for fusing these images, the results often suffer from spectral information loss or structural blurring. This study introduces a fusion model specifically tailored for power grid surveying that significantly enhances the representation of spatial-spectral features in remote sensing images. The model comprises three main modules: a TransforRS-Mamba module that integrates the sequence processing capabilities of the Mamba model with the attention mechanism of the Transformer to effectively merge spatial and spectral features; an improved spatial proximity-aware attention mechanism (SPPAM) that utilizes a spatial constraint matrix to greatly enhance the recognition of complex object relationships; and an optimized spatial proximity-constrained gated fusion module (SPCGF) that integrates spatial proximity constraints with residual connections to boost the recognition accuracy of key object features. To validate the effectiveness of the proposed method, extensive comparative and ablation experiments were conducted on GF-2 satellite images and the QuickBird (QB) dataset. Both qualitative and quantitative analyses indicate that our method outperforms 11 existing methods in terms of fusion effectiveness, particularly in reducing spectral distortion and spatial detail loss. However, the model's generalization performance across different data sources and environmental conditions has yet to be evaluated. Future research will explore the integration of various satellite datasets and assess the model's performance in diverse environmental contexts.
Read full abstract