Abstract

In recent years, learning algorithms based on deep convolution frameworks have gradually become the research hotspots in hyperspectral image classification tasks. However, in the classification process, high-dimensionality problems with large amounts of data and feature redundancy with interspectral correlation of hyperspectral images have not been solved efficiently. Therefore, this paper investigates data dimensionality reduction and feature extraction and proposes a novel multiscale dense cross-attention mechanism algorithm with covariance pooling (MDCA-CP) for hyperspectral image scene classification. The multisize convolution module can detect subtle changes in the hyperspectral images’ spatial and spectral dimensions between the pixels in the local areas and are suitable for extracting hyperspectral data with complex and diverse types of structures. For traditional algorithms that assign attention weights in a one-way manner, thus leading to the loss of feature information, the dense cross-attention mechanism proposed in this study can jointly distribute the attention weights horizontally and vertically to efficiently capture the most representative features. In addition, this study also uses covariance pooling to further extract the features of hyperspectral images from the second order. Experiments have been conducted on three well-known hyperspectral datasets, and the results thus obtained show that the MDCA-CP algorithm is superior compared to the other well-known methods.

Highlights

  • Hyperspectral remote sensing technology can generate a set of images with rich information

  • Because different categories of the same feature target exhibit different optical behavior and shown obvious differences in the band space, the feature type’s pixel level can be used for identification and classification. is technology is nowadays being used in agriculture [1,2,3], environmental remote sensing [4,5,6], physics [7, 8], ocean [9,10,11], and other fields, among which the research on hyperspectral image (HSI) classification is extremely important. e internal information of an image contains various types of features

  • Multiscale convolution can detect the spatial dimensions of HSIs. e subtle changes between the pixels in the local areas in the spectral dimension can be applied to the feature extraction of the hyperspectral data of complex and diverse types and structures

Read more

Summary

Introduction

Hyperspectral remote sensing technology can generate a set of images with rich information. Mobile Information Systems researchers have chosen to add local spatial connections to improve the models [16, 17] and have achieved positive results to a certain extent These methods are primarily based on manual and shallow models, which rely heavily on expert knowledge and have poor generalization ability, making it difficult to extract the representative discriminating features. The SAE-LR deep learning method has shown great potential in HSI classification, the autoencoder model usually flattens the local image patches into vectors and feeds them into the model This method destroys the image’s two-dimensional structure, resulting in the loss of spatial information and the training time is insufficient. Multiscale convolution can detect the spatial dimensions of HSIs. e subtle changes between the pixels in the local areas in the spectral dimension can be applied to the feature extraction of the hyperspectral data of complex and diverse types and structures.

Related Work
Methodology
Cross-Attention Mechanism
Experiments
Methods
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call