Abstract

Scene classification is an important tool for remote sensing image interpretation, and it has fundamental applications in research and industry. However, given complex backgrounds and scale variations, remote sensing images have large intraclass diversity and interclass similarity, which bring challenges to accurate classification of remote sensing images. We proposed a scene classification method using joint learning and multiscale attention to alleviate the aforementioned problems. To fully utilize the multiscale information of the image and improve the adaptability of the proposed method to objects with various sizes, different from general methods that fuse different scales of features for classification, joint learning using multiscale features is developed to optimize the whole network. Specifically, we leverage a pretrained deep convolutional neural network as the feature extractor to extract low-level, medium-level, and high-level feature maps from the images. Then, due to the poor semantics of low-level and medium-level feature maps compared with the high-level feature maps, we design a multiscale attention module to enhance the semantic information and suppress the noise information. Finally, the global mean pooling is used to obtain the feature vectors and different classifiers are used for different feature vectors. And the decision-level fusion is adopted to obtain more reliable predictions. The experimental results on the AID and NWPU-RESISC45 datasets show that the proposed method makes a significant improvement in terms of overall accuracies compared with the baselines. And the overall accuracies of our method on the two datasets are 97.49% and 95.20%, respectively, which achieves state-of-the-art performance. The code will be public at a Github repository available at https://github.com/Cbanyungong/JLMSAF.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call