Infrared imaging plays a vital role in critical surveillance, military reconnaissance, and industrial inspection applications due to its advantages such as strong concealment and the ability to operate around the clock. However, the combination of low infrared image resolution and complex background scenarios poses significant challenges for traditional deep learning models in accurately extracting the most discriminative features for classification. These models are often disrupted by irrelevant features, especially when data for new classes is scarce. Current few-shot learning approaches heavily rely on comparing image patches, but the scarcity of data can significantly degrade the performance of recognition algorithms. To address these challenges, we propose the Class Patch Similarity Weighted Embedding (CPSWE) framework for few-shot infrared target classification. The CPSWE framework employs a ViT architecture for feature extraction. By introducing class embeddings and calculating similarity-based weights for each patch, CPSWE reweights the patch features to enhance their relevance to the target class. This approach improves the discriminability of class-related features, leading to better generalization in few-shot settings. Furthermore, we introduce an infrared dataset specifically designed for few-shot learning, combining multiple open-source datasets to support research in this area. Extensive experiments on the few-shot learning benchmark dataset miniImageNet and the infrared dataset miniIRNet show that CPSWE outperforms existing few-shot learning methods, achieving significant improvements in classification accuracy on infrared image datasets with limited labeled samples.
Read full abstract