KLSANet: Key local semantic alignment Network for few-shot image classification

Zhe Sun,Pengfei Guo,Wang Zheng

doi:10.1016/j.neunet.2024.106456

Abstract

Few-shot image classification involves recognizing new classes with a limited number of labeled samples. Current local descriptor-based methods, while leveraging consistent low-level features across visible and invisible classes, face challenges including redundant adjacent information, irrelevant partial representation, and limited interpretability. This paper proposes KLSANet, a few-shot image classification approach based on key local semantic alignment network, which aligns key local semantics for accurate classification. Furthermore, we introduce a key local screening module to mitigate the influence of semantically irrelevant image parts on classification. KLSANet demonstrates superior performance on three benchmark datasets (CUB, Stanford Dogs, Stanford Cars), outperforming state-of-the-art methods in 1-shot and 5-shot settings with average improvements of 3.95% and 2.56% respectively. Visualization experiments demonstrate the interpretability of KLSANet predictions. Code is available at: https://github.com/ZitZhengWang/KLSANet.

Full Text