TST_MFL: Two-stage training based metric fusion learning for few-shot image classification

Zhe Sun,Wang Zheng,Pengfei Guo,Mingyang Wang

doi:10.1016/j.inffus.2024.102611

Abstract

Addressing the limitations of most few-shot learning (FSL) methods, particularly their insufficient single-feature discriminability and generalization during pre-training encoding, this paper introduces a novel approach: Two-Stage Training based Metric Fusion Learning (TST_MFL) for few-shot image classification. TST_MFL innovatively integrates a two-stage fusion metric learning strategy, enhancing both pre-training and meta-training phases to overcome the prevalent focus on single-level feature extraction in current few-shot image classification methods. In pre-training stage, we employ prediction distillation loss and local difference loss to facilitate mutual learning between global and local features, thus obtaining more discriminative features. Moreover, meta-training further enhances feature generalization and integrates global-local metrics to improve few-shot classification accuracy. Our approach demonstrates competitive performance across three fine-grained datasets and one coarse-grained dataset. Notably, experimental results in 1-shot and 5-shot settings of fine-grained datasets underscore the superiority of TST_MFL. The code for this work is available at: https://github.com/ZitZhengWang/TST_MFL.

Full Text