Abstract

Addressing the limitations of most few-shot learning (FSL) methods, particularly their insufficient single-feature discriminability and generalization during pre-training encoding, this paper introduces a novel approach: Two-Stage Training based Metric Fusion Learning (TST_MFL) for few-shot image classification. TST_MFL innovatively integrates a two-stage fusion metric learning strategy, enhancing both pre-training and meta-training phases to overcome the prevalent focus on single-level feature extraction in current few-shot image classification methods. In pre-training stage, we employ prediction distillation loss and local difference loss to facilitate mutual learning between global and local features, thus obtaining more discriminative features. Moreover, meta-training further enhances feature generalization and integrates global-local metrics to improve few-shot classification accuracy. Our approach demonstrates competitive performance across three fine-grained datasets and one coarse-grained dataset. Notably, experimental results in 1-shot and 5-shot settings of fine-grained datasets underscore the superiority of TST_MFL. The code for this work is available at: https://github.com/ZitZhengWang/TST_MFL.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.