Prior-knowledge and attention based meta-learning for few-shot learning

Yunxiao Qin,Weiguo Zhang,Chenxu Zhao,Zezheng Wang,Xiangyu Zhu,Jingping Shi,Guojun Qi,Zhen Lei

doi:10.1016/j.knosys.2020.106609

Abstract

Recently, meta-learning has been shown to be a promising way to solve few-shot learning. In this paper, inspired by the human cognition process, which utilizes both prior-knowledge and visual attention when learning new knowledge, we present a novel paradigm of meta-learning approach that capitalizes on three developments to introduce attention mechanism and prior-knowledge to meta-learning. In our approach, prior-knowledge is responsible for helping the meta-learner express the input data in a high-level representation space, and the attention mechanism enables the meta-learner to focus on key data features in the representation space. Compared with the existing meta-learning approaches that pay little attention to prior-knowledge and visual attention, our approach alleviates the meta-learner’s few-shot cognition burden. Furthermore, we discover a Task-Over-Fitting (TOF) problem,11When tested on J-shot classification tasks, the meta-learner trained on K-shot tasks does not perform as well as the one trained on J-shot tasks, where K and J are different unsigned integers denoting different numbers of shots for the meta-learner. which indicates that the meta-learner has poor generalization across different K-shot learning tasks. To model the TOF problem, we propose a novel Cross-Entropy across Tasks (CET) metric.22A metric for quantizing the extent to which a meta-learning method suffers from the TOF problem. Extensive experiments demonstrate that our techniques improve the meta-learner to state-of-the-art performance on several few-shot learning benchmarks while also substantially alleviating the TOF problem.

Full Text