Are LSTMs good few-shot learners?

Mike Huisman,Thomas M Moerland,Aske Plaat,Jan N Van Rijn

doi:10.1007/s10994-023-06394-x

Mike Huisman, Thomas M Moerland + Show 2 more

Open Access

https://doi.org/10.1007/s10994-023-06394-x

Copy DOI

Journal: Machine Learning	Publication Date: Sep 7, 2023
Citations: 6	License type: CC BY 4.0

Affiliation: Leiden University

Abstract

Deep learning requires large amounts of data to learn new tasks well, limiting its applicability to domains where such data is available. Meta-learning overcomes this limitation by learning how to learn. Hochreiter et al. (International conference on artificial neural networks, Springer, 2001) showed that an LSTM trained with backpropagation across different tasks is capable of meta-learning. Despite promising results of this approach on small problems, and more recently, also on reinforcement learning problems, the approach has received little attention in the supervised few-shot learning setting. We revisit this approach and test it on modern few-shot learning benchmarks. We find that LSTM, surprisingly, outperform the popular meta-learning technique MAML on a simple few-shot sine wave regression benchmark, but that LSTM, expectedly, fall short on more complex few-shot image classification benchmarks. We identify two potential causes and propose a new method called Outer Product LSTM (OP-LSTM) that resolves these issues and displays substantial performance gains over the plain LSTM. Compared to popular meta-learning baselines, OP-LSTM yields competitive performance on within-domain few-shot image classification, and performs better in cross-domain settings by 0.5–1.9% in accuracy score. While these results alone do not set a new state-of-the-art, the advances of OP-LSTM are orthogonal to other advances in the field of meta-learning, yield new insights in how LSTM work in image classification, allowing for a whole range of new research directions. For reproducibility purposes, we publish all our research code publicly.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Are LSTMs good few-shot learners?

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Similar Papers

Image classification based on few-shot learning algorithms: a review
Qiao Qi ... Azlin Ahmad
Indonesian Journal of Electrical Engineering and Computer Science | VOL. 35
Qiao Qi, et. al.Qiao Qi ... Azlin Ahmad
01 Aug 2024
Indonesian Journal of Electrical Engineering and Computer Science | VOL. 35

Few-shot image classification with composite rotation based self-supervised auxiliary task
Pratik Mazumder ... Vinay P Namboodiri
Neurocomputing | VOL. 489
Pratik Mazumder, et. al.Pratik Mazumder ... Vinay P Namboodiri
24 Feb 2022
Neurocomputing | VOL. 489

Few-Shot Image Classification: Current Status and Research Trends
Ying Liu ... Qi Tian
Electronics | VOL. 11
Ying Liu, et. al.Ying Liu ... Qi Tian
31 May 2022
Electronics | VOL. 11

Deep metric learning for few-shot image classification: A Review of recent developments
Xiaoxu Li ... Jing-Hao Xue
Pattern Recognition | VOL. 138
Xiaoxu Li, et. al.Xiaoxu Li ... Jing-Hao Xue
02 Feb 2023
Pattern Recognition | VOL. 138

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Are LSTMs good few-shot learners?

Abstract

Talk to us

Similar Papers

More From: Machine Learning