Amortized Bayesian Meta-Learning with Accelerated Gradient Descent Steps

Zhewei Zhang, Xuejing Li, Shengjin Wang

doi:10.3390/app13158653

Abstract

Recent meta-learning models often learn priors from observed tasks using a network optimized via stochastic gradient descent (SGD), which usually takes many training steps to converge. In this paper, we propose an accelerated Bayesian meta-learning structure with a stochastic inference network (ABML-SIN). The proposed model targets the training procedure of Bayesian meta-learning, aiming to improve training speed and efficiency. Current meta-learning approaches rarely converge within a few descent steps, owing to the small number of training samples. Therefore, we introduce an accelerated gradient descent learning network based on a teacher–student architecture to learn the meta-latent variable θ_t for task t. With this amortized fast inference network, the meta-learner can learn the task-specific latent θ_t within a few training steps, which improves its learning speed. To refine the latent variables generated by the transductive amortization network of the meta-learner, SIN, followed by a conventional SGD-optimized network, is introduced as the student–teacher network to update the parameters online. SIN extracts the local latent variables and accelerates the convergence of the meta-learning network. Our experiments on simulated data demonstrate that the proposed method generalizes and scales to unseen samples, and that it produces competitive or superior uncertainty estimates on few-shot learning tasks on two widely adopted 2D datasets with fewer training epochs than state-of-the-art meta-learning approaches. Furthermore, the parameters generated by SIN act as perturbations on the latent weights, which makes faster training of the meta-learner more likely. Extensive qualitative experiments show that our method performs well across different meta-learning tasks in both simulated and real-world circumstances.
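As a rough illustration of why the choice of descent rule affects the number of training steps, the sketch below compares plain gradient descent with Nesterov's accelerated gradient on a small, ill-conditioned quadratic. This is a generic textbook comparison and not the ABML-SIN method: the abstract does not state which acceleration rule the teacher–student network uses, and the objective, the names `CURVATURES`, `plain_gd_steps`, and `nesterov_steps`, and the tolerance are all assumptions made for this example.

```python
import math

# Hypothetical toy objective (an assumption, not from the paper):
# f(x) = 0.5 * sum(a_i * x_i^2), minimized at the origin.
CURVATURES = [1.0, 100.0]           # condition number 100
STEP_SIZE = 1.0 / max(CURVATURES)   # 1 / L, the usual safe step size


def grad(x):
    """Gradient of the toy quadratic at x."""
    return [a * xi for a, xi in zip(CURVATURES, x)]


def norm(x):
    """Euclidean norm, i.e. the distance to the minimizer at the origin."""
    return math.sqrt(sum(xi * xi for xi in x))


def plain_gd_steps(x0, tol=1e-6, max_steps=10_000):
    """Count plain gradient descent steps until ||x|| < tol."""
    x = list(x0)
    for step in range(1, max_steps + 1):
        g = grad(x)
        x = [xi - STEP_SIZE * gi for xi, gi in zip(x, g)]
        if norm(x) < tol:
            return step
    return max_steps


def nesterov_steps(x0, tol=1e-6, max_steps=10_000):
    """Count Nesterov accelerated gradient steps until ||x|| < tol."""
    kappa = max(CURVATURES) / min(CURVATURES)
    # Constant momentum commonly used for strongly convex problems.
    beta = (math.sqrt(kappa) - 1.0) / (math.sqrt(kappa) + 1.0)
    x_prev, x = list(x0), list(x0)
    for step in range(1, max_steps + 1):
        # Look-ahead point, then a gradient step taken from it.
        y = [xi + beta * (xi - xpi) for xi, xpi in zip(x, x_prev)]
        g = grad(y)
        x_prev, x = x, [yi - STEP_SIZE * gi for yi, gi in zip(y, g)]
        if norm(x) < tol:
            return step
    return max_steps


if __name__ == "__main__":
    start = [1.0, 1.0]
    print("plain GD steps:   ", plain_gd_steps(start))
    print("Nesterov steps:   ", nesterov_steps(start))
```

On this toy problem the accelerated rule reaches the tolerance in far fewer steps than plain gradient descent, which is the general effect the abstract appeals to; how ABML-SIN achieves its speed-up is described in the paper itself.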

Journal: Applied Sciences
Publication Date: Jul 27, 2023
License type: CC BY 4.0

Similar Papers

The Implementation of Gradient Descent Based Methods Using Parallel Computing in R for Regression Tasks
Lala Septem Riza ... Muhammad Aziz Ashari
01 Aug 2018

Learning to Class-Adaptively Manipulate Embeddings for Few-Shot Learning
Fei Zhou ... Lei Zhang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33
01 Sep 2023

Dataset artificial augmentation with a small number of training samples for reflectance estimation.
Jingjing Zhang ... Yuke He
Optics Express | VOL. 31
17 Feb 2023

Does Few-Shot Learning Suffer from Backdoor Attacks?
Xinwei Liu ... Xiaochun Cao
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024
