Learning to Generate Parameters of ConvNets for Unseen Image Data.

Shiye Wang, Kaituo Feng, Changsheng Li, Ye Yuan, Guoren Wang

doi:10.1109/tip.2024.3445731

Abstract

Typical convolutional neural networks (ConvNets) depend heavily on large amounts of image data and rely on an iterative optimization algorithm (e.g., SGD or Adam) to learn network parameters, which makes training very time- and resource-intensive. In this paper, we propose a new training paradigm that formulates the parameter learning of ConvNets as a prediction task. Given that correlations exist between image datasets and the corresponding optimal parameters of a given ConvNet, we explore whether a hyper-mapping between them can be learned to capture these relations, so that the parameters of the network can be predicted directly for an image dataset never seen during training. To this end, we put forward a new hypernetwork-based model, called PudNet, which learns a mapping between datasets and their corresponding network parameters and then predicts parameters for unseen data with only a single forward propagation. Moreover, our model benefits from a series of adaptive hyper-recurrent units that share weights to capture the dependencies of parameters among different network layers. Extensive experiments demonstrate that the proposed method achieves good efficacy on unseen image datasets in two settings: intra-dataset prediction and inter-dataset prediction. PudNet also scales well to large datasets such as ImageNet-1K. Training ResNet-18 on ImageNet-1K from scratch using GC takes 8,967 GPU seconds and obtains a top-5 accuracy of 44.65%. In contrast, PudNet takes only 3.89 GPU seconds to predict the network parameters of ResNet-18 and achieves comparable performance (44.92%), which is more than 2,300 times faster than the traditional training paradigm.
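The idea of predicting parameters in one forward pass can be illustrated with a toy sketch. The code below is not PudNet's architecture. It is a minimal NumPy illustration under stated assumptions: a dataset is summarized by the mean of its flattened images, a single recurrent unit with shared weights is stepped once per target layer, and a per-layer linear head emits that layer's weights. All names (`dataset_embedding`, `ToyHyperNet`, the dimensions) are hypothetical, and the hypernetwork weights are random rather than trained.

```python
import numpy as np


def dataset_embedding(images):
    """Summarize a dataset of flattened images (N, D) as one vector (D,).

    Assumption: a plain mean stands in for a learned dataset encoder.
    """
    return images.mean(axis=0)


class ToyHyperNet:
    """Toy hypernetwork: dataset embedding -> weights for each target layer."""

    def __init__(self, embed_dim, hidden_dim, layer_shapes, seed=0):
        rng = np.random.default_rng(seed)
        self.layer_shapes = layer_shapes
        # Recurrent unit weights, shared across all target layers.
        self.w_in = rng.normal(0.0, 0.1, (hidden_dim, embed_dim))
        self.w_h = rng.normal(0.0, 0.1, (hidden_dim, hidden_dim))
        # One linear head per layer, because layer sizes differ.
        self.heads = [
            rng.normal(0.0, 0.1, (int(np.prod(shape)), hidden_dim))
            for shape in layer_shapes
        ]

    def predict(self, z):
        """Single forward pass that returns one weight tensor per layer."""
        h = np.zeros(self.w_h.shape[0])
        params = []
        for shape, head in zip(self.layer_shapes, self.heads):
            # The hidden state carries information from earlier layers,
            # so later layers' weights depend on what was generated before.
            h = np.tanh(self.w_in @ z + self.w_h @ h)
            params.append((head @ h).reshape(shape))
        return params


# Usage: predict weights for two conv layers from an "unseen" toy dataset.
images = np.random.default_rng(1).random((32, 64))  # 32 images, 64 pixels each
shapes = [(4, 1, 3, 3), (8, 4, 3, 3)]  # (out_channels, in_channels, kH, kW)
hypernet = ToyHyperNet(embed_dim=64, hidden_dim=16, layer_shapes=shapes)
predicted = hypernet.predict(dataset_embedding(images))
print([w.shape for w in predicted])
```

In a real system the hypernetwork would itself be trained across many datasets so that the predicted weights perform well on the target task; this sketch only shows the data flow from dataset to per-layer parameters.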

More From: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

Similar Papers

A multi-category scanning acoustic image dataset: design, collection, and evaluation
Yue Zhao ... Jun Luo
29 Dec 2022

Novel energy model to analyze the effect of MAC and network parameters on asynchronous IEEE 802.15.4 multi-hop wireless networks lifetime
Y Raja Vara Prasad ... Rajalakshmi Pachamuthu
01 Dec 2014

Capsule-Net for Urdu Digits Recognition
Talha Iqbal ... Hazrat Ali
01 Sep 2019

NEAP-F: Network Epoch Accuracy Prediction Framework (Student Abstract)
Arushi Chauhan ... Richa Singh
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
18 May 2021