Bayesian neural architecture search using a training-free performance metric

Andrés Camero,Hao Wang,Enrique Alba,Thomas Bäck

doi:10.1016/j.asoc.2021.107356

Abstract

Recurrent neural networks (RNNs) are a powerful approach for time series prediction. However, their performance is strongly affected by their architecture and hyperparameter settings. The architecture optimization of RNNs is a time-consuming task, where the search space is typically a mixture of real, integer and categorical values. To allow for shrinking and expanding the size of the network, the representation of architectures often has a variable length. In this paper, we propose to tackle the architecture optimization problem with a variant of the Bayesian Optimization (BO) algorithm. To reduce the evaluation time of candidate architectures the Mean Absolute Error Random Sampling (MRS), a training-free method to estimate the network performance, is adopted as the objective function for BO. Also, we propose three fixed-length encoding schemes to cope with the variable-length architecture representation. The result is a new perspective on accurate and efficient design of RNNs, that we validate on three problems. Our findings show that (1) the BO algorithm can explore different network architectures using the proposed encoding schemes and successfully designs well-performing architectures, and (2) the optimization time is significantly reduced by using MRS, without compromising the performance as compared to the architectures obtained from the actual training procedure.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Soft Computing	Publication Date: Mar 29, 2021
Citations: 15	License type: other-oa

R Discovery Prime

R Discovery Prime

Bayesian neural architecture search using a training-free performance metric

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing

Lead the way for us

Similar Papers

LSBO-NAS: Latent Space Bayesian Optimization for Neural Architecture Search
Xuan Rao ... Qiuye Wu
-
Xuan Rao, et. al.Xuan Rao ... Qiuye Wu
02 Dec 2022
02 Dec 2022

Prediction and analysis of train arrival delay based on XGBoost and Bayesian optimization
Rui Shi ... Yanqiu Li
Applied Soft Computing | VOL. 109
Rui Shi, et. al.Rui Shi ... Yanqiu Li
24 May 2021
Applied Soft Computing | VOL. 109

Bayesian Optimized Echo State Network Applied to Short-Term Load Forecasting
Gabriel Trierweiler Ribeiro ... Naylene Fraccanabbia
Energies | VOL. 13
Gabriel Trierweiler Ribeiro, et. al.Gabriel Trierweiler Ribeiro ... Naylene Fraccanabbia
11 May 2020
Energies | VOL. 13

Targeting Sustainable Transportation Development: The Support Vector Machine and the Bayesian Optimization Algorithm for Classifying Household Vehicle Ownership
Zhiqiang Xu ... Mahdi Aghaabbasi
Sustainability | VOL. 14
Zhiqiang Xu, et. al.Zhiqiang Xu ... Mahdi Aghaabbasi
05 Sep 2022
Sustainability | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bayesian neural architecture search using a training-free performance metric

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing