Rethinking Bi-Level Optimization in Neural Architecture Search: A Gibbs Sampling Perspective

Chao Xue, Yonggang Hu, Junchi Yan, Xiaoxing Wang, Xiaokang Yang, Kewei Sun

doi:10.1609/aaai.v35i12.17262

Abstract

One-shot architecture search, which aims to explore all possible operations jointly within a single model, has been an active direction in Neural Architecture Search (NAS). As a well-known one-shot solution, Differentiable Architecture Search (DARTS) applies a continuous relaxation to the architecture's importance, which results in a bi-level optimization problem. However, as many recent studies have shown, DARTS does not always work robustly on new tasks, mainly because the bi-level optimization is solved only approximately. In this paper, one-shot neural architecture search is addressed by adopting a directed probabilistic graphical model to represent the joint probability distribution over data and model. Neural architectures are then searched for and optimized by Gibbs sampling. We rethink the bi-level optimization problem as the task of Gibbs sampling from the posterior distribution, which expresses the preferences for different models given the observed dataset. We evaluate our proposed NAS method, GibbsNAS, on the search space used in DARTS/ENAS and on the search space of NAS-Bench-201. Experimental results on multiple search spaces show the efficacy and stability of our approach.
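For readers unfamiliar with Gibbs sampling, the following is a minimal sketch of the general technique on a toy problem. It is not the GibbsNAS algorithm and is not taken from the paper. The names `alpha` and `w` are hypothetical scalar stand-ins for "architecture" and "weights", and the target distribution (a zero-mean, unit-variance bivariate normal with correlation `rho`) is an assumption chosen only because its conditionals are known in closed form. The sketch shows the core idea of alternately drawing each variable from its conditional distribution given the other, rather than solving a nested optimization.

```python
import random


def gibbs_bivariate_normal(rho, n_samples, burn_in=500, seed=0):
    """Gibbs sampling from a standard bivariate normal with correlation rho.

    `alpha` and `w` are hypothetical scalar stand-ins used for illustration;
    a real NAS setting would involve high-dimensional architecture and
    weight variables with a very different posterior.
    """
    rng = random.Random(seed)
    cond_std = (1.0 - rho * rho) ** 0.5  # std of each conditional distribution
    alpha, w = 0.0, 0.0
    samples = []
    for step in range(burn_in + n_samples):
        # Draw w from p(w | alpha) = N(rho * alpha, 1 - rho^2).
        w = rng.gauss(rho * alpha, cond_std)
        # Draw alpha from p(alpha | w) = N(rho * w, 1 - rho^2).
        alpha = rng.gauss(rho * w, cond_std)
        if step >= burn_in:
            samples.append((alpha, w))
    return samples


def correlation(pairs):
    """Empirical Pearson correlation of a list of (alpha, w) pairs."""
    n = len(pairs)
    mean_a = sum(a for a, _ in pairs) / n
    mean_w = sum(w for _, w in pairs) / n
    cov = sum((a - mean_a) * (w - mean_w) for a, w in pairs) / n
    var_a = sum((a - mean_a) ** 2 for a, _ in pairs) / n
    var_w = sum((w - mean_w) ** 2 for _, w in pairs) / n
    return cov / (var_a * var_w) ** 0.5


if __name__ == "__main__":
    draws = gibbs_bivariate_normal(rho=0.8, n_samples=20000)
    # The empirical correlation should be close to the target rho.
    print(round(correlation(draws), 2))
```

After burn-in, the draws approximate samples from the joint distribution, so their empirical correlation should approach `rho`. In the toy case the alternation mirrors the two levels of a bi-level problem only loosely: each variable is resampled given the current value of the other.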

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
Publication Date: May 18, 2021
Citations: 7

Similar Papers

Neural architecture search based on dual attention mechanism for image classification.
Cong Jin ... Jinjie Huang
Mathematical Biosciences and Engineering | VOL. 20
01 Jan 2021

Evolutionary Search for Complete Neural Network Architectures With Partial Weight Sharing
Haoyu Zhang ... Kuangrong Hao
IEEE transactions on evolutionary computation : a publication of the IEEE Neural Networks Council | VOL. 26
01 Oct 2022

TA-DARTS: Temperature Annealing of Discrete Operator Distribution for Effective Differential Architecture Search
Jiyong Shin ... Dae-Ki Kang
Applied sciences | VOL. 13
08 Sep 2023

PWSNAS: Powering Weight Sharing NAS With General Search Space Shrinking Framework.
Yiming Hu ... Qingyi Gu
IEEE transactions on neural networks | VOL. 34
01 Nov 2023
