Abstract

Gradient descent, an effective way to search for a local minimum of a function, can minimize the training and validation loss of neural architectures and can also be applied to reduce the search cost of neural architecture search. In recent years, neural architecture search (NAS) has been widely used to automatically construct architectures for specific tasks. Most well-performing NAS methods adopt reinforcement learning, evolutionary algorithms, or gradient descent to find the best-performing candidate architecture. Among these, gradient descent-based architecture search approaches outperform the others in terms of efficiency, simplicity, computational cost, and validation error. In view of this, an in-depth survey is needed to cover the usefulness of the gradient descent method and how it benefits neural architecture search. We begin our survey with the basic concepts of neural architecture search and gradient descent and their distinctive properties. The survey then delves into the impact of the gradient descent method on NAS and explores its effect on the search process used to generate candidate architectures. It also reviews the most widely used gradient-based search approaches in NAS. Finally, we outline current research challenges and open problems in NAS-based approaches that need to be addressed in future research.
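As a minimal illustration of the update rule the abstract refers to, the sketch below applies plain gradient descent to a toy one-dimensional loss; the loss function, learning rate, and step count are illustrative assumptions, not details from the paper.

```python
# Plain gradient descent: repeatedly step opposite the gradient to move
# toward a local minimum of a differentiable loss.
def gradient_descent(grad_fn, theta, lr=0.1, steps=100):
    for _ in range(steps):
        theta = theta - lr * grad_fn(theta)  # theta <- theta - lr * dL/dtheta
    return theta

# Toy example: L(theta) = (theta - 3)^2 has gradient 2 * (theta - 3),
# so the iterates converge toward the minimizer theta = 3.
theta_star = gradient_descent(lambda t: 2 * (t - 3), theta=0.0)
print(round(theta_star, 4))  # approximately 3.0
```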

Highlights

  • Automatic machine learning (AutoML) has become a favorable solution for developing deep learning (DL) systems without human effort

  • Lessons learned: this study reviews various gradient descent (GD)-based neural architecture search (NAS) approaches from different directions and summarizes the lessons learned from the survey

  • Gradient descent is a better solution for architecture search in NAS approaches; ignoring it increases the architecture search cost in terms of GPU days


Summary

INTRODUCTION

Automatic machine learning (AutoML) has become a favorable solution for developing deep learning (DL) systems without human effort. The model generation stage is handled either by machine learning experts or by an automatic design process. The search space construction stage explores a large set of possible network architectures that can match or outperform expert-designed architectures. As NAS has been recognized as the core technology of next-generation neural architecture design, researchers have focused on extending their knowledge to automatic architecture design processes. Along this line, Elsken et al. [9] presented a survey on NAS that addressed the elementary ideas of NAS, different search approaches, performance estimation strategies, and future directions. This inspires the present survey: an in-depth study that explores architecture optimization strategies for generating candidate architectures with good performance and helps readers identify possible research ideas and further directions.
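To make the gradient-based search idea concrete, the sketch below shows a DARTS-style continuous relaxation, in which the discrete choice among candidate operations is replaced by a softmax over learnable architecture parameters so that the architecture itself can be optimized by gradient descent. The candidate operations, names, and tensor shapes are illustrative assumptions, not the exact formulation of any surveyed method.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Sketch of a DARTS-style mixed operation: instead of picking one operation
# per edge, the output is a softmax-weighted sum of all candidate operations,
# where the weights `alpha` are learnable architecture parameters.
class MixedOp(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),  # 3x3 convolution
            nn.Conv2d(channels, channels, 5, padding=2),  # 5x5 convolution
            nn.Identity(),                                # skip connection
        ])
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))  # architecture weights

    def forward(self, x):
        weights = F.softmax(self.alpha, dim=0)
        return sum(w * op(x) for w, op in zip(weights, self.ops))

# Usage sketch: in practice, network weights are updated on the training loss
# and `alpha` on the validation loss, approximating a bilevel objective.
op = MixedOp(channels=8)
x = torch.randn(2, 8, 16, 16)
out = op(x)  # same shape as x
```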

CONTRIBUTIONS
BACKGROUND
PRELIMINARY OF GRADIENT DESCENT METHOD
GRADIENT DESCENT PROBLEMS
EXPLODING GRADIENT PROBLEM
LEARNING RATE
MOMENTUM
ADAGRAD METHOD
ADADELTA
ADAMAX
STABILITY AND CONVERGENCE ANALYSIS
PERFORMANCE EVALUATION
The commonly used performance evaluation metrics are:
RESEARCH ISSUES AND CHALLENGES