Abstract

The influence of deep learning continues to expand across domains, and new applications are ubiquitous. The question of neural network design thus grows in importance, as traditional empirical approaches reach their limits. Manual design of network architectures from scratch relies heavily on trial and error, while reusing existing pretrained models can introduce redundancies or vulnerabilities. Automated neural architecture design can overcome these problems, but the most successful algorithms operate on significantly constrained design spaces, assuming the target network consists of identical repeating blocks. While such an approach allows for faster search, it does so at the cost of expressivity. We instead propose an alternative probabilistic representation of the whole neural network structure under the assumption of independence between layer types. Our matrix of probabilities is equivalent to a population of models, yet allows for the discovery of structural irregularities while remaining simple to interpret and analyze. We construct an architecture search algorithm, inspired by estimation of distribution algorithms, to take advantage of this representation. The probability matrix is tuned towards generating high-performance models by repeatedly sampling architectures and evaluating the corresponding networks, while gradually increasing the model depth. Our algorithm is shown to discover non-regular models which cannot be expressed via blocks but are competitive in both accuracy and computational cost, while not relying on complex dataflows or advanced training techniques, and remaining conceptually simple and highly extensible.
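The search loop described above — sample architectures from a matrix of layer-type probabilities, evaluate them, and shift the probabilities toward the best samples — can be sketched in the spirit of estimation of distribution algorithms (a PBIL-style update). The layer-type set, function names, and update rule below are illustrative assumptions, not the authors' exact ASED implementation, and the gradual depth increase is omitted for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical layer-type vocabulary; each depth position picks one type.
LAYER_TYPES = ["conv3x3", "conv5x5", "maxpool", "identity"]

def sample_architecture(P):
    """Sample one layer type per depth position, independently per row of P."""
    return [int(rng.choice(len(LAYER_TYPES), p=row)) for row in P]

def update(P, elites, lr=0.1):
    """Shift each row's distribution toward the elite samples' choices (PBIL-style)."""
    for arch in elites:
        for d, t in enumerate(arch):
            target = np.zeros(len(LAYER_TYPES))
            target[t] = 1.0
            P[d] = (1 - lr) * P[d] + lr * target
    return P

def search(evaluate, depth=4, generations=20, pop=16, elite_frac=0.25):
    """Tune the probability matrix toward architectures that score well."""
    # Start from a uniform distribution over layer types at every depth.
    P = np.full((depth, len(LAYER_TYPES)), 1.0 / len(LAYER_TYPES))
    for _ in range(generations):
        archs = [sample_architecture(P) for _ in range(pop)]
        scores = [evaluate(a) for a in archs]          # network training/eval in practice
        order = np.argsort(scores)[::-1]               # best scores first
        elites = [archs[i] for i in order[: max(1, int(elite_frac * pop))]]
        P = update(P, elites)
    return P
```

In a real search, `evaluate` would train and score the CNN that an architecture encodes; here any scalar fitness over the sampled layer indices suffices to drive the probability matrix toward high-scoring structures.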

Highlights

  • The recent successes of deep learning have attracted significant interest in numerous fields of knowledge [1]

  • Computer vision in particular has witnessed the development of multiple successful models, based on convolutional neural networks (CNNs), for tasks such as classification [2], [3], semantic segmentation [4], and detection [5]

  • We propose a CNN architecture search method based on the optimization of the above prototype, denoted Architecture Search by Estimation of network structure Distributions (ASED)

Introduction

The recent successes of deep learning have attracted significant interest in numerous fields of knowledge [1]. While the growth of deep learning solutions over the years is impressive, their adoption brings many significant challenges of both a theoretical and a practical nature. In addition to well-known problems such as overfitting and vanishing gradients, which have been subjects of extensive research over the years, new issues that are not yet fully understood continue to be discovered. The lack of interpretability of decisions made by deep models [6], [7] is a difficult problem to tackle, but it has attracted increasing research attention recently [8]. Further concerns have been raised regarding the secure practical use of common deep models, as they have been shown to be vulnerable to attacks utilizing malicious data [9].
