Abstract
Architecture search is the automatic process of designing the model or cell structure that is optimal for a given dataset or task. Recently, a weight-sharing-based approach called Efficient Neural Architecture Search (ENAS) has achieved strong performance on language modeling and image classification with reasonable training speed. In this work, we propose a novel architecture search algorithm called Flexible and Expressible Neural Architecture Search (FENAS), with a more flexible and expressible search space than ENAS in terms of activation functions, input edges, and atomic operations. Unlike ENAS, our FENAS approach can reproduce the well-known LSTM and GRU architectures, and can also be initialized with them to find architectures more efficiently. We explore this extended search space via evolutionary search and show that FENAS performs significantly better on several popular text classification tasks while performing similarly to ENAS on a standard language modeling benchmark. Further, we present ablations and analyses of our FENAS approach.
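As a minimal, hypothetical sketch of the evolutionary exploration mentioned above (the paper's exact selection and mutation scheme is not shown here; `mutate` and `fitness` are assumed callbacks, e.g., rewiring an input edge or swapping an activation, and measuring validation performance), the loop below keeps the fittest architectures of each generation and refills the population with mutated copies. Per the abstract, the initial population could be seeded with LSTM/GRU cells:

```python
import random

def evolutionary_search(init_population, mutate, fitness, generations=50, top_k=10):
    """Truncation-selection evolutionary search over cell architectures.

    init_population: list of candidate architectures (e.g., seeded with LSTM/GRU)
    mutate: callback that returns a perturbed copy of an architecture
    fitness: callback that scores an architecture (e.g., validation accuracy)
    """
    population = list(init_population)
    for _ in range(generations):
        # Rank the current population by fitness and keep the top_k survivors.
        scored = sorted(population, key=fitness, reverse=True)
        parents = scored[:top_k]
        # Refill the population with mutated copies of random survivors.
        num_children = len(population) - len(parents)
        children = [mutate(random.choice(parents)) for _ in range(num_children)]
        population = parents + children
    return max(population, key=fitness)
```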
Highlights
Architecture search enables automatic ways of finding the best model architecture and cell structures for the given task or dataset, as opposed to the traditional approach of manually tuning among different architecture choices
Compared with previous neural architecture search (NAS) approaches, our Flexible and Expressible Neural Architecture Search (FENAS) performs comparably on Penn Treebank (PTB) and significantly better on several downstream GLUE tasks
The FENAS search space is larger than that of Efficient Neural Architecture Search (ENAS) because it allows more activation functions and more inputs to the computational nodes, as illustrated in the sketch below
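To make the larger-search-space claim concrete, here is a minimal, hypothetical PyTorch sketch of one searchable node (the class name, per-edge linear parameterization, and the exact activation/operation sets are illustrative assumptions, not the paper's code). Each node draws on a configurable number of input edges, picks an activation from a wider pool, and combines its inputs with a chosen atomic operation:

```python
import torch
import torch.nn as nn

# A wider activation pool than ENAS's; the exact set here is an assumption.
ACTIVATIONS = {
    "tanh": torch.tanh,
    "relu": torch.relu,
    "sigmoid": torch.sigmoid,
    "identity": lambda x: x,
}

class FenasNode(nn.Module):
    """One node of a sampled cell: combine the chosen inputs, then activate."""

    def __init__(self, hidden_size, num_inputs, activation="tanh", combine="add"):
        super().__init__()
        # One linear transform per incoming edge (assumed parameterization).
        self.edges = nn.ModuleList(
            [nn.Linear(hidden_size, hidden_size) for _ in range(num_inputs)]
        )
        self.activation = ACTIVATIONS[activation]
        self.combine = combine  # atomic operation: "add" or elementwise "mul"

    def forward(self, inputs):
        # inputs: list of num_inputs tensors, each of shape (batch, hidden_size)
        transformed = [edge(x) for edge, x in zip(self.edges, inputs)]
        out = transformed[0]
        for t in transformed[1:]:
            out = out + t if self.combine == "add" else out * t
        return self.activation(out)
```

A node such as `FenasNode(64, num_inputs=2, activation="sigmoid", combine="mul")` behaves like a learned gate; composing such sigmoid-gated, elementwise-multiplied nodes is what lets this kind of search space express LSTM- and GRU-style cells, which (per the abstract) ENAS cannot reproduce.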
Summary
Architecture search enables automatic ways of finding the best model architecture and cell structures for the given task or dataset, as opposed to the traditional approach of manually tuning among different architecture choices. This idea has been successfully applied to the tasks of language modeling and image classification (Zoph and Le, 2017; Zoph et al., 2018; Cai et al., 2018; Liu et al., 2018a,b). The first approach to architecture search involved an RNN controller that samples a model architecture and uses the validation performance of this architecture, trained on the given dataset, as feedback (or reward) to sample the next architecture.
[Figure: an example recurrent cell, in which nodes combine the inputs x[t] and h[t-1] through activations such as tanh and ReLU and an add operation to produce the output h[t].]
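As a hedged illustration of this controller loop (not the authors' implementation; `controller`, `build_model`, `train`, and `evaluate` are hypothetical interfaces standing in for a real training pipeline), the reward-driven search reads roughly as:

```python
def architecture_search(controller, build_model, train_data, val_data, steps=100):
    """Controller-based NAS loop in the style of Zoph and Le (2017)."""
    best_arch, best_reward = None, float("-inf")
    for _ in range(steps):
        arch, log_prob = controller.sample()   # controller proposes a cell
        child = build_model(arch)              # instantiate the child model
        train(child, train_data)               # fit it on the target task
        reward = evaluate(child, val_data)     # validation score as reward
        controller.update(log_prob, reward)    # e.g., a REINFORCE step
        if reward > best_reward:
            best_arch, best_reward = arch, reward
    return best_arch
```

Training every sampled child model from scratch is expensive; weight sharing across sampled architectures is what makes ENAS (and, in turn, FENAS) efficient.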