Abstract

High-quality AI solutions require joint optimization of AI algorithms and their hardware implementations. In this work, we are the first to propose a fully simultaneous, Efficient Differentiable DNN (deep neural network) architecture and implementation co-search (EDD) methodology. We formulate the co-search problem by fusing DNN search variables and hardware implementation variables into one solution space, and we maximize both algorithm accuracy and hardware implementation quality. The formulation is differentiable with respect to the fused variables, so a gradient-descent algorithm can be applied to greatly reduce the search time. The formulation is also applicable to various devices with different objectives. In the experiments, we demonstrate the effectiveness of our EDD methodology by searching for three representative DNNs, targeting a low-latency GPU implementation and FPGA implementations with both recursive and pipelined architectures. Each model produced by EDD achieves accuracy on ImageNet similar to the best existing DNN models found by neural architecture search (NAS), but with superior performance, and each search completes within 12 GPU hours. Our DNN targeting GPU is 1.40× faster than the state-of-the-art solution reported in Proxyless [1], and our DNN targeting FPGA delivers 1.45× higher throughput than the state-of-the-art solution reported in DNNBuilder [2].
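To make the abstract's core idea concrete, the sketch below illustrates one common way such a differentiable co-search can be set up: discrete architecture and hardware-implementation choices are relaxed into continuous softmax weights, and a single weighted loss over accuracy and expected latency is minimized by gradient descent over both variable groups at once. This is a minimal illustration under our own assumptions, not the authors' code or their exact formulation; all names (`arch_logits`, `impl_logits`, `LATENCY_TABLE`, `LAMBDA`) and the latency numbers are hypothetical.

```python
# Minimal sketch of differentiable architecture/implementation co-search.
# Hypothetical illustration only; not the EDD authors' implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

# Candidate operations per layer (architecture variables) and candidate
# hardware implementations per operation (implementation variables).
NUM_LAYERS, NUM_OPS, NUM_IMPLS = 4, 3, 2

# Hypothetical per-(op, impl) latency estimates in ms, e.g. from profiling.
LATENCY_TABLE = torch.tensor([[1.0, 0.6],
                              [2.0, 1.1],
                              [3.5, 1.8]])

# Continuous relaxations of the discrete choices: one logit vector per layer
# for the operation, and one per (layer, op) pair for its implementation.
arch_logits = nn.Parameter(torch.zeros(NUM_LAYERS, NUM_OPS))
impl_logits = nn.Parameter(torch.zeros(NUM_LAYERS, NUM_OPS, NUM_IMPLS))

def expected_latency():
    """Expected latency of the relaxed network: a softmax-weighted sum over
    operation and implementation choices, differentiable in both logit sets."""
    op_probs = F.softmax(arch_logits, dim=-1)               # (L, O)
    impl_probs = F.softmax(impl_logits, dim=-1)             # (L, O, I)
    per_op_latency = (impl_probs * LATENCY_TABLE).sum(-1)   # (L, O)
    return (op_probs * per_op_latency).sum()

def accuracy_loss():
    # Stand-in for the task loss; in a real search this would be the
    # training loss of a weight-sharing supernet on a held-out batch.
    return (arch_logits ** 2).mean()

optimizer = torch.optim.Adam([arch_logits, impl_logits], lr=0.1)
LAMBDA = 0.05  # trade-off between accuracy and hardware performance

for step in range(100):
    optimizer.zero_grad()
    loss = accuracy_loss() + LAMBDA * expected_latency()
    loss.backward()   # gradients flow to BOTH variable groups simultaneously
    optimizer.step()

# After the search, discretize by taking the argmax of each logit vector.
print("chosen ops per layer:", arch_logits.argmax(dim=-1).tolist())
```

Because the relaxation makes every choice differentiable, one optimizer updates architecture and implementation variables in the same backward pass; this is what allows gradient-based co-search to finish in GPU-hours rather than the GPU-days typical of reinforcement-learning-based NAS.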
