Software-Defined Design Space Exploration for an Efficient DNN Accelerator Architecture

Ye Yu,Yingmin Li,Shuai Che,Weifeng Zhang,Niraj K Jha

doi:10.1109/tc.2020.2983694

Ye Yu, Yingmin Li + Show 3 more

Open Access

https://doi.org/10.1109/tc.2020.2983694

Copy DOI

Abstract

Deep neural networks (DNNs) have been shown to outperform conventional machine learning algorithms across a wide range of applications, e.g., image recognition, object detection, robotics, and natural language processing. However, the high computational complexity of DNNs often necessitates extremely fast and efficient hardware. The problem gets worse as the size of neural networks grows exponentially. As a result, customized hardware accelerators have been developed to accelerate DNN processing without sacrificing model accuracy. However, previous accelerator design studies have not fully considered the characteristics of the target applications, which may lead to sub-optimal architecture designs. On the other hand, new DNN models have been developed for better accuracy, but their compatibility with the underlying hardware accelerator is often overlooked. In this article, we propose an application-driven framework for architectural design space exploration of DNN accelerators. This framework is based on a hardware analytical model of individual DNN operations. It models the accelerator design task as a multi-dimensional optimization problem. We demonstrate that it can be efficaciously used in application-driven accelerator architecture design: we use the framework to optimize the accelerator configurations for eight representative DNNs and select the configuration with the highest geometric mean performance. The geometric mean performance improvement of the selected DNN configuration relative to the architectural configuration optimized only for each individual DNN ranges from 12.0 to 117.9 percent. Given a target DNN, the framework can generate efficient accelerator design solutions with optimized performance and area. Furthermore, we explore the opportunity to use the framework for accelerator configuration optimization under simultaneous diverse DNN applications. The framework is also capable of improving neural network models to best fit the underlying hardware resources. We demonstrate that it can be used to analyze the relationship between the operations of the target DNNs and the corresponding accelerator configurations, based on which the DNNs can be tuned for better processing efficiency on the given accelerator without sacrificing accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Computers	Publication Date: Jan 1, 2021
Citations: 63	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Software-Defined Design Space Exploration for an Efficient DNN Accelerator Architecture

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers

Lead the way for us

Similar Papers

High-performance and energy-efficient deep learning for resource-constrained devices
Ao Ren
-
Ao RenAo Ren
10 May 2021
10 May 2021

A comparative evaluation of deep convolutional neural network and deep neural network-based land use/land cover classifications of mining regions using fused multi-sensor satellite data
Ajay Kumar ... Amit Kumar Gorai
Advances in Space Research | VOL. 72
Ajay Kumar, et. al.Ajay Kumar ... Amit Kumar Gorai
04 Sep 2023
Advances in Space Research | VOL. 72

Understanding adversarial attack and defense towards deep compressed neural networks
Qi Liu ... Wujie Wen
-
Qi Liu, et. al.Qi Liu ... Wujie Wen
03 May 2018
03 May 2018

Attack on Deep Steganalysis Neural Networks
Shiyu Li ... Shunzhi Jiang
-
Shiyu Li, et. al.Shiyu Li ... Shunzhi Jiang
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Software-Defined Design Space Exploration for an Efficient DNN Accelerator Architecture

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers