Abstract

State-of-the-art convolutional neural networks (CNNs) usually have many layers and filter weights, which incur heavy computation and communication overheads. A general-purpose instruction set architecture (ISA) is flexible but suffers from low code density and high power consumption. Existing CNN-specific accelerators are far more efficient but are usually inflexible or require a complex controller to handle the computation and data transfer of different CNNs. In this brief, we propose a new CNN-specific ISA that embeds the parallel-computation and data-reuse parameters in the instructions. An instruction generator sets these parameters according to the features of the CNN and the hardware's computation and storage resources. In addition, a reconfigurable accelerator with 225 multipliers and 24 adder trees is realized to achieve efficient parallel computation and data transfer. Compared with x86 processors, our design achieves 392 times better energy efficiency and 16 times higher code density. Compared with other state-of-the-art accelerators, our solution offers greater flexibility, supporting all popular CNNs, as well as higher energy efficiency.
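To make the central idea concrete, the sketch below illustrates what embedding parallel-computation and data-reuse parameters directly in an instruction word might look like. The field names, widths, and layout are assumptions for illustration only, not the paper's actual encoding.

```python
# Hypothetical sketch of a CNN-specific instruction word that embeds
# parallelism and data-reuse parameters in the instruction itself.
# Field widths and names are assumptions, not the paper's real encoding.

OPCODE_BITS, PAR_BITS, REUSE_BITS, TILE_BITS = 4, 8, 8, 12  # 32 bits total

def encode(opcode, par_factor, reuse_rows, tile_width):
    """Pack the four fields into a single 32-bit instruction word."""
    word = opcode
    word = (word << PAR_BITS) | par_factor    # e.g. number of parallel MAC units
    word = (word << REUSE_BITS) | reuse_rows  # feature-map rows kept on-chip for reuse
    word = (word << TILE_BITS) | tile_width   # tile width chosen for the current layer
    return word

def decode(word):
    """Unpack an instruction word back into its four fields."""
    tile_width = word & ((1 << TILE_BITS) - 1); word >>= TILE_BITS
    reuse_rows = word & ((1 << REUSE_BITS) - 1); word >>= REUSE_BITS
    par_factor = word & ((1 << PAR_BITS) - 1); word >>= PAR_BITS
    opcode = word
    return opcode, par_factor, reuse_rows, tile_width

# Round-trip check with illustrative values (225 parallel MACs, 24 reuse rows)
fields = (0x3, 225, 24, 112)
assert decode(encode(*fields)) == fields
```

An instruction generator, as described in the abstract, would choose values such as the parallelism factor and tile width per layer, so the accelerator's controller stays simple while still adapting to different CNNs.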
