Abstract

Deep learning has achieved results competitive with humans in many fields. Traditionally, deep learning networks are executed on CPUs and GPUs. In recent years, more and more neural network (NN) accelerators have been introduced in both academia and industry to improve the performance and energy efficiency of deep learning workloads. In this paper, we introduce a flexible and configurable functional NN accelerator simulator, which can be configured to simulate the microarchitectures of different NN accelerators. The extensible and configurable simulator is helpful for system-level microarchitecture exploration as well as for developing operator optimization algorithms. The simulator is a functional simulator: it models the latencies of computation and memory access and the concurrent execution of modules, and it reports the number of program execution cycles after the simulation completes. We also integrated the simulator into the TVM compilation stack as an optional backend, so users can write operators with TVM and execute them on the simulator.
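To make the simulation model concrete, the sketch below shows one way such a functional simulator can count execution cycles: each module (e.g., a DMA engine and a compute array) is given per-operation latencies, operations on the same module serialize while different modules overlap, and an event queue tracks the final finish time. The class, module names, and latency values are illustrative assumptions for this sketch, not the paper's actual implementation.

```python
import heapq

# A minimal sketch of an event-driven functional simulator that models
# per-module latencies and concurrency between modules, then reports the
# total cycle count. All names and latencies here are hypothetical.

class Simulator:
    def __init__(self):
        self.events = []            # min-heap of (finish_cycle, seq, module, op)
        self.seq = 0                # tie-breaker for events finishing together
        self.module_free_at = {}    # cycle at which each module becomes idle

    def issue(self, module, op, latency, start=0):
        """Schedule `op` on `module`; it begins once the module is free."""
        begin = max(start, self.module_free_at.get(module, 0))
        finish = begin + latency
        self.module_free_at[module] = finish
        heapq.heappush(self.events, (finish, self.seq, module, op))
        self.seq += 1
        return finish

    def run(self):
        """Drain the event queue; the last finish time is the cycle count."""
        cycles = 0
        while self.events:
            finish, _, module, op = heapq.heappop(self.events)
            cycles = max(cycles, finish)
            print(f"cycle {finish:5d}: {module} finished {op}")
        return cycles

sim = Simulator()
# The DMA load overlaps with nothing, the first matmul waits on its data,
# and the second matmul serializes behind the first on the same module.
t_load = sim.issue("dma", "load_tile", latency=200)
sim.issue("mac_array", "matmul_0", latency=150, start=t_load)
sim.issue("mac_array", "matmul_1", latency=150)
print("total cycles:", sim.run())
```

Running the sketch reports 500 cycles: the load (200) and the two dependent, serialized matmuls (150 each) form the critical path, which is exactly the kind of latency/concurrency accounting the abstract describes.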

Highlights

  • Deep learning has been applied to image recognition, object detection, speech recognition, and other fields

  • Users can further assemble new neural network (NN) layers from these basic computations, which makes the Cambricon instruction set architecture (ISA) more flexible than its predecessors. The authors implemented a prototype accelerator for the Cambricon ISA, which achieved the same level of performance as DaDianNao in their experiments

  • The simulator is a functional simulator that models the latencies of calculation and memory access and the concurrent execution of modules, and it reports the number of program execution cycles after the simulation completes


Summary

Introduction

Deep learning has been applied to image recognition, object detection, speech recognition, and other fields. CPUs and GPUs are widely used to execute neural networks (NNs), but more and more hardware accelerators have been introduced to improve the performance and energy efficiency of NN computing. One reconfigurable DNN processor for IoT devices uses binary/ternary weights for its calculations; it applies three techniques to improve energy efficiency and achieves 19.9 TOPS/W at a power consumption of 10 mW. Other accelerators improve energy efficiency by reducing data movement between memory and processing units, and several of them use analog arithmetic for matrix calculations. TVM [27] is a deep learning compiler stack; it provides both graph-level and operator-level optimizations and can target different backends, including CPUs, GPUs, and hardware accelerators.
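Since the paper integrates the simulator into TVM as a backend, a short example of TVM's operator-level flow illustrates what "writing operators and executing them" means. The sketch below uses TVM's public tensor-expression API (the `te`/`create_schedule` style of older TVM releases); the vector-add operator and the `llvm` CPU target stand in for the paper's simulator target, which is not shown in this excerpt.

```python
import tvm
from tvm import te
import numpy as np

# Declare a vector-add operator with TVM's tensor expression (te) API.
n = te.var("n")
A = te.placeholder((n,), dtype="float32", name="A")
B = te.placeholder((n,), dtype="float32", name="B")
C = te.compute(A.shape, lambda i: A[i] + B[i], name="C")

# Create a default schedule and compile. A custom accelerator backend
# (such as the paper's simulator) would appear here as a different
# target; "llvm" (CPU) is used so this sketch runs anywhere.
s = te.create_schedule(C.op)
f = tvm.build(s, [A, B, C], target="llvm", name="vector_add")

# Execute the compiled operator and verify the result.
dev = tvm.cpu(0)
a = tvm.nd.array(np.random.rand(1024).astype("float32"), dev)
b = tvm.nd.array(np.random.rand(1024).astype("float32"), dev)
c = tvm.nd.array(np.zeros(1024, dtype="float32"), dev)
f(a, b, c)
np.testing.assert_allclose(c.numpy(), a.numpy() + b.numpy(), rtol=1e-5)
```

In this flow, retargeting an operator from CPU to an accelerator (or its simulator) is mostly a matter of swapping the target and the schedule, which is why a simulator backend is useful for developing operator optimizations before hardware is available.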

Accelerator Architecture and ISA
Codegen System
Experiments
Conclusions
Disclosure
