An On-Chip Fully Connected Neural Network Training Hardware Accelerator Based on Brain Float Point and Sparsity Awareness

Tsung-Han Tsai,Ding-Bang Lin

doi:10.1109/ojcas.2023.3245061

Tsung-Han Tsai, Ding-Bang Lin

Open Access

https://doi.org/10.1109/ojcas.2023.3245061

Copy DOI

Abstract

In recent years, deep neural networks (DNNs) have brought revolutionary progress in various fields with the advent of technology. It is widely used in image pre-processing, image enhancement technology, face recognition, voice recognition, and other applications, gradually replacing traditional algorithms. It shows that the rise of neural networks has led to the reform of artificial intelligence. Since neural network algorithms are computationally intensive, they require GPUs or accelerated hardware for real-time computation. However, the high cost and high power consumption of GPUs result in low energy efficiency. It recently led to much research on accelerated digital circuit hardware design for deep neural networks. In this paper, we propose an efficient and flexible neural network training processor for fully connected layers. Our proposed training processor features low power consumption, high throughput, and high energy efficiency. It uses the sparsity of neuron activations to reduce the number of memory accesses and memory space to achieve an efficient training accelerator. The proposed processor uses a novel reconfigurable computing architecture to maintain high performance when operating Forward Propagation and Backward Propagation. The processor is implemented in Xilinx Zynq UltraSacle+MPSoC ZCU104 FPGA, with an operating frequency of 200MHz and power consumption of 6.444W, and can achieve 102.43 GOPS.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Open Journal of Circuits and Systems	Publication Date: Jan 1, 2023
Citations: 7	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An On-Chip Fully Connected Neural Network Training Hardware Accelerator Based on Brain Float Point and Sparsity Awareness

Abstract

Talk to us

Similar Papers

More From: IEEE Open Journal of Circuits and Systems

Lead the way for us

Similar Papers

A Quasi-Solid-State Flexible Fiber-Shaped Li-CO2 Battery with Low Overpotential and High Energy Efficiency.
Jingwen Zhou ... Dingwang Yuan
Advanced Materials | VOL. 31
Jingwen Zhou, et. al.Jingwen Zhou ... Dingwang Yuan
25 Nov 2018
Advanced Materials | VOL. 31

(Invited) Energy Efficient Neural Network Training with Analog Synapses: Challenges and Opportunities
Matthew J Marinella ... Alex Hsia
Electrochemical Society Meeting Abstracts | VOL. MA2019-02
Matthew J Marinella, et. al.Matthew J Marinella ... Alex Hsia
01 Sep 2019
Electrochemical Society Meeting Abstracts | VOL. MA2019-02

L1 -Norm Batch Normalization for Efficient Training of Deep Neural Networks.
Shuang Wu ... Lei Deng
IEEE Transactions on Neural Networks and Learning Systems | VOL. 30
Shuang Wu, et. al.Shuang Wu ... Lei Deng
09 Nov 2018
IEEE Transactions on Neural Networks and Learning Systems | VOL. 30

The Application of Neural Network in the Evaluation of the Computer Network Security
Zhenyou Sui
-
Zhenyou SuiZhenyou Sui
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An On-Chip Fully Connected Neural Network Training Hardware Accelerator Based on Brain Float Point and Sparsity Awareness

Abstract

Talk to us

Similar Papers

More From: IEEE Open Journal of Circuits and Systems