Architectural improvements and FPGA implementation of a multimodel neuroprocessor

I.Z. Mihu, H.V. Caprita

doi:10.1109/iconip.2002.1198975

Abstract

Because neural networks (NNs) require an enormous amount of learning time, various kinds of dedicated parallel computers have been developed. This paper presents a 2-D systolic array (SA) of dedicated processing elements (PEs), also called systolic cells (SCs), as the heart of a multimodel neural-network accelerator. The instruction set of the SA allows the implementation of several neural algorithms, including error backpropagation and a self-organizing feature map algorithm. Several special architectural facilities are presented to improve the performance of the 2-D SA. A swapping mechanism for the weight matrix allows the implementation of NNs larger than the 2-D SA. A systolically propagated instruction word that accompanies each input vector inside the 2-D SA allows the operating mode to be changed progressively, avoiding intermediate inactive cycles inside the array. An FPGA implementation of the proposed 2-D SA is also presented.
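
The abstract does not describe how the weight-swapping mechanism works internally. As a rough software analogy only, the idea of running a layer larger than the array can be sketched as a tiled matrix-vector product: a fixed-size block of weights is loaded, used, and then replaced by the next block, while partial sums accumulate. The function name `matvec_tiled` and the parameters `rows` and `cols` (standing in for the array dimensions) are hypothetical and do not come from the paper.

```python
def matvec_tiled(weights, x, rows, cols):
    """Compute y = W x using only a rows x cols block of weights at a time.

    Illustrative assumption: each (r0, c0) iteration models one "swap" of a
    weight tile into a fixed-size 2-D array. This is not the paper's design.
    """
    n_out, n_in = len(weights), len(x)
    y = [0.0] * n_out
    for r0 in range(0, n_out, rows):        # tile over output neurons
        for c0 in range(0, n_in, cols):     # tile over input components
            # Load ("swap in") the current tile of the weight matrix.
            tile = [row[c0:c0 + cols] for row in weights[r0:r0 + rows]]
            x_part = x[c0:c0 + cols]
            # Accumulate this tile's partial sums into the outputs.
            for i, tile_row in enumerate(tile):
                y[r0 + i] += sum(w * v for w, v in zip(tile_row, x_part))
    return y


if __name__ == "__main__":
    W = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    print(matvec_tiled(W, [1, 0, -1], rows=2, cols=2))
```

Here a 3x3 weight matrix is processed on a notional 2x2 array in four tile loads; the result matches the untiled product regardless of the tile size chosen.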


Similar Papers

RiSA: A Reinforced Systolic Array for Depthwise Convolutions and Embedded Tensor Reshaping
Hyungmin Cho
ACM Transactions on Embedded Computing Systems | VOL. 20
17 Sep 2021

ReSA: Reconfigurable Systolic Array for Multiple Tiny DNN Tensors
Ching-Jui Lee ... Tsung Tai Yeh
ACM Transactions on Architecture and Code Optimization
21 Mar 2024

Systolic VLSI and FPGA Realization of Artificial Neural Networks
Pramod Kumar Meher
01 Jan 2009

Heterogeneous Systolic Array Architecture for Compact CNNs Hardware Accelerators
Rui Xu ... Dongsheng Li
IEEE Transactions on Parallel and Distributed Systems
01 Jan 2020
