Abstract

Neural networks (NNs) have been widely adopted across application domains, from image and video recognition to natural language processing. Recent studies show that deeper NNs with more parameters achieve substantially higher accuracy. However, complex NNs incur intensive memory accesses. Since the weights of even a single layer can exceed the on-chip storage capacity, the data usually need to be partitioned. Compression can effectively reduce the storage requirements, but existing work does not consider how to partition the resulting sparse matrices. In this paper, we propose a data partition and loop scheduling scheme for sparse NNs. We establish a compression efficiency model for the sparse-matrix compression algorithm and design a partition selection method based on the sparsity characteristics this model reveals. Finally, we design a loop scheduling scheme based on the selected partition size. Experimental results show that the average memory traffic of each layer is reduced to 68% of the original, and the throughput of the three evaluated networks improves by an average of 1.66x.
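
To make the partition-selection idea concrete, here is a minimal Python sketch (ours, not the paper's implementation) of how per-tile sparsity can drive the choice of partition size: each candidate tile size is scored by the worst-case CSR-compressed footprint of its tiles, and the largest size that still fits the on-chip buffer is chosen. The buffer capacity, data widths, and all names (`ONCHIP_BYTES`, `csr_bytes`, `pick_tile`) are assumptions for illustration.

```python
import numpy as np

ONCHIP_BYTES = 64 * 1024        # assumed on-chip buffer capacity (64 KiB)
VAL_BYTES, IDX_BYTES = 2, 2     # assumed fp16 values and 16-bit indices

def csr_bytes(tile: np.ndarray) -> int:
    """Storage cost of one tile in CSR: values + column indices + row pointers."""
    nnz = int(np.count_nonzero(tile))
    rows = tile.shape[0]
    return nnz * (VAL_BYTES + IDX_BYTES) + (rows + 1) * IDX_BYTES

def worst_tile_bytes(w: np.ndarray, t: int) -> int:
    """Largest compressed tile under a t x t partition of weight matrix w."""
    return max(
        csr_bytes(w[i:i + t, j:j + t])
        for i in range(0, w.shape[0], t)
        for j in range(0, w.shape[1], t)
    )

def pick_tile(w: np.ndarray, candidates=(64, 128, 256, 512)) -> int:
    """Pick the largest candidate tile whose worst-case compressed tile fits on chip."""
    feasible = [t for t in candidates if worst_tile_bytes(w, t) <= ONCHIP_BYTES]
    return max(feasible) if feasible else min(candidates)

# Example: a 1024x1024 layer with roughly 90% of the weights pruned to zero.
rng = np.random.default_rng(0)
w = rng.standard_normal((1024, 1024)) * (rng.random((1024, 1024)) < 0.1)
print("chosen tile size:", pick_tile(w))
```

Sizing by the worst-case tile (rather than the average) is one conservative way to guarantee every compressed tile fits on chip; the paper's actual model may trade this guarantee for larger tiles on favorably distributed sparsity.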
