Neural Network Compression Based on Tensor Ring Decomposition.

Kun Xie,Jigang Wen,Can Liu,Xin Wang,Gaogang Xie,Kenli Li,Xiaocan Li

doi:10.1109/tnnls.2024.3383392

Abstract

Deep neural networks (DNNs) have made great breakthroughs and seen applications in many domains. However, the incomparable accuracy of DNNs is achieved with the cost of considerable memory consumption and high computational complexity, which restricts their deployment on conventional desktops and portable devices. To address this issue, low-rank factorization, which decomposes the neural network parameters into smaller sized matrices or tensors, has emerged as a promising technique for network compression. In this article, we propose leveraging the emerging tensor ring (TR) factorization to compress the neural network. We investigate the impact of both parameter tensor reshaping and TR decomposition (TRD) on the total number of compressed parameters. To achieve the maximal parameter compression, we propose an algorithm based on prime factorization that simultaneously identifies the optimal tensor reshaping and TRD. In addition, we discover that different execution orders of the core tensors result in varying computational complexities. To identify the optimal execution order, we construct a novel tree structure. Based on this structure, we propose a top-to-bottom splitting algorithm to schedule the execution of core tensors, thereby minimizing computational complexity. We have performed extensive experiments using three kinds of neural networks with three different datasets. The experimental results demonstrate that, compared with the three state-of-the-art algorithms for low-rank factorization, our algorithm can achieve better performance with much lower memory consumption and lower computational complexity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Neural Network Compression Based on Tensor Ring Decomposition.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems

Lead the way for us

Similar Papers

Deep Convolutional Neural Network Compression Method: Tensor Ring Decomposition with Variational Bayesian Approach
Weirong Liu ... Jie Liu
Neural Processing Letters | VOL. 56
Weirong Liu, et. al.Weirong Liu ... Jie Liu
13 Mar 2024
Neural Processing Letters | VOL. 56

Hyperspectral Anomaly Detection Based on Tensor Ring Decomposition With Factors TV Regularization
Maoyuan Feng ... Qin Shu
IEEE Transactions on Geoscience and Remote Sensing | VOL. 61
Maoyuan Feng, et. al.Maoyuan Feng ... Qin Shu
01 Jan 2023
IEEE Transactions on Geoscience and Remote Sensing | VOL. 61

Joint-Way Compression for LDPC Neural Decoding Algorithm With Tensor-Ring Decomposition
Yuanhui Liang ... Chan-Tong Lam
IEEE Access | VOL. 11
Yuanhui Liang, et. al.Yuanhui Liang ... Chan-Tong Lam
01 Jan 2023
IEEE Access | VOL. 11

A fast Lanczos-based hierarchical algorithm for tensor ring decomposition
Cheng-Wei Sun ... Liang-Jian Deng
Signal Processing | VOL. 227
Cheng-Wei Sun, et. al.Cheng-Wei Sun ... Liang-Jian Deng
11 Sep 2024
Signal Processing | VOL. 227

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neural Network Compression Based on Tensor Ring Decomposition.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems