Abstract
Most state-of-the-art convolutional neural networks (CNNs) are characterised by excessive parameterisation, leading to a high computational burden. Tensor decomposition has emerged as a model reduction technique for compressing deep neural networks. Previous approaches have predominantly relied on either Tucker decomposition or Canonical Polyadic (CP) decomposition for CNNs. However, while CP decomposition achieves markedly stronger compression than Tucker decomposition, it typically incurs a more pronounced accuracy loss. This paper introduces an efficient model compression method, termed TEC-CNN, designed to achieve significant compression while preserving accuracy comparable to that of the original models. TEC-CNN first analyses a given model to identify convolutional layers suitable for low-rank tensor decomposition and computes the ranks of their kernels. An efficient decomposition schema then approximates each kernel tensor to reduce its parameter count. Finally, the original convolutional layers are replaced by a novel convolutional sequence constructed from these reduced-parameter factors. The effectiveness of TEC-CNN is assessed across a range of computer vision tasks. For instance, in CIFAR-100 classification, ResNet18 is compressed to 4.1 MB, while UNeXt, applied to image segmentation on the International Skin Imaging Collaboration (ISIC) dataset, is reduced to 3.419 MB. When employed for fire object detection with Yolov7, TEC-CNN achieves a model size reduction of 71.6 MB. Comprehensive experimental results underscore that our approach achieves significant model compression while preserving model performance.
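The compression principle behind CP-based schemes, i.e. replacing one large kernel with a sequence of rank-R factor convolutions, can be illustrated with a simple parameter count. This is a sketch of the standard CP convolution factorisation (a 1×1 conv, two separable spatial convs, and a final 1×1 conv), not the paper's exact TEC-CNN schema; the layer sizes and the rank R = 64 are illustrative assumptions.

```python
def conv_params(T, S, d):
    """Parameters of a full convolutional kernel of shape T x S x d x d
    (T output channels, S input channels, d x d spatial support)."""
    return T * S * d * d

def cp_conv_params(T, S, d, R):
    """Parameters after a rank-R CP decomposition of that kernel into
    four factor convolutions: 1x1 (S->R), d x 1, 1 x d, 1x1 (R->T)."""
    return S * R + d * R + d * R + R * T

# Illustrative example: a ResNet-style 3x3 layer with 256 in/out channels
full = conv_params(256, 256, 3)         # 589,824 parameters
cp = cp_conv_params(256, 256, 3, 64)    # 33,152 parameters
print(full, cp, round(full / cp, 1))    # roughly a 17.8x reduction
```

The trade-off mentioned in the abstract is visible here: a smaller rank R compresses more aggressively but approximates the original kernel less faithfully, which is the accuracy loss that TEC-CNN aims to control.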
Published in: ACM Transactions on Multimedia Computing, Communications, and Applications