A Training Method for Low Rank Convolutional Neural Networks Based on Alternating Tensor Compose-Decompose Method

Sukho Lee,Hyein Kim,Jungho Yoon,Byeongseon Jeong

doi:10.3390/app11020643

Abstract

Over the past decade, deep learning-based computer vision methods have been shown to surpass previous state-of-the-art computer vision techniques in various fields, and have made great progress in various computer vision problems, including object detection, object segmentation, face recognition, etc. Nowadays, major IT companies are adding new deep-learning-based computer technologies to edge devices such as smartphones. However, since the computational cost of deep learning-based models is still high for edge devices, research is being actively carried out to compress deep learning-based models while not sacrificing high performance. Recently, many lightweight architectures have been proposed for deep learning-based models which are based on low-rank approximation. In this paper, we propose an alternating tensor compose-decompose (ATCD) method for the training of low-rank convolutional neural networks. The proposed training method can better train a compressed low-rank deep learning model than the conventional fixed-structure based training method, so that a compressed deep learning model with higher performance can be obtained in the end of the training. As a representative and exemplary model to which the proposed training method can be applied, we propose a rank-1 convolutional neural network (CNN) which has a structure alternatively containing 3-D rank-1 filters and 1-D filters in the training stage and a 1-D structure in the testing stage. After being trained, the 3-D rank-1 filters can be permanently decomposed into 1-D filters to achieve a fast inference in the test time. The reason that the 1-D filters are not being trained directly in 1-D form in the training stage is that the training of the 3-D rank-1 filters is easier due to the better gradient flow, which makes the training possible even in the case when the fixed structured network with fixed consecutive 1-D filters cannot be trained at all. We also show that the same training method can be applied to the well-known MobileNet architecture so that better parameters can be obtained than with the conventional fixed-structure training method. Furthermore, we show that the 1-D filters in a ResNet like structure can also be trained with the proposed method, which shows the fact that the proposed method can be applied to various structures of networks.

Highlights

In comparison with other convolutional neural network (CNN) models which use 1-D rank-1 filters, we propose the use of 3-D rank-1 filters(W) in the training stage, where the 3-D rank-1 filters are constructed by the outer product of three 1-D vectors, say p, q, and t: W = p ⊗ q ⊗ t
As shown in the experiments, the proposed method learns better parameters due to the over-parametrization produced by the outer product into a 3-D filter, and the MobileNet constructed by the proposed rank-1 training method has a higher accuracy than that of the original MobileNet which is trained with a smaller number of parameters
We proposed a training method which alternatively composes and decomposes the filters in the training stage for better training of low rank filters

Summary

Introduction

The proposed training method do not use a fixed structure of neural network in the training stage, but allows the tensors to be alternatingly composed and decomposed so that a better gradient flow can flow through the tensors in the backpropagation step. This better gradient flow results in better parameter values than with conventional training method so that the compressed model can achieve a higher performance. Thereby, better parameters are obtained than with conventional training with fixed MobileNet structure

Related Works

Works on Compressing the Parameters of Pre-Trained CNNs

Works on Designing a Compressed Model

Works on Edge AI

Preliminaries for the Proposed Method

Bilateral-Projection Based 2DPCA

Flattened Convolutional Neural Networks

Application of the Proposed Training Method to the Rank-1 CNN

Construction of the 3-D Rank-1 Filters

Training Process

Application of the Proposed Training Method to the MobileNet

Experiments

Method

Findings

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jan 11, 2021
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Training Method for Low Rank Convolutional Neural Networks Based on Alternating Tensor Compose-Decompose Method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Single image super-resolution via deep learning
Zhenyue Cao ... Zhenkun Wang
-
Zhenyue Cao, et. al.Zhenyue Cao ... Zhenkun Wang
20 May 2022
20 May 2022

An abbreviated review of deep learning-based image classification models
Zaman Talal Abbood ... Layth Kamil Adday Almajmaie
Indonesian Journal of Electrical Engineering and Computer Science | VOL. 30
Zaman Talal Abbood, et. al.Zaman Talal Abbood ... Layth Kamil Adday Almajmaie
01 Apr 2023
Indonesian Journal of Electrical Engineering and Computer Science | VOL. 30

Comparison between conventional and deep learning-based surrogate models in predicting convective heat transfer performance of U-bend channels
Qi Wang ... Kang Huang
Energy and AI | VOL. 8
Qi Wang, et. al.Qi Wang ... Kang Huang
25 Jan 2022
Energy and AI | VOL. 8

Abstract P5-02-06: Improved histologic grading of breast cancer by a novel deep learning-based model
Mattias Rantalainen ... Stephanie Robertson
Cancer Research | VOL. 80
Mattias Rantalainen, et. al.Mattias Rantalainen ... Stephanie Robertson
14 Feb 2020
Abstract P5-02-06: Improved histologic grading of breast cancer by a novel deep learning-based model
Mattias Rantalainen ... Stephanie Robertson

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Training Method for Low Rank Convolutional Neural Networks Based on Alternating Tensor Compose-Decompose Method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences