Abstract

Large-scale neural networks have attracted much attention for their surprising results on various cognitive tasks such as object detection and image classification. However, the large number of weight parameters in these complex networks is problematic when the models are deployed to embedded systems. The problem is exacerbated in emerging neuromorphic computers, where each weight parameter is stored within a synapse, the primary computational resource of these bio-inspired machines. We describe an effective way of reducing the parameters through recursive tensor factorization: applying the singular value decomposition recursively decomposes the tensor that represents the weight parameters, and the tensor is then approximated by algorithms that jointly minimize the approximation error and the number of parameters. This process factorizes a given network into a deeper, sparser, weight-shared network with good initial weights, which can be fine-tuned by gradient descent.
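
To make the idea concrete, here is a minimal sketch (our illustration, not the authors' exact algorithm) of the basic building block: one truncated-SVD step that replaces a dense m x n weight matrix W with two thinner factors A (m x r) and B (r x n) such that W ~= A @ B. The rank budget r is a hypothetical knob; the paper's algorithms choose it by trading off approximation error against parameter count.

    import numpy as np

    # One truncated-SVD step: replace a dense m x n layer W with two
    # thinner layers A (m x r) and B (r x n) so that W ~= A @ B.
    def svd_compress(W, r):
        U, s, Vt = np.linalg.svd(W, full_matrices=False)
        A = U[:, :r] * np.sqrt(s[:r])          # m x r
        B = np.sqrt(s[:r])[:, None] * Vt[:r]   # r x n
        return A, B

    W = np.random.randn(256, 256)              # toy weight matrix
    A, B = svd_compress(W, r=32)
    err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
    print(f"params: {W.size} -> {A.size + B.size}, rel. error {err:.3f}")

Applying such decompositions recursively to the resulting factors, and to higher-order weight tensors, is what yields the deeper, weight-shared network described above.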

Highlights

  • Large neural networks such as convolutional neural networks have demonstrated state-of-the-art performance in a number of benchmarks in computer vision, automatic speech recognition, natural language processing, audio recognition, etc. [1,2,3,4]

  • We describe a general parameter reduction approach built on new divide-and-conquer tensor approximation methods [20]

  • This paper makes the following contributions: (1) for neuromorphic computers, we evaluated the methods' ability to reduce model size, measured by parameter reduction

Introduction

Large neural networks such as convolutional neural networks have demonstrated state-of-the-art performance in a number of benchmarks in computer vision, automatic speech recognition, natural language processing, audio recognition, etc. [1,2,3,4]. While the enormous computing power available today, driven mainly by GPUs, makes evaluating these networks seem easy, it comes at the cost of high energy consumption. Weight parameters in neural networks are heavily redundant [6]; by exploiting this redundancy, computational cost and space requirements can be minimized while maintaining performance. To this end, several methods have been proposed recently [7,8,9,10,11,12], all of which assume that neural networks are executed on stored-program computers, including GPU-based machines. These traditional computers have several processing bottlenecks, such as limited memory bandwidth and a limited number of processing elements, so the performance benefit (e.g., speed-up) from parameter reduction is not as high as the reduction rate itself.
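
As a back-of-envelope illustration with sizes of our own choosing (not taken from the paper), the reduction rate of a rank-r factorization of an m x n layer is mn / (r(m + n)). On a neuromorphic computer, where each weight occupies a synapse, the synapse count shrinks by exactly this factor, whereas on a stored-program machine the observed speed-up is typically smaller:

    # Illustrative sizes, not taken from the paper.
    m, n, r = 1024, 1024, 64
    dense = m * n             # 1,048,576 weights in the original layer
    factored = r * (m + n)    # 131,072 weights across the two factors
    print(dense / factored)   # 8.0x fewer parameters (and synapses)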
