Abstract

Neural Networks (NN), although successfully applied to several Artificial Intelligence tasks, are often unnecessarily over-parametrized. In edge/fog computing, this may make their training prohibitive on resource-constrained devices, in contrast with the current trend of decentralizing intelligence from remote data centres to local constrained devices. We therefore investigate the problem of training effective NN models on constrained devices with a fixed, potentially small, memory budget. We target techniques that are both resource-efficient and effective while enabling significant network compression. Our Dynamic Hard Pruning (DynHP) technique incrementally prunes the network during training, identifying neurons that contribute only marginally to the model accuracy. DynHP enables a tunable size reduction of the final neural network and reduces the NN memory occupancy during training. The freed memory is reused by a dynamic batch sizing approach to counterbalance the accuracy degradation caused by the hard pruning strategy, improving its convergence and effectiveness. We assess the performance of DynHP through reproducible experiments on three public datasets, comparing it against reference competitors. Results show that DynHP compresses a NN up to 10 times without significant performance drops (up to 3.5% additional error w.r.t. the competitors), reducing the training memory occupancy by up to 80%.
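To make the mechanism concrete, below is a minimal, hypothetical PyTorch sketch of the two ingredients described above: neurons whose weights contribute little are hard-pruned while training proceeds, and the batch size grows as memory is freed. The toy model, the L1 magnitude importance score, the 10% pruning quantile, and the batch-growth rule are simplifying assumptions for illustration, not the exact DynHP criteria or schedules.

    # Hypothetical sketch of DynHP-style training: hard pruning of low-contribution
    # neurons plus a batch size that grows as memory is freed. Importance scores,
    # thresholds and the growth rule are illustrative assumptions only.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    torch.manual_seed(0)
    X, y = torch.randn(2048, 64), torch.randint(0, 10, (2048,))        # toy data
    model = nn.Sequential(nn.Linear(64, 256), nn.ReLU(), nn.Linear(256, 10))
    opt = torch.optim.SGD(model.parameters(), lr=0.1)

    hidden = model[0]                               # layer whose neurons get pruned
    alive = torch.ones(256, dtype=torch.bool)       # which hidden neurons survive
    base_bs, max_bs = 32, 256                       # batch size grows over time

    for epoch in range(10):
        # Hard pruning: permanently drop the 10% least important surviving neurons,
        # scoring each neuron by the L1 norm of its incoming weights.
        importance = hidden.weight.abs().sum(dim=1)
        threshold = torch.quantile(importance[alive], 0.1)
        alive &= importance > threshold
        with torch.no_grad():
            hidden.weight[~alive] = 0.0
            hidden.bias[~alive] = 0.0

        # Dynamic batch sizing: reinvest the memory freed by pruning into larger
        # batches (a linear rule here, purely for illustration).
        pruned_frac = 1.0 - alive.float().mean().item()
        batch_size = min(max_bs, int(base_bs * (1.0 + 4.0 * pruned_frac)))

        perm = torch.randperm(len(X))
        for i in range(0, len(X), batch_size):
            idx = perm[i:i + batch_size]
            loss = F.cross_entropy(model(X[idx]), y[idx])
            opt.zero_grad()
            loss.backward()
            opt.step()
            with torch.no_grad():                   # keep pruned neurons at zero
                hidden.weight[~alive] = 0.0
                hidden.bias[~alive] = 0.0

        print(f"epoch {epoch}: {alive.sum().item()}/256 neurons kept, "
              f"batch_size={batch_size}, loss={loss.item():.3f}")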

Highlights

  • In recent years, AI solutions have been successfully adopted in a wide variety of tasks

  • The memory freed by pruning is reused to minimise the accuracy degradation caused by the hard pruning strategy: the batch size fed to the neural network is adaptively adjusted to improve its convergence and final effectiveness

  • We propose Dynamic Hard Pruning (DynHP), a new resource-efficient neural network (NN) learning technique that achieves performance comparable to conventional NN training algorithms while enabling significant levels of network compression


Summary

Introduction

AI solutions have been successfully adopted in a wide variety of tasks. Neural networks (NN) are among the most successful technologies, achieving state-of-the-art performance in several application fields, including image recognition, computer vision, natural language processing, and speech recognition. It has been shown that NNs may suffer from over-parametrization [12, 27, 34], so that they can be pruned significantly without any loss of accuracy. They can over-fit and even memorize random patterns in the data [36] if not properly regularized. We first introduce the notation used in the rest of the paper, and then discuss the state-of-the-art techniques for soft pruning of neural networks during training that inspired our solution.
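As a point of reference for the soft-pruning approaches mentioned above, the following hypothetical PyTorch sketch zeroes the currently least important weights at each epoch but keeps them in the model, so that ordinary training can grow them back. It illustrates the general idea only and does not reproduce any specific method from the cited works.

    # Hypothetical illustration of soft pruning during training: low-magnitude
    # weights are zeroed at each epoch but remain trainable and may recover.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    torch.manual_seed(0)
    X, y = torch.randn(1024, 32), torch.randint(0, 5, (1024,))   # toy data
    layer = nn.Linear(32, 5)
    opt = torch.optim.SGD(layer.parameters(), lr=0.1)

    for epoch in range(20):
        # Soft pruning step: silence the smallest weights under a gradually
        # increasing sparsity target (an illustrative schedule).
        sparsity = min(0.5, 0.05 * epoch)
        k = int(sparsity * layer.weight.numel())
        if k > 0:
            thresh = layer.weight.abs().flatten().kthvalue(k).values
            with torch.no_grad():
                layer.weight[layer.weight.abs() <= thresh] = 0.0

        for i in range(0, len(X), 64):
            loss = F.cross_entropy(layer(X[i:i + 64]), y[i:i + 64])
            opt.zero_grad()
            loss.backward()
            opt.step()   # zeroed weights still receive gradients and may regrow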

