Abstract

Deep learning (DL) models have recently excelled across a wide range of fields. These successes rest on intricate models with hundreds of millions or even billions of parameters, trained and served on high-performance graphics processing units or tensor processing units. The need to deploy DL models on real-time devices with tight latency, memory, and power constraints is the key driving force behind research into DL model compression techniques. At the same time, growing data availability encourages multimodal fusion in DL models to boost predictive accuracy. To obtain compact DL models whose deployment is memory- and computation-efficient, the information carried in the network parameters is compressed as far as possible, retaining only the bits necessary to carry out the task. Model acceleration and compression therefore require a careful trade-off between compression rate and accuracy loss, so that the model's performance is not severely degraded. In this paper, we examine DL model compression techniques used for both single-modality and multimodal deep learning tasks. We survey numerous compression methods that have advanced across a range of applications, discuss the benefits and drawbacks of various compression and acceleration approaches, such as their ineffectiveness at compressing more complicated networks with dimensionality-dependent structures, and conclude with the field's future prospects.
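As a minimal illustration of the parameter-compression idea summarized above (retaining only the bits needed for the task), the sketch below shows symmetric post-training quantization of a weight tensor from float32 to int8. It is a simplified NumPy example under assumed settings (per-tensor scaling, a randomly generated weight matrix), not any specific method surveyed in the paper.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-tensor quantization of float32 weights to int8."""
    scale = np.abs(weights).max() / 127.0              # map the largest magnitude to 127
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float32 weights for inference-time use."""
    return q.astype(np.float32) * scale

# Illustrative usage on a random dense-layer weight matrix (hypothetical shape).
rng = np.random.default_rng(0)
w = rng.standard_normal((256, 128)).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print("max abs error:", np.abs(w - w_hat).max())       # bounded by roughly scale / 2
print("storage reduction: 4x (float32 -> int8)")
```

The 4x storage reduction comes purely from the narrower datatype; the accuracy impact of the rounding error is exactly the compression-rate versus accuracy-loss trade-off discussed in the abstract.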
