Neural Network Compression for Noisy Storage Devices

Berivan Isik,Armin Alaghi,H.-S. Philip Wong,Tsachy Weissman,Stefano Ermon,Xin Zheng,Kristy Choi

doi:10.1145/3588436

Abstract

Compression and efficient storage of neural network (NN) parameters is critical for applications that run on resource-constrained devices. Despite the significant progress in NN model compression, there has been considerably less investigation in the actual physical storage of NN parameters. Conventionally, model compression and physical storage are decoupled, as digital storage media with error-correcting codes (ECCs) provide robust error-free storage. However, this decoupled approach is inefficient as it ignores the overparameterization present in most NNs and forces the memory device to allocate the same amount of resources to every bit of information regardless of its importance. In this work, we investigate analog memory devices as an alternative to digital media – one that naturally provides a way to add more protection for significant bits unlike its counterpart, but is noisy and may compromise the stored model’s performance if used naively. We develop a variety of robust coding strategies for NN weight storage on analog devices, and propose an approach to jointly optimize model compression and memory resource allocation. We then demonstrate the efficacy of our approach on models trained on MNIST, CIFAR-10, and ImageNet datasets for existing compression techniques. Compared to conventional error-free digital storage, our method reduces the memory footprint by up to one order of magnitude, without significantly compromising the stored model’s accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Neural Network Compression for Noisy Storage Devices

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems

Lead the way for us

Journal: ACM Transactions on Embedded Computing Systems	Publication Date: May 13, 2023
Citations: 4

Similar Papers

Exploiting Hybrid Precision for Training and Inference: A 2T-1FeFET Based Analog Synaptic Weight Cell
Xiaoyu Sun ... Panni Wang
-
Xiaoyu Sun, et. al.Xiaoyu Sun ... Panni Wang
01 Dec 2018
01 Dec 2018

STEGANOGRAFI AUDIO (WAV) MENGGUNAKAN METODE LSB (LEAST SIGNIFICANT BIT)
Arisman Arisman ... Windy Sentanu
CCIT Journal | VOL. 9
Arisman Arisman, et. al.Arisman Arisman ... Windy Sentanu
14 Jan 2016
CCIT Journal | VOL. 9

Steganography implementation on android smartphone using the LSB (least significant bit) to MP3 and WAV audio
Lindawati ... Rita Siburian
-
Lindawati, et. al. Lindawati ... Rita Siburian
01 Jul 2017
01 Jul 2017

Optimization of Convolutional Neural Network Using the Linearly Decreasing Weight Particle Swarm Optimization
...
Proceedings of the Annual Conference of JSAI | VOL. JSAI2022
, et. al. ...
16 Jan 2020
Proceedings of the Annual Conference of JSAI | VOL. JSAI2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neural Network Compression for Noisy Storage Devices

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems