Single-shot pruning and quantization for hardware-friendly neural network acceleration

Bofeng Jiang,Jun Chen,Yong Liu

doi:10.1016/j.engappai.2023.106816

Single-shot pruning and quantization for hardware-friendly neural network acceleration

Bofeng Jiang, Jun Chen + Show 1 more

https://doi.org/10.1016/j.engappai.2023.106816

Copy DOI

Journal: Engineering Applications of Artificial Intelligence	Publication Date: Aug 27, 2023
Citations: 2

Affiliation: Zhejiang University

#Accuracy Loss #Single Process + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

Applying CNN on embedded systems is challenging due to model size limitations. Pruning and quantization can help, but are time-consuming to apply separately. Our Single-Shot Pruning and Quantization strategy addresses these issues by quantizing and pruning in a single process. We evaluated our method on CIFAR-10 and CIFAR-100 datasets for image classification. Our model is 69.4% smaller with little accuracy loss, and runs 6–8 times faster on NVIDIA Xavier NX hardware.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Engineering Applications of Artificial Intelligence

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.