Extremely Sparse Networks via Binary Augmented Pruning for Fast Image Classification.

Peisong Wang,Jian Cheng,Fanrong Li,Gang Li

doi:10.1109/tnnls.2021.3120409

Abstract

Network pruning and binarization have been demonstrated to be effective in neural network accelerator design for high speed and energy efficiency. However, most existing pruning approaches achieve a poor tradeoff between accuracy and efficiency, which on the other hand, has limited the progress of neural network accelerators. At the same time, binary networks are highly efficient, however, a large accuracy gap exists between binary networks and their full-precision counterparts. In this article, we investigate the merits of extremely sparse networks with binary connections for image classification through software-hardware codesign. More specifically, we first propose a binary augmented extremely pruning method that can achieve ~98% sparsity with small accuracy degradation. Then we design the hardware architecture based on the resulting sparse and binary networks, which extensively explores the benefits of extreme sparsity with negligible resource consumption introduced by binary branch. Experiments on large-scale ImageNet classification and field-programmable gate array (FPGA) demonstrate that the proposed software-hardware architecture can achieve a prominent tradeoff between accuracy and efficiency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE transactions on neural networks and learning systems	Publication Date: Aug 1, 2023
Citations: 3	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Extremely Sparse Networks via Binary Augmented Pruning for Fast Image Classification.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems

Lead the way for us

Similar Papers

Structured representation in deep neural network systems
Caiwen Ding
-
Caiwen DingCaiwen Ding
10 May 2021
10 May 2021

A Multi-Mode 8k-MAC HW-Utilization-Aware Neural Processing Unit With a Unified Multi-Precision Datapath in 4-nm Flagship Mobile SoC
Jun-Seok Park ... Jihoon Bang
IEEE Journal of Solid-State Circuits | VOL. 58
Jun-Seok Park, et. al.Jun-Seok Park ... Jihoon Bang
01 Jan 2023
IEEE Journal of Solid-State Circuits | VOL. 58

A 7.663-TOPS 8.2-W Energy-efficient FPGA Accelerator for Binary Convolutional Neural Networks (Abstract Only)
Yixing Li ... Kai Xu
-
Yixing Li, et. al.Yixing Li ... Kai Xu
22 Feb 2017
A 7.663-TOPS 8.2-W Energy-efficient FPGA Accelerator for Binary Convolutional Neural Networks (Abstract Only)
Yixing Li ... Kai Xu

Design of 32 bit Parallel Processor Core for High Energy Efficiency using Instruction-Levels Dynamic Voltage Scaling Technique
Yil-Suk Yang ... Woo-H Kwon
JSTS:Journal of Semiconductor Technology and Science | VOL. 9
Yil-Suk Yang, et. al.Yil-Suk Yang ... Woo-H Kwon
31 Mar 2009
JSTS:Journal of Semiconductor Technology and Science | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extremely Sparse Networks via Binary Augmented Pruning for Fast Image Classification.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems