Data-Independent Structured Pruning of Neural Networks via Coresets.

Ben Mussay,Dan Feldman,Margarita Osadchy,Vladimir Braverman,Samson Zhou

doi:10.1109/tnnls.2021.3088587

Abstract

Model compression is crucial for the deployment of neural networks on devices with limited computational and memory resources. Many different methods show comparable accuracy of the compressed model and similar compression rates. However, the majority of the compression methods are based on heuristics and offer no worst case guarantees on the tradeoff between the compression rate and the approximation error for an arbitrarily new sample. We propose the first efficient structured pruning algorithm with a provable tradeoff between its compression rate and the approximation error for any future test sample. Our method is based on the coreset framework, and it approximates the output of a layer of neurons/filters by a coreset of neurons/filters in the previous layer and discards the rest. We apply this framework in a layer-by-layer fashion from the bottom to the top. Unlike previous works, our coreset is data-independent, meaning that it provably guarantees the accuracy of the function for any input [Formula: see text], including an adversarial one.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data-Independent Structured Pruning of Neural Networks via Coresets.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems

Lead the way for us

Journal: IEEE transactions on neural networks and learning systems	Publication Date: Dec 1, 2022
Citations: 5

Similar Papers

Optimising digital signal processor‐based defect detection in smart manufacturing with lightweight convolutional neural networks
Han Yue ... Rucen Wang
IET Collaborative Intelligent Manufacturing | VOL. 6
Han Yue, et. al.Han Yue ... Rucen Wang
12 Jan 2024
IET Collaborative Intelligent Manufacturing | VOL. 6

A Full-Image Full-Resolution End-to-End-Trainable CNN Framework for Image Forgery Detection
Francesco Marra ... Luisa Verdoliva
IEEE Access | VOL. 8
Francesco Marra, et. al.Francesco Marra ... Luisa Verdoliva
01 Jan 2020
IEEE Access | VOL. 8

Lightweight Hardware Architecture for Object Detection in Driver Assistance Systems
Bhaumik Vaidya ... Chirag Paunwala
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 36
Bhaumik Vaidya, et. al.Bhaumik Vaidya ... Chirag Paunwala
06 Apr 2022
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 36

Lasso Regression Based Channel Pruning for Efficient Object Detection Model
Xiang Li ... Li Chen
-
Xiang Li, et. al.Xiang Li ... Li Chen
01 Jun 2019
01 Jun 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data-Independent Structured Pruning of Neural Networks via Coresets.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems