Channel Compression: Rethinking Information Redundancy Among Channels in CNN Architecture

Jinhua Liang,Tao Zhang,Guoqing Feng

doi:10.1109/access.2020.3015714

Jinhua Liang, Tao Zhang + Show 1 more

Open Access

https://doi.org/10.1109/access.2020.3015714

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 40	License type: CC BY 4.0

Affiliation: Tianjin University

Abstract

Model compression and acceleration are attracting increasing attentions due to the demand for embedded devices and mobile applications. Research on efficient convolutional neural networks (CNNs) aims at removing feature redundancy by decomposing or optimizing the convolutional calculation. In this work, feature redundancy is assumed to exist among channels in CNN architectures, which provides some leeway to boost calculation efficiency. Aiming at channel compression, a novel convolutional construction named compact convolution is proposed to embrace the progress in spatial convolution, channel grouping and pooling operation. Specifically, the depth-wise separable convolution and the point-wise interchannel operation are utilized to efficiently extract features. Different from the existing channel compression method which usually introduces considerable learnable weights, the proposed compact convolution can reduce feature redundancy with no extra parameters. With the point-wise interchannel operation, compact convolutions implicitly squeeze the channel dimension of feature maps. To explore the rules on reducing channel redundancy in neural networks, the comparison is made among different point-wise interchannel operations. Moreover, compact convolutions are extended to tackle with multiple tasks, such as acoustic scene classification, sound event detection and image classification. The extensive experiments demonstrate that our compact convolution not only exhibits high effectiveness in several multimedia tasks, but also can be efficiently implemented by benefiting from parallel computation.

Highlights

Convolutional neural networks (CNNs) are attracting considerable attention in an increasing array of area, such as computer vision [1]–[3], computational acoustics [4]–[6] and natural language processing [7]–[9]
We found that feature redundancy exists among channels in CNN architecture, i.e., amounts of interchannel information is unimportant or even unnecessary in some cases
Rather than a better function approximator, this paper focuses on the efficient approaches for reducing the interchannel redundancy, and compressing the dimension of feature maps in a larger range

Summary

Introduction

Convolutional neural networks (CNNs) are attracting considerable attention in an increasing array of area, such as computer vision [1]–[3], computational acoustics [4]–[6] and natural language processing [7]–[9]. The general trend is to design deeper and more complicated network architecture to pursue better performance. Massive resources are required for desired performance, which hinders CNN-based classifiers from the real-time inference in mobile applications. Over the past few decades, various methods have been exploited for model compression and acceleration, including pruning [10]–[13], weight sharing [14], [15], low-rank matrix factorization [16]–[18] and knowledge distillation [19]–[21]. The associate editor coordinating the review of this manuscript and approving it for publication was Seok-Bum Ko. The associate editor coordinating the review of this manuscript and approving it for publication was Seok-Bum Ko Despite their desirable compression abilities, most of the compression methods typically suffer from two major drawbacks. Various manually chosen parameters (and even a lot of empirical engineering that only experts are competent to deal with) are required in these methods

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Channel Compression: Rethinking Information Redundancy Among Channels in CNN Architecture

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning
Noriyuki Tonami ... Keisuke Imoto
IEICE Transactions on Information and Systems | VOL. E104.D
Noriyuki Tonami, et. al.Noriyuki Tonami ... Keisuke Imoto
16 Oct 2020
IEICE Transactions on Information and Systems | VOL. E104.D

Receptive Field Regularization Techniques for Audio Classification and Tagging With Deep Convolutional Neural Networks
Khaled Koutini ... Gerhard Widmer
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Khaled Koutini, et. al.Khaled Koutini ... Gerhard Widmer
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

Automatically Designing CNN Architectures for Acoustic Scene Classification
Noha W Hasan ... Hazem M Abbas
-
Noha W Hasan, et. al.Noha W Hasan ... Hazem M Abbas
15 Dec 2021
15 Dec 2021

Sound Context Classification based on Joint Learning Model and Multi-Spectrogram Features
Dat Ngo ... Lam Pham
International Journal of Computing | VOL. -
Dat Ngo, et. al.Dat Ngo ... Lam Pham
30 Jun 2022
International Journal of Computing | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Channel Compression: Rethinking Information Redundancy Among Channels in CNN Architecture

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access