Learnable filter-banks for CNN-based audio applications

Helena Peic Tukuljac,Benjamin Ricaud,Nicolas Aspert,Laurent Colbois

doi:10.7557/18.6279

Abstract

We investigate the design of a convolutional layer where kernels are parameterized functions. This layer aims at being the input layer of convolutional neural networks for audio applications or applications involving time-series. The kernels are defined as one-dimensional functions having a band-pass filter shape, with a limited number of trainable parameters. Building on the literature on this topic, we confirm that networks having such an input layer can achieve state-of-the-art accuracy on several audio classification tasks. We explore the effect of different parameters on the network accuracy and learning ability. This approach reduces the number of weights to be trained and enables larger kernel sizes, an advantage for audio applications. Furthermore, the learned filters bring additional interpretability and a better understanding of the audio properties exploited by the network.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the Northern Lights Deep Learning Workshop	Publication Date: Mar 28, 2022
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Learnable filter-banks for CNN-based audio applications

Abstract

Talk to us

Similar Papers

More From: Proceedings of the Northern Lights Deep Learning Workshop

Lead the way for us

Similar Papers

Optimal design of both rectified layer and pooling layer of convolutional neural network for noninvasive blood glucose estimation system
Xin Wu ... Yuwei Liu
-
Xin Wu, et. al.Xin Wu ... Yuwei Liu
01 Jul 2016
01 Jul 2016

Joint Optimization of Sensing, Decision-Making and Motion-Controlling for Autonomous Vehicles: A Deep Reinforcement Learning Approach
Longquan Chen ... Weike Pan
IEEE Transactions on Vehicular Technology | VOL. 71
Longquan Chen, et. al.Longquan Chen ... Weike Pan
01 May 2022
IEEE Transactions on Vehicular Technology | VOL. 71

Automatic Detection of Age-Related Macular Degeneration from Optical Coherence Tomography Images
Camilla Wan Qi Zheng ... Ruchir Srivastava
-
Camilla Wan Qi Zheng, et. al.Camilla Wan Qi Zheng ... Ruchir Srivastava
01 Jan 2023
01 Jan 2023

ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks Using Angle Sensitive Pixels
Huaijin G Chen ... Suren Jayasuriya
-
Huaijin G Chen, et. al.Huaijin G Chen ... Suren Jayasuriya
01 Jun 2016
01 Jun 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learnable filter-banks for CNN-based audio applications

Abstract

Talk to us

Similar Papers

More From: Proceedings of the Northern Lights Deep Learning Workshop