E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks

Arshdeep Singh, Haohe Liu, Mark D Plumbley

doi:10.3397/in_2023_1083

Abstract

Sounds carry an abundance of information about activities and events in our everyday environment, such as traffic noise, road works, music, or people talking. Recent machine learning methods, such as convolutional neural networks (CNNs), have been shown to automatically recognize sound activities, a task known as audio tagging. One such method, pre-trained audio neural networks (PANNs), provides a neural network that has been pre-trained on over 500 sound classes from the publicly available AudioSet dataset and can be used as a baseline or starting point for other tasks. However, the existing PANNs model has a high computational complexity and a large storage requirement. This could limit the potential for deploying PANNs on resource-constrained devices, such as on-the-edge sound sensors, and could lead to high energy consumption if many such devices were deployed. In this paper, we reduce the computational complexity and memory requirement of the PANNs model by pruning redundant parameters from it. The resulting Efficient PANNs (E-PANNs) model requires 36% fewer computations and 70% less memory, and also slightly improves the sound recognition (audio tagging) performance. The code for the E-PANNs model has been released under an open source license.
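As a rough illustration of what pruning redundant parameters from a CNN can look like, the sketch below removes whole convolutional filters from a toy layer. The abstract does not state which pruning criterion E-PANNs uses, so the L1-norm ranking here is an assumption chosen for simplicity, and the function names (`l1_norm`, `prune_filters`, `count_parameters`) and the toy weights are hypothetical rather than taken from the released code.

```python
def l1_norm(filter_weights):
    """Sum of absolute weights in one (flattened) filter."""
    return sum(abs(w) for w in filter_weights)


def prune_filters(filters, prune_ratio):
    """Drop the prune_ratio fraction of filters with the smallest L1 norm.

    Assumption: low L1 norm is used as a stand-in for "redundant".
    Returns (kept_indices, kept_filters), preserving original filter order.
    """
    if not 0.0 <= prune_ratio < 1.0:
        raise ValueError("prune_ratio must be in [0, 1)")
    n_prune = int(len(filters) * prune_ratio)
    # Rank filter indices from least to most important.
    ranked = sorted(range(len(filters)), key=lambda i: l1_norm(filters[i]))
    kept_indices = sorted(ranked[n_prune:])
    return kept_indices, [filters[i] for i in kept_indices]


def count_parameters(filters):
    """Total number of weights across all filters in the layer."""
    return sum(len(f) for f in filters)


# Toy layer: 4 filters with 3 weights each (hypothetical values).
layer = [
    [0.9, -0.8, 0.7],
    [0.01, 0.02, -0.01],
    [0.5, 0.4, -0.6],
    [-0.03, 0.0, 0.02],
]
kept, pruned_layer = prune_filters(layer, prune_ratio=0.5)
reduction = 1 - count_parameters(pruned_layer) / count_parameters(layer)
print(kept, f"{reduction:.0%} fewer parameters")
```

In a real network, removing a filter also removes the matching input channel of the following layer, and the pruned model is typically fine-tuned afterwards to recover accuracy; neither step is shown in this sketch.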

Journal: INTER-NOISE and NOISE-CON Congress and Conference Proceedings
Publication Date: Nov 30, 2023
Citations: 1

Similar Papers

Audio Tagging Using CNN Based Audio Neural Networks for Massive Data Processing
J Samuel Manoharan
Journal of Artificial Intelligence and Capsule Networks | VOL. 3
24 Dec 2021

Task-Aware Mean Teacher Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection
Jie Yan ... Ian Mcloughlin
01 May 2020

A Comparison of Attention Mechanisms of Convolutional Neural Network in Weakly Labeled Audio Tagging
Yuanbo Hou ... Shengchen Li
01 Jan 2019

Convolutional gated recurrent neural network incorporating spatial features for audio tagging
Yong Xu ... Wenwu Wang
01 May 2017
