Audio Tagging Using CNN Based Audio Neural Networks for Massive Data Processing

J Samuel Manoharan

doi:10.36548/jaicn.2021.4.008

Abstract

Sound event detection, speech emotion classification, music classification, acoustic scene classification, audio tagging, and several other audio pattern recognition applications depend heavily on advances in machine learning. Neural networks have recently been applied to these audio pattern recognition problems. Existing systems, however, are built on specific datasets of limited duration. In natural language processing and computer vision, systems pretrained on large datasets have performed well across several tasks in recent years, whereas audio pattern recognition research with large-scale datasets remains limited. In this paper, an audio neural network is pretrained on a large-scale audio dataset and then transferred to several audio-related tasks. Several convolutional neural networks are used to model the proposed audio neural network, and the computational complexity and performance of the system are analyzed. The waveform and log-mel spectrogram are used as input features in this architecture. In audio tagging, the proposed system outperforms the existing systems with a mean average precision of 0.45. The performance of the proposed model is demonstrated by applying the audio neural network to five specific audio pattern recognition tasks.
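
The audio tagging result above is reported as a single averaged score. Assuming this refers to mean average precision (mAP), the usual metric for multi-label audio tagging, the following is a minimal sketch of how such a score could be computed. The function names `average_precision` and `mean_average_precision`, the toy labels, and the handling of classes without positive clips are illustrative assumptions and are not taken from the paper.

```python
from typing import List, Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Average precision for one class.

    labels: 1 if the clip contains the class, 0 otherwise.
    scores: predicted probability of the class for each clip.
    """
    # Rank clips from the highest to the lowest predicted score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precisions: List[float] = []
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            # Precision measured at the rank of each positive clip.
            precisions.append(hits / rank)
    # Assumption for this sketch: a class with no positive clips scores 0.0.
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(
    label_matrix: Sequence[Sequence[int]],
    score_matrix: Sequence[Sequence[float]],
) -> float:
    """Mean of the per-class average precision (rows are clips, columns are classes)."""
    n_classes = len(label_matrix[0])
    per_class = [
        average_precision(
            [row[c] for row in label_matrix],
            [row[c] for row in score_matrix],
        )
        for c in range(n_classes)
    ]
    return sum(per_class) / n_classes


# Toy example: 4 clips, 2 hypothetical sound classes.
toy_labels = [[1, 0], [0, 1], [1, 0], [0, 0]]
toy_scores = [[0.9, 0.2], [0.8, 0.9], [0.7, 0.1], [0.1, 0.3]]
toy_map = mean_average_precision(toy_labels, toy_scores)
```

In the toy data, the first class has its two positive clips ranked first and third, and the second class has its single positive clip ranked first, so the per-class scores are averaged into one number in the same way a reported mAP summarizes all tagging classes.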

Journal: Journal of Artificial Intelligence and Capsule Networks
Publication Date: Dec 24, 2021
Citations: 2

Similar Papers

PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
Qiuqiang Kong ... Turab Iqbal
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28
01 Jan 2020

A Region Based Attention Method for Weakly Supervised Sound Event Detection and Classification
Jie Yan ... Li-Rong Dai
01 May 2019

Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization
Qiuqiang Kong ... Yong Xu
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28
01 Jan 2020

Task-Aware Mean Teacher Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection
Jie Yan ... Ian Mcloughlin
01 May 2020
