Abstract

Deep learning has attracted growing attention from researchers for its ability to transform input data into effective representations through various learning algorithms. It requires large and varied datasets to ensure good performance and generalization, but manually labeling a dataset is time consuming and expensive, which limits dataset size. Websites such as YouTube and Freesound provide large volumes of audio data along with their metadata. General-purpose audio tagging is one of the newly proposed tasks in DCASE and can give valuable insights into the classification of various acoustic sound events. The proposed work analyzes large-scale, imbalanced audio data for an audio tagging system. The baseline of the proposed audio tagging system is a Convolutional Neural Network operating on Mel Frequency Cepstral Coefficients. The system is developed in Google Colaboratory on a free Tesla K80 GPU using Keras, TensorFlow, and PyTorch. The experimental results show that the proposed audio tagging system achieves an average mean precision of 0.92.
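To make the baseline concrete, the following is a minimal sketch of an MFCC-plus-CNN pipeline of the kind the abstract describes, written in Keras/TensorFlow as named above. The use of librosa for feature extraction, the specific layer sizes, the MFCC dimensions, and the 41-class output (the label set of the DCASE general-purpose audio tagging task) are assumptions for illustration and are not specified in the abstract itself.

```python
import numpy as np
import librosa  # assumed feature-extraction library; not named in the abstract
from tensorflow.keras import layers, models


def extract_mfcc(path, sr=44100, n_mfcc=40, max_frames=173):
    """Load an audio clip and compute a fixed-size MFCC patch (assumed sizes)."""
    y, _ = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    # Pad or truncate along the time axis so every clip has the same shape.
    if mfcc.shape[1] < max_frames:
        mfcc = np.pad(mfcc, ((0, 0), (0, max_frames - mfcc.shape[1])))
    else:
        mfcc = mfcc[:, :max_frames]
    return mfcc[..., np.newaxis]  # add a channel axis for the CNN


def build_cnn(input_shape=(40, 173, 1), n_classes=41):
    """Small CNN baseline over MFCC 'images'; architecture details are illustrative."""
    model = models.Sequential([
        layers.Conv2D(32, (3, 3), activation="relu", input_shape=input_shape),
        layers.MaxPooling2D((2, 2)),
        layers.Conv2D(64, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dropout(0.5),
        layers.Dense(n_classes, activation="softmax"),
    ])
    model.compile(optimizer="adam",
                  loss="categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```

In such a setup, each clip is converted to a fixed-size MFCC matrix and the CNN is trained to predict one tag per clip; class imbalance, as noted in the abstract, would typically be handled with class weighting or resampling on top of this sketch.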
