Sound Classification using Sound Spectrum Features and Convolutional Neural Networks

Ki In Tan,Bu Sung Lee,Seanglidet Yean

doi:10.1109/ihsh57076.2022.10092143

Abstract

This paper proposes an alternative approach to sound classification using sound spectrum features, differing from the use of the Mel-Frequency Cepstral Coefficients (MFCC). Aligning with the crowd sourcing data collection application NoiseCapture, the data are kept in form of the post-processed sound spectrum instead of the raw audio files to maintain privacy of volunteers. Under such circumstances, MFCC, which requires audio processing, cannot be directly obtained from nor maximize the features of sound spectrum data stored in the application. As sound spectrum does not undergo further feature transformation, it retains audio features from the audio file and should therefore be classifiable when passed into a trained sound spectrum model. Hence, in this study, we aim to evaluate whether sound spectrum could be used as a replacement of MFCC, especially when audio file is inaccessible. The UrbanSound8K dataset and a mix of deep learning and machine learning models were used for the comparison. Experiment results show sound spectrum achieving comparable results in Convolutional Neural Network (CNN), with better predictions than its MFCC counterpart. Further comparisons draw insights that illustrate the need for more finetuning for sound spectrum data when using non-CNN models for sound classification due to the shape of the input features.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sound Classification using Sound Spectrum Features and Convolutional Neural Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Explainable artificial intelligence (XAI) for predicting the need for intubation in methanol-poisoned patients: a study comparing deep and machine learning models
Khadijeh Moulaei ... Mitra Rahimi
Scientific Reports | VOL. 14
Khadijeh Moulaei, et. al.Khadijeh Moulaei ... Mitra Rahimi
08 Jul 2024
Scientific Reports | VOL. 14

A novel deep-learning technique for forecasting oil price volatility using historical prices of five precious metals in context of green financing – A comparison of deep learning, machine learning, and statistical models
Muhammad Mohsin ... Fouad Jamaani
Resources Policy | VOL. 86
Muhammad Mohsin, et. al.Muhammad Mohsin ... Fouad Jamaani
01 Oct 2023
Resources Policy | VOL. 86

Forest Smoke-Fire Net (FSF Net): A Wildfire Smoke Detection Model That Combines MODIS Remote Sensing Images with Regional Dynamic Brightness Temperature Thresholds
Yunhong Ding ... Mingyang Wang
Forests | VOL. 15
Yunhong Ding, et. al.Yunhong Ding ... Mingyang Wang
10 May 2024
Forests | VOL. 15

A Novel RBFNN-CNN Model for Speaker Identification in Stressful Talking Environments
Ali Bou Nassif ... Noha Alnazzawi
Applied Sciences | VOL. 12
Ali Bou Nassif, et. al.Ali Bou Nassif ... Noha Alnazzawi
11 May 2022
Applied Sciences | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sound Classification using Sound Spectrum Features and Convolutional Neural Networks

Abstract

Talk to us

Similar Papers