Abstract

An efficient sound classification algorithm can benefit a multitude of applications involving home, wildlife, and residential surveillance, traffic regulation, medical monitoring etc. An important application is to identify and notify the ambulance siren sound amidst a noisy environment. This work develops an efficient and less complex architecture for siren detection and urban sound classification using different sound to image transformation methods viz: Mel-spectrogram, Scalogram and Fourier decomposition method(FDM). Effects of augmentation and pre-processing techniques on the efficacy of the developed architecture against pre-trained models is analyzed. The performance of the proposed algorithm is tested on Urbansound8K dataset for urban sound classification, and a multi -source dataset for ambulance siren detector. The CNN proposed here gives an accuracy of 89.66% for the former case and 99.35% for the latter.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.