Abstract

For the last decades the work on audiorecognition has been directed to speech and music,however, an increasing interest for the classificationand recognition of acoustic events is observed for thelast years. This poses the challenge to determine theidentity of sounds, their sources, and the importanceof analysing the context of the scenario where theyact.The aim of this paper is focused on evaluatingthe robustness to retain the characteristic information ofan acoustic event against the background noise usingaudio features in the task of identifying acoustic eventsfrom a mixture of sounds that are produced in a kitchenenvironment.A new database of kitchen sounds wasbuilt by us, since in the reviewed literature there isno similar benchmark that allows us to evaluate thisissue in conditions of 3 decibels for the signal to noiseratio. In our study, we compared two methods of audiofeatures, Multiband Spectral Entropy Signature (MSES)and Mel Frequency Cepstral Coefficients (MFCC). Toevaluate the performance of both MSES and MFCC,we used different classifiers such as Similarity Distance,k-Nearest Neighbors, Support Vector Machines andArtificial Neural Networks (ANN). The results showedthat MSES supported with an ANN outperforms anyother combination of classifiers with MSES or MFCC forgetting a better score.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.