Abstract

At present, the environment sound recognition system mainly identifies environment sounds with deep neural networks and a wide variety of auditory features. Therefore, it is necessary to analyze which auditory features are more suitable for deep neural networks based ESCR systems. In this paper, we chose three sound features which based on two widely used filters:the Mel and Gammatone filter banks. Subsequently, the hybrid feature MGCC is presented. Finally, a deep convolutional neural network is proposed to verify which features are more suitable for environment sound classification and recognition tasks. The experimental results show that the signal processing features are better than the spectrogram features in the deep neural network based environmental sound recognition system. Among all the acoustic features, the MGCC feature achieves the best performance than other features. Finally, the MGCC-CNN model proposed in this paper is compared with the state-of-the-art environmental sound classification models on the UrbanSound 8K dataset. The results show that the proposed model has the best classification accuracy.

Highlights

  • The results show that the proposed model has the best classification ac⁃ curacy

  • G( f,t) = atn-1e -2πbt cos(2πft + φ)

Read more

Summary

Introduction

西北工业大学学报 Journal of Northwestern Polytechnical University https: / / doi.org / 10.1051 / jnwpu / 20203810162 Æ1.航天飞行动力学技术重点实验室, 陕西 西安 710072; 2.西北工业大学 航天学院, 陕西 西安 710072; çç3.Signals, Images, and Intelligent Systems Laboratory( LISSI / EA 3956) , University Paris⁃Est Creteil, è Senart⁃FB Institute of Technology, 36⁃37 rue Charpak, 77127 Lieusaint, France

Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.