Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD.

Kun-Ching Wang

doi:10.3390/e22020183

Abstract

A robust approach for the application of audio content classification (ACC) is proposed in this paper, especially in variable noise-level conditions. We know that speech, music, and background noise (also called silence) are usually mixed in the noisy audio signal. Based on the findings, we propose a hierarchical ACC approach consisting of three parts: voice activity detection (VAD), speech/music discrimination (SMD), and post-processing. First, entropy-based VAD is successfully used to segment input signal into noisy audio and noise even if variable-noise level is happening. The determinations of one-dimensional (1D)-subband energy information (1D-SEI) and 2D-textural image information (2D-TII) are then formed as a hybrid feature set. The hybrid-based SMD is achieved because the hybrid feature set is input into the classification of the support vector machine (SVM). Finally, a rule-based post-processing of segments is utilized to smoothly determine the output of the ACC system. The noisy audio is successfully classified into noise, speech, and music. Experimental results show that the hierarchical ACC system using hybrid feature-based SMD and entropy-based VAD is successfully evaluated against three available datasets and is comparable with existing methods even in a variable noise-level environment. In addition, our test results with the VAD scheme and hybrid features also shows that the proposed architecture increases the performance of audio content discrimination.

Highlights

With the rapid growth of information technology, multimedia management is a very crucial task
We presented a new algorithm of audio content classification (ACC) for applications under a variable noise-level environment
It was found that using hybrid-based features can discriminate the noisy audio signal into speech and music

Summary

Introduction

With the rapid growth of information technology, multimedia management is a very crucial task. In the field of AV indexing and retrieval, the speech/music discrimination (SMD) is a very crucial task for the audio content classification (ACC) system or general audio detection and classification (GADC) [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18]. A few studies focused on speech and song/music discrimination [35,36,37] Some features such as loudness and sharpness have been incorporated in the human hearing process to describe sounds [38,39].

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Entropy	Publication Date: Feb 6, 2020
Citations: 6	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy

Lead the way for us

Similar Papers

Robust Voice Activity Detection Using Long-Term Signal Variability
Prasanta Kumar Ghosh ... Andreas Tsiartas
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19
Prasanta Kumar Ghosh, et. al.Prasanta Kumar Ghosh ... Andreas Tsiartas
01 Mar 2011
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19

Self-supervised speech denoising using only noisy audio signals
Jiasong Wu ... Huazhong Shu
Speech Communication | VOL. 149
Jiasong Wu, et. al.Jiasong Wu ... Huazhong Shu
23 Mar 2023
Speech Communication | VOL. 149

Speech/music discrimination using hybrid-based feature extraction for audio data indexing
Kun-Ching Wang ... Ying-Ru Yang
-
Kun-Ching Wang, et. al.Kun-Ching Wang ... Ying-Ru Yang
01 Jul 2017
01 Jul 2017

Efficient voice activity detection algorithm using long-term spectral flatness measure
Yanna Ma ... Akinori Nishihara
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2013
Yanna Ma, et. al.Yanna Ma ... Akinori Nishihara
16 Jul 2013
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Entropy