Leveraging structural information in music-speech dectection

Jinyu Han Jinyu Han,Bob Coover

doi:10.1109/icmew.2013.6618387

Abstract

Detecting music or speech signals in an audio mixture is an important but challenging problem. Even more challenging is detecting when both are present in a signal at the same time. This problem requires not only discriminating speech or music from each other but also detecting its presence in a mixture with interfering signals. In this paper, we address the problem of detecting speech and music signals in the presence of each other. We focus on leveraging features that capture the structural properties of audio to improve the performance of concurrent music-speech detection. Continuous Frequency Activation (CFA) is used to account for the sustained pitch/harmonic activities, and a new feature called Transient Activation (TAC) is proposed for the transient/percussive activities in an audio signal. The effectiveness of these features along with other acoustic features is evaluated in different statistical classification schemes. Feature selection is conducted to select the best feature set to maximize the detection performance. Experimental results on real world broadcast recordings have shown significant improvement by using the above techniques to incorporate the structural information of audio.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Leveraging structural information in music-speech dectection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

CS reconstruction of the speech and musical signals
Trifun Savic ... Radoje Albijanic
-
Trifun Savic, et. al.Trifun Savic ... Radoje Albijanic
01 Jun 2015
01 Jun 2015

Comparison of optimum filter length in linear prediction between speech and musical signals
Ondrej Raso ... Miroslav Balik
-
Ondrej Raso, et. al.Ondrej Raso ... Miroslav Balik
01 Aug 2011
01 Aug 2011

Blind source separation of speech and music signals using harmonic frequency dependent independent vector analysis
C.H Choi ... W Chang
Electronics Letters | VOL. 48
C.H Choi, et. al.C.H Choi ... W Chang
01 Jan 2012
Electronics Letters | VOL. 48

Classification and Separation of Audio and Music Signals
Abdullah I Al-Shoshan
-
Abdullah I Al-ShoshanAbdullah I Al-Shoshan
02 Jun 2021
02 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Leveraging structural information in music-speech dectection

Abstract

Talk to us

Similar Papers