Voice activity detection using convolutive non-negative sparse coding

Peng Teng, Yunde Jia

doi:10.1109/icassp.2013.6639095

Abstract

This paper presents a voice activity detection (VAD) approach that uses convolutive non-negative sparse coding (CNSC) to improve detection performance in low signal-to-noise ratio (SNR) conditions. The idea is to detect speech using noise-robust features from which the noise component has been removed. We first use the magnitude spectrum as a non-negative, additive low-level representation of audio signals, and learn a speech dictionary from clean speech and a noise dictionary from noise samples. The two dictionaries are then concatenated to form a global dictionary, and an audio signal is decomposed into coefficient vectors using CNSC on the global dictionary. Only the coefficients corresponding to bases from the speech dictionary are taken as features for the signal. Finally, the activity labels are obtained by decoding a conditional random field (CRF) constructed to model the context of an audio signal for VAD. Experiments demonstrate that our VAD approach achieves excellent performance in low SNR conditions.
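
The feature-extraction step described above can be illustrated with a rough sketch. It rests on several assumptions that are not from the paper: it uses plain (non-convolutive) non-negative sparse coding, i.e. a single time lag, in place of CNSC; both dictionaries are assumed to be already learned and are held fixed; the standard multiplicative update for a Euclidean cost with an L1 penalty is used; and the CRF decoding stage is omitted. The function name `speech_features` and its parameters are hypothetical.

```python
import numpy as np

def speech_features(V, W_speech, W_noise, sparsity=0.1, n_iter=200, eps=1e-9):
    """Decompose a magnitude spectrogram on a concatenated dictionary and
    return only the speech coefficients (illustrative simplification).

    V        : (n_bins, n_frames) non-negative magnitude spectrogram
    W_speech : (n_bins, n_speech_bases) dictionary assumed learned from clean speech
    W_noise  : (n_bins, n_noise_bases) dictionary assumed learned from noise
    """
    # Global dictionary: speech bases first, then noise bases.
    W = np.hstack([W_speech, W_noise])
    # Non-negative initialisation of the coefficient matrix.
    H = np.ones((W.shape[1], V.shape[1]))
    WtV = W.T @ V
    WtW = W.T @ W
    for _ in range(n_iter):
        # Multiplicative update for ||V - WH||^2 + sparsity * sum(H), with W fixed.
        # This is the single-lag case, not the convolutive model.
        H *= WtV / (WtW @ H + sparsity + eps)
    # Keep only the rows tied to speech bases; noise rows are discarded.
    return H[: W_speech.shape[1]]
```

In this sketch the returned matrix has one column per frame; in the approach described above, such per-frame coefficient vectors would be passed to a CRF for labelling rather than thresholded directly.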

Similar Papers

Voice Activity Detection Via Noise Reducing Using Non-Negative Sparse Coding
Peng Teng ... Yunde Jia
IEEE Signal Processing Letters | VOL. 20
01 May 2013

A new method for voice activity detection based on sparse representation
Parvin Ahmadi ... Mohsen Joneidi
01 Oct 2014

An Ensemble SVM-based Approach for Voice Activity Detection
Jayanta Dey ... Mohammad Ariful Haque
01 Dec 2018

Using spectral fluctuation of speech in multi-feature HMM-based voice activity detection
Miquel Espi ... Takuya Nishimoto
27 Aug 2011
