Robust voice activity detection based on concept of modulation transfer function in noisy reverberant environments

Shota Morita,Xugang Lu,Masashi Unoki,Masato Akagi

doi:10.1109/iscslp.2014.6936716

Abstract

Most of the current voice activity detection (VAD) algorithms deal with clean (noiseless) speech or speech with additive noise conditions. They cannot work in noisy reverberant environments or work poorly if they do, because speech is smeared due to the effects of noise and reverberation. This paper proposes a robust VAD algorithm for precisely detecting speech and non-speech periods in noisy reverberant environments. The proposed VAD algorithm consists of three blocks. The first block is an estimation of the signal to noise ratio (SNR) which is used to mitigate the additive noise effect on the speech power envelope. The second block is a speech power envelope dereverberation based on the modulation transfer function concept. The last block is a threshold processing on the dereverberated speech power envelope for speech/non-speech decision. Experiments on VAD in both artificial and realistic noisy reverberant environments revealed that the proposed VAD algorithm significantly outperforms the conventional VAD algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust voice activity detection based on concept of modulation transfer function in noisy reverberant environments

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments
Shota Morita ... Xugang Lu
Journal of Signal Processing Systems | VOL. 82
Shota Morita, et. al.Shota Morita ... Xugang Lu
11 Jun 2015
Journal of Signal Processing Systems | VOL. 82

A wavelet-based voice activity detection algorithm in noisy environments
Shi-Huang Chen ... Jhing-Fa Wang
-
Shi-Huang Chen, et. al. Shi-Huang Chen ... Jhing-Fa Wang
10 Dec 2002
10 Dec 2002

A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems
Damjan Vlaj ... Bogomir Horvat
EURASIP Journal on Advances in Signal Processing | VOL. 2005
Damjan Vlaj, et. al.Damjan Vlaj ... Bogomir Horvat
30 Mar 2005
EURASIP Journal on Advances in Signal Processing | VOL. 2005

Voice activity detection algorithm using nonlinear spectral weights, hangover and hangbefore criteria
Damjan Vlaj ... Marko Kos
Computers and Electrical Engineering | VOL. 38
Damjan Vlaj, et. al.Damjan Vlaj ... Marko Kos
02 Oct 2012
Computers and Electrical Engineering | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust voice activity detection based on concept of modulation transfer function in noisy reverberant environments

Abstract

Talk to us

Similar Papers