Robust speech detection for noisy environments

Oscar Varela, Luis A Hernandez, Ruben San-Segundo

doi:10.1109/maes.2011.6070277

Open Access

https://doi.org/10.1109/maes.2011.6070277

Journal: IEEE Aerospace and Electronic Systems Magazine
Publication Date: Nov 1, 2011
Citations: 12
License type: cc-by-nc-nd

Abstract

This paper presents a robust voice activity detector (VAD) based on Hidden Markov Models (HMMs) for stationary and non-stationary noise environments: inside motor vehicles (such as cars or planes) or inside buildings close to high-traffic places (such as an air traffic control (ATC) tower). In these environments, vehicle motors produce a high level of stationary noise, and people speaking at some distance from the main speaker can add non-stationary noise. The VAD presented herein is characterized by a new front-end and a noise level adaptation process that significantly increase its robustness across different signal-to-noise ratios (SNRs). The feature vector used by the VAD includes the most relevant Mel Frequency Cepstral Coefficients (MFCCs), normalized log energy, and delta log energy. The proposed VAD has been evaluated and compared to other well-known VADs using three databases containing different noise conditions: speech in clean environments (SNRs > 20 dB), speech recorded in stationary noise environments (inside or close to motor vehicles), and speech in non-stationary noise environments (including noise from bars, television, and far-field speakers). In all three cases, the proposed VAD obtains the lowest detection error at all SNRs compared with Acero's VAD (the reference for this work [4]) and other well-known VADs such as AMR, AURORA, and G.729 Annex B.
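
As a rough illustration of the kind of pipeline the abstract describes, the following minimal sketch computes two of the named features (normalized log energy and delta log energy) and decodes a two-state speech/non-speech HMM with the Viterbi algorithm. It is not the authors' implementation. The frame sizes, the max-based energy normalization, the `np.gradient` delta, the percentile-based Gaussian emission parameters, and the transition probability `p_stay` are all assumptions chosen for illustration. The MFCC features and the noise level adaptation process mentioned in the abstract are omitted.

```python
import numpy as np

def frame_signal(x, frame_len=400, hop=160):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def energy_features(x, frame_len=400, hop=160, eps=1e-10):
    """Return a (T, 2) array: normalized log energy and delta log energy."""
    frames = frame_signal(x, frame_len, hop)
    log_e = np.log(np.sum(frames ** 2, axis=1) + eps)
    # Assumed normalization: subtract the utterance maximum (loudest frame -> 0).
    norm_log_e = log_e - log_e.max()
    # Assumed delta: finite-difference slope of the log-energy track.
    delta_log_e = np.gradient(log_e)
    return np.stack([norm_log_e, delta_log_e], axis=1)

def gaussian_loglik(v, mean, std):
    """Log-density of a scalar Gaussian, evaluated element-wise."""
    return -0.5 * ((v - mean) / std) ** 2 - np.log(std * np.sqrt(2 * np.pi))

def viterbi_two_state(loglik, p_stay=0.95):
    """Most likely state path (0 = non-speech, 1 = speech) for a 2-state HMM.

    loglik: (T, 2) per-frame emission log-likelihoods.
    p_stay: assumed self-transition probability, shared by both states.
    """
    T = loglik.shape[0]
    log_a = np.log(np.array([[p_stay, 1 - p_stay],
                             [1 - p_stay, p_stay]]))
    score = np.empty((T, 2))
    back = np.zeros((T, 2), dtype=int)
    score[0] = np.log(0.5) + loglik[0]
    for t in range(1, T):
        cand = score[t - 1][:, None] + log_a   # cand[i, j]: state i -> state j
        back[t] = np.argmax(cand, axis=0)
        score[t] = np.max(cand, axis=0) + loglik[t]
    path = np.empty(T, dtype=int)
    path[-1] = np.argmax(score[-1])
    for t in range(T - 2, -1, -1):
        path[t] = back[t + 1, path[t + 1]]
    return path

def detect_speech(x):
    """Label each frame as speech (1) or non-speech (0)."""
    norm_log_e = energy_features(x)[:, 0]
    # Hypothetical emission models: Gaussians centred on low/high percentiles.
    noise_mean = np.percentile(norm_log_e, 10)
    speech_mean = np.percentile(norm_log_e, 90)
    loglik = np.stack([gaussian_loglik(norm_log_e, noise_mean, 1.0),
                       gaussian_loglik(norm_log_e, speech_mean, 1.0)], axis=1)
    return viterbi_two_state(loglik)

# Synthetic check: 3 s at 16 kHz, with a tone standing in for speech in second 2.
rng = np.random.default_rng(0)
signal = 0.01 * rng.standard_normal(48000)
signal[16000:32000] += 0.5 * np.sin(2 * np.pi * 200 * np.arange(16000) / 16000)
labels = detect_speech(signal)
print(labels[:5], labels[140:145])
```

The HMM here mainly acts as a temporal smoother: a high `p_stay` penalizes rapid switching between the two states, which is one common reason to prefer an HMM over a per-frame threshold.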

Similar Papers

Sparse HMM-based speech enhancement method for stationary and non-stationary noise environments
Feng Deng ... Chang-Chun Bao
01 Apr 2015

Voice activity detection over multiresolution subspaces
N Erdol ... R Schultz
16 Mar 2000

DNN-based voice activity detection with local feature shift technique
Tae Gyoon Kang ... Woo Hyun Kang
01 Dec 2016

Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement
Shifeng Ou ... Ying Gao
Chinese Journal of Electronics | VOL. 27
01 Nov 2018