Comparison of acoustic and visual voice activity detection for noisy speech recognition

Piotr Bratoszewski, Andrzej Czyzewski, Grzegorz Szwoch

doi:10.1109/spa.2016.7763629

Abstract

The problem of accurately distinguishing speaker utterances from noise segments in a speech signal is considered. The influence of applying voice activity detection (VAD) to speech signals on the accuracy of an automatic speech recognition (ASR) system is presented. The examined VAD methods are based on the acoustic and visual modalities, and detection is studied in both clean and noisy speech. The speech signal was recorded in a real-life scenario in an office-like environment, with babble noise generated by loudspeakers at different levels. The proposed visual voice activity detection method is aimed at enhancing ASR accuracy when the signal-to-noise ratio is low. English numerals are used as the speech material, and Word Error Rate (WER) is employed for evaluation.
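Word Error Rate is conventionally defined as the number of word substitutions, deletions, and insertions needed to turn the reference transcript into the recognizer output, divided by the number of reference words. The abstract does not give the scoring procedure, so the following is only a minimal sketch of that standard definition; the function name `word_error_rate` and the example numeral strings are illustrative assumptions, not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level edit distance divided by reference length.

    Illustrative helper (hypothetical name); not the authors' scoring tool.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Made-up numeral sequences: one substitution ("two" -> "too")
# and one deletion ("four") against four reference words.
wer = word_error_rate("one two three four", "one too three")
```

Because insertions count as errors, WER can exceed 1.0 when the recognizer emits many spurious words, which is one way noise segments passed to the recognizer without VAD can inflate the score.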
