Abstract

Current voice activity detection methods generally rely on acoustic information alone, and are therefore susceptible to misclassification in the presence of other acoustic sources such as a competing speaker or non-stationary noise. To address this issue, the authors propose a new voice activity detection method that uses solely visual information, in the form of the speaker's mouth region; such video information is unaffected by the acoustic environment. Simulations show that a high percentage of correct silence detection (CSD) can be obtained together with a low percentage of false silence detection (FSD). Comparisons with two other visual voice activity detectors show the proposed method to be consistently more accurate, yielding a 4% improvement in CSD on average. The usefulness of the method is confirmed by applying it to a previously published audio–visual convolutive blind source separation algorithm, to increase the intelligibility of a speaker.
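As a rough illustration only (not the authors' detector), the idea of visual voice activity detection and the CSD/FSD metrics named above can be sketched by thresholding mouth-region motion energy on a synthetic clip. The function names, the threshold value, and the synthetic data below are all assumptions made for this sketch.

```python
import numpy as np

def visual_vad(mouth_frames, threshold=0.5):
    """Label each frame speech (True) or silence (False) from mouth-region
    motion energy -- a crude stand-in for a visual VAD, for illustration."""
    frames = mouth_frames.astype(float)
    # Mean absolute frame-to-frame difference as a lip-motion proxy.
    energy = np.abs(np.diff(frames, axis=0)).mean(axis=(1, 2))
    # Normalise to [0, 1] so a single threshold applies across clips.
    span = energy.max() - energy.min()
    energy = (energy - energy.min()) / (span + 1e-9)
    # The first frame has no predecessor; reuse the first energy value.
    energy = np.concatenate([energy[:1], energy])
    return energy > threshold

def csd_fsd(pred_speech, true_speech):
    """Correct silence detection (CSD) and false silence detection (FSD)
    rates, matching the metrics named in the abstract."""
    pred_sil = ~pred_speech
    csd = (pred_sil & ~true_speech).sum() / max((~true_speech).sum(), 1)
    fsd = (pred_sil & true_speech).sum() / max(true_speech.sum(), 1)
    return csd, fsd

# Synthetic clip: 100 frames of an 8x8 mouth crop, with strong pixel
# motion only during the "speech" segment (frames 30-69).
rng = np.random.default_rng(0)
true_speech = np.zeros(100, dtype=bool)
true_speech[30:70] = True
scale = np.where(true_speech, 1.0, 0.05)
frames = rng.normal(size=(100, 8, 8)) * scale[:, None, None]

pred = visual_vad(frames)
csd, fsd = csd_fsd(pred, true_speech)
print(f"CSD = {csd:.2f}, FSD = {fsd:.2f}")
```

Because the decision uses only pixel motion, acoustic interference in the audio channel cannot affect it, which is the key property the abstract exploits.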
