Voice activity detection using subband noncircularity

Scott Wisdom,Greg Okopal,James Pitton,Les Atlas

doi:10.1109/icassp.2015.7178823

Abstract

Many voice activity detection (VAD) systems use the magnitude of complex-valued spectral representations. However, using only the magnitude often does not fully characterize the statistical behavior of the complex values. We present two novel methods for performing VAD on single- and dual-channel audio that do completely account for the second-order statistical behavior of complex data. Our methods exploit the second-order noncircularity (also known as impropriety) of complex subbands of speech and noise. Since speech tends to be more improper than noise, higher impropriety suggests speech activity. Our single-channel method is blind in the sense that it is unsupervised and, unlike many VAD systems, does not rely on non-speech periods for noise parameter estimation. Our methods achieve improved performance over other state-of-the-art magnitude-based VADs on the QUT-NOISE-TIMIT corpus, which indicates that impropriety is a compelling new feature for voice activity detection.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Voice activity detection using subband noncircularity

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Evaluating the Impact of Voice Activity Detection on Speech Emotion Recognition for Autistic Children
Manuel Milling ... Björn W Schuller
Frontiers in Computer Science | VOL. 4
Manuel Milling, et. al.Manuel Milling ... Björn W Schuller
09 Feb 2022
Frontiers in Computer Science | VOL. 4

Low Bits: Binary Neural Network for Vad and Wakeup
Dandan Song ... Leibo Liu
-
Dandan Song, et. al.Dandan Song ... Leibo Liu
01 Jul 2018
01 Jul 2018

A Fusion Model for Robust Voice Activity Detection
Guan-Bo Wang ... Wei-Qiang Zhang
-
Guan-Bo Wang, et. al.Guan-Bo Wang ... Wei-Qiang Zhang
01 Dec 2019
01 Dec 2019

Incorporating VAD into ASR System by Multi-task Learning
Meng Li ... Yan Xia
-
Meng Li, et. al.Meng Li ... Yan Xia
11 Dec 2022
11 Dec 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Voice activity detection using subband noncircularity

Abstract

Talk to us

Similar Papers