Unsupervised Speech Activity Detection Using Voicing Measures and Perceptual Spectral Flux

Seyed Omid Sadjadi,John H L Hansen

doi:10.1109/lsp.2013.2237903

Unsupervised Speech Activity Detection Using Voicing Measures and Perceptual Spectral Flux

Seyed Omid Sadjadi, John H L Hansen

Open Access

https://doi.org/10.1109/lsp.2013.2237903

Copy DOI

Journal: IEEE Signal Processing Letters	Publication Date: Mar 1, 2013
Citations: 179	License type: implied-oa

Affiliation: The University of Texas at Dallas

#Speech Activity Detection #Distortion Levels + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Effective speech activity detection (SAD) is a necessary first step for robust speech applications. In this letter, we propose a robust and unsupervised SAD solution that leverages four different speech voicing measures combined with a perceptual spectral flux feature, for audio-based surveillance and monitoring applications. Effectiveness of the proposed technique is evaluated and compared against several commonly adopted unsupervised SAD methods under simulated and actual harsh acoustic conditions with varying distortion levels. Experimental results indicate that the proposed SAD scheme is highly effective and provides superior and consistent performance across various noise types and distortion levels.

Full Text