Speech Activity Detection under Adverse Noisy Conditions at Low SNRs

Rahul Jaiswal

doi:10.1109/icces51350.2021.9488934

Abstract

Speech originating in noisy environments suffers degraded quality and intelligibility, which reduces the human-perceived Quality of Experience (QoE). For example, drone-based surveillance during a natural catastrophe needs an efficient speech recognition device that can recognise the speech of a frozen person in the presence of drone noise in order to save their life. It is therefore often necessary to pre-process the noisy speech to reduce noise artifacts and enhance the speech. This paper detects speech activity using Voice Activity Detection (VAD). A VAD distinguishes speech activity (speech presence) from speech inactivity (silence/noise) by extracting speech features and comparing them to a threshold. Energy and spectral centroid features are used to design the VADs. A noisy dataset consisting of urban noise, for example drone, helicopter, airplane and station noise, is created at different signal-to-noise ratios (SNRs). F-score and Euclidean distance are used to measure the performance of the VADs. Results demonstrate that the spectral centroid VAD performs outstandingly across the various noise degradations tested.
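The pipeline the abstract describes (mix noise into speech at a chosen SNR, extract a per-frame feature, compare it to a threshold, score the decisions with the F-score) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names, the frame length, the Hann window, the choice of threshold, and the direction of the comparison (feature above threshold means speech) are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio is `snr_db`, then add it."""
    noise = noise[:len(speech)]
    p_speech = np.mean(speech ** 2)
    p_noise = np.mean(noise ** 2)
    gain = np.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    return speech + gain * noise

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into frames of `frame_len` samples, `hop` samples apart."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def short_time_energy(frames):
    """Mean squared amplitude of each frame."""
    return np.mean(frames ** 2, axis=1)

def spectral_centroid(frames, sample_rate):
    """Magnitude-weighted mean frequency (Hz) of each frame."""
    window = np.hanning(frames.shape[1])  # assumed window; not stated in the source
    mag = np.abs(np.fft.rfft(frames * window, axis=1))
    freqs = np.fft.rfftfreq(frames.shape[1], d=1.0 / sample_rate)
    return (mag @ freqs) / (mag.sum(axis=1) + 1e-12)

def vad_decisions(feature, threshold):
    """Mark a frame as speech when its feature exceeds the threshold.

    Assumption: speech frames score higher than non-speech frames. For the
    spectral centroid this depends on the noise type and may need inverting.
    """
    return feature > threshold

def f_score(pred, ref):
    """F-score (harmonic mean of precision and recall) of boolean frame labels."""
    tp = np.sum(pred & ref)
    fp = np.sum(pred & ~ref)
    fn = np.sum(~pred & ref)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return float(2 * precision * recall / (precision + recall))
```

To use the sketch, frame the noisy signal with `frame_signal`, compute one of the two features, and pass it to `vad_decisions` with a threshold. The mean of the feature over the recording is one simple, hypothetical choice. `f_score` then compares the decisions with reference frame labels.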

Similar Papers

A Robust Voice Activity Detection Method Based on Speech Enhancement
Xulei Bao ... Ning Chen
01 Jan 2013

Hybrid voice activity detection system based on LSTM and auditory speech features
Yunus Korkmaz ... Aytuğ Boyacı
Biomedical Signal Processing and Control | VOL. 80
17 Nov 2022

Unsupervised and supervised VAD systems using combination of time and frequency domain features
Yunus Korkmaz ... Aytuğ Boyacı
Biomedical Signal Processing and Control | VOL. 61
15 Jun 2020

A 600BPS MELP vocoder with voice activity detection
Qiuyun Hao ... Peng Zhang
01 Jul 2016
