Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation

Hao Li,Guanglai Gao,Deliang Wang,Xueliang Zhang

doi:10.1109/taslp.2021.3107617

Abstract

It is important to know the presence and the relative level of background noise for many speech processing tasks. Frame-level signal-to-noise ratio (SNR) provides a measure of instantaneous noise level of a noisy signal, and its estimation has been researched for decades. This problem can be approached from a supervised learning perspective by predicting SNR from features of noisy speech. In this study, we introduce a deep learning algorithm for frame-level SNR estimation. The proposed algorithm employs recurrent neural networks (RNNs) with long short-term memory (LSTM) to leverage contextual information. We also systematically examine a range of acoustic features and investigate feature combinations using Group Lasso and sequential floating forward selection (SFFS). The proposed algorithm naturally leads to an utterance-level SNR estimator. Systematical evaluations show that the proposed algorithm provides an accurate estimate of frame-level SNR, as well as utterance-level SNR, under different noise conditions, outperforming other estimators.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2021
Citations: 6

Similar Papers

Long short-term memory (LSTM) recurrent neural network for muscle activity detection
Marco Ghislieri ... Marco Knaflitz
Journal of NeuroEngineering and Rehabilitation | VOL. 18
Marco Ghislieri, et. al.Marco Ghislieri ... Marco Knaflitz
21 Oct 2021
Journal of NeuroEngineering and Rehabilitation | VOL. 18

A Real-Time SNR Estimator for D-MPSK over Frequency-Flat Slow Fading AWGN Channels
Yair Linn
-
Yair LinnYair Linn
01 Mar 2006
01 Mar 2006

Convergence analysis of a joint signal‐to‐noise ratio and channel estimator for frequency selective channels in orthogonal frequency division multiplexing context
Vincent Savaux ... Yves Louët
IET Signal Processing | VOL. 8
Vincent Savaux, et. al.Vincent Savaux ... Yves Louët
01 Aug 2014
IET Signal Processing | VOL. 8

Optimized Deep Learning Model for Effective Spectrum Sensing in Dynamic SNR Scenario
G Arunachalam ... P Sureshkumar
Computer Systems Science and Engineering | VOL. 45
G Arunachalam, et. al.G Arunachalam ... P Sureshkumar
01 Jan 2023
Computer Systems Science and Engineering | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing