A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition

Zhaoxu Nian, Yan-Hui Tu, Jun Du, Chin-Hui Lee

doi:10.1109/icassp39728.2021.9413395

Abstract

In this paper, we propose a progressive learning-based adaptive noise and speech estimation (PL-ANSE) method for speech preprocessing in noisy speech recognition, leveraging the frame-level noise tracking capability of improved minima controlled recursive averaging (IMCRA) and utterance-level deep progressive learning of the nonlinear interactions between speech and noise. First, a bi-directional long short-term memory model is adopted at each network layer to learn progressive ratio masks (PRMs) as targets with progressively increasing signal-to-noise ratios. Then, the PRMs estimated at the utterance level are combined with a conventional frame-level speech enhancement algorithm to enhance the speech. Finally, the enhanced speech, based on this multi-level information fusion, is fed directly into a speech recognition system to improve recognition performance. Experiments show that the proposed approach achieves a relative word error rate (WER) reduction of 22.1% compared with unprocessed noisy speech (from 23.84% to 18.57%) on the CHiME-4 single-channel real test data.
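
The relative WER reduction quoted in the abstract can be checked from the two absolute WER figures it reports. The snippet below is a minimal sketch of that arithmetic only; the function name `relative_wer_reduction` is a hypothetical helper introduced for illustration and is not part of the paper.

```python
def relative_wer_reduction(baseline_wer: float, enhanced_wer: float) -> float:
    """Return the relative WER reduction, in percent, of enhanced over baseline.

    Both arguments are absolute WERs in percent. This helper is hypothetical
    and only illustrates the arithmetic behind the figure in the abstract.
    """
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - enhanced_wer) / baseline_wer


# WERs reported in the abstract for the CHiME-4 single-channel real test data.
reduction = relative_wer_reduction(23.84, 18.57)
print(f"{reduction:.1f}%")  # prints "22.1%"
```

The absolute reduction is 5.27 percentage points; dividing by the 23.84% baseline gives the 22.1% relative figure.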

Similar Papers

Speech Enhancement Based on Teacher–Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition
Yan-Hui Tu ... Jun Du
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 27
01 Dec 2019

Two-stage Noise Spectra Estimation and Regression based In-car Speech Recognition using Single Distant Microphone
Weifeng Li ... F Itakura
18 Mar 2005

Improved Noise Spectra Estimation and Log-spectral Regression for In-car Speech Recognition
Weifeng Li ... F Itakura
01 Jan 2004

Towards Fast and Accurate Streaming End-To-End ASR
Bo Li ... Ruoming Pang
01 May 2020
