Phase and reverberation aware DNN for distant-talking speech enhancement

Zeyan Oo, Seiichi Nakagawa, Khomdet Phapatanaburi, Jianwu Dang, Masahiro Iwahashi, Longbiao Wang

doi:10.1007/s11042-018-5686-1

Abstract

Enhancing reverberant speech with deep neural networks (DNNs) is an interesting yet challenging topic. The performance of speech enhancement degrades significantly when the training and test conditions are mismatched. In this paper, we propose Static Reverberation Aware Training (SRAT)-based dereverberation, in which the reverberation estimate is obtained by averaging over the frames of the segmented signal. This method significantly reduces the input dimensionality and enables the DNN to learn the relation between clean and reverberant speech more efficiently. Most speech enhancement approaches ignore phase information because of its complicated structure. Because phase correlates closely with the speech signal, we exploit this relationship to achieve better performance with the DNN: phase information is appended to the magnitude information and used as the DNN input. We denote this method the phase aware DNN. Finally, both phase information and the reverberation estimate are added to the reverberant speech features to improve speech enhancement in distant-talking conditions. Features of the reverberant speech, phase, and reverberation are used in both the training and testing stages, so that the DNN can draw on both reverberation and phase information to generalize better. The proposed method was evaluated on the REVERB CHALLENGE 2014 database. It yields significant improvements in both reconstructed speech quality (PESQ: Perceptual Evaluation of Speech Quality) and the influence of reverberation (SRMR: Speech to Reverberation Modulation Energy Ratio). Compared with the conventional DNN-based approach, the proposed method improved SRMR from 4.84 to 5.92 and PESQ from 2.34 to 2.70, indicating that it can efficiently enhance speech severely corrupted by reverberation.
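To make the input construction described in the abstract more concrete, the following is a minimal sketch of how magnitude, phase, and a static (frame-averaged) reverberation vector might be stacked into one DNN input per frame. It is not the authors' implementation. The function names (`stft`, `build_features`), the frame length and hop size, the cosine/sine encoding of phase, and the use of the utterance-level mean log-magnitude as a stand-in for the reverberation estimate are all assumptions made for illustration; the abstract does not specify any of them.

```python
import numpy as np


def stft(signal, frame_len=512, hop=256):
    """Short-time Fourier transform with a Hann window (illustrative settings)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)  # shape: (n_frames, frame_len // 2 + 1)


def build_features(reverberant, frame_len=512, hop=256):
    """Stack magnitude, phase, and a static averaged vector for each frame."""
    spec = stft(reverberant, frame_len, hop)

    # Log-magnitude spectrum of the reverberant speech.
    log_mag = np.log(np.abs(spec) + 1e-8)

    # Phase features. ASSUMPTION: cos/sin encoding to avoid phase wrapping;
    # the paper may use a different phase representation.
    phase = np.angle(spec)
    phase_feat = np.concatenate([np.cos(phase), np.sin(phase)], axis=1)

    # Static vector obtained by averaging over frames.
    # ASSUMPTION: the mean log-magnitude is only a placeholder for the
    # paper's reverberation estimate, which is not defined in the abstract.
    static_est = log_mag.mean(axis=0, keepdims=True)
    static_tiled = np.repeat(static_est, log_mag.shape[0], axis=0)

    # One input row per frame: [magnitude | phase | static estimate].
    return np.concatenate([log_mag, phase_feat, static_tiled], axis=1)


# Usage with a synthetic one-second signal at 16 kHz (random noise, not speech).
rng = np.random.default_rng(0)
x = rng.standard_normal(16000)
feats = build_features(x)
print(feats.shape)
```

Because the static vector is a single average repeated for every frame, it adds one spectrum's worth of dimensions to the input rather than a full context window, which is one plausible reading of the abstract's claim that SRAT reduces the input dimensionality.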

Journal: Multimedia Tools and Applications
Publication Date: Feb 20, 2018
Citations: 10

Similar Papers

Performance analysis of low complexity fully connected neural networks for monaural speech enhancement
Himavanth Reddy ... Jan Østergaard
Applied Acoustics | VOL. 190
24 Jan 2022

Experimental study on speech enhancement using DNN with perceptual weighting
Wenhua Shi ... Xiongwei Zhang
02 Nov 2018

Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation
Xiong Xiao ... Haizhou Li
EURASIP Journal on Advances in Signal Processing | VOL. 2016
13 Jan 2016

Speech Enhancement based on Deep Convolutional Neural Network
Ramesh Nuthakki ... Yukta T N
11 Nov 2021
