An Improved Fully Convolutional Network Based on Post-Processing with Global Variance Equalization and Noise-Aware Training for Speech Enhancement

Wenlong Li,Kaoru Hirota,Yaping Dai,Zhiyang Jia

doi:10.20965/jaciii.2021.p0130

Wenlong Li, Kaoru Hirota + Show 2 more

Open Access

https://doi.org/10.20965/jaciii.2021.p0130

Copy DOI

Abstract

An improved fully convolutional network based on post-processing with global variance (GV) equalization and noise-aware training (PN-FCN) for speech enhancement model is proposed. It aims at reducing the complexity of the speech improvement system, and it solves overly smooth speech signal spectrogram problem and poor generalization capability. The PN-FCN is fed with the noisy speech samples augmented with an estimate of the noise. In this way, the PN-FCN uses additional online noise information to better predict the clean speech. Besides, PN-FCN uses the global variance information, which improve the subjective score in a voice conversion task. Finally, the proposed framework adopts FCN, and the number of parameters is one-seventh of deep neural network (DNN). Results of experiments on the Valentini-Botinhaos dataset demonstrate that the proposed framework achieves improvements in both denoising effect and model training speed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics	Publication Date: Jan 20, 2021
Citations: 1	License type: cc-by-nd

R Discovery Prime

R Discovery Prime

An Improved Fully Convolutional Network Based on Post-Processing with Global Variance Equalization and Noise-Aware Training for Speech Enhancement

Abstract

Talk to us

Similar Papers

More From: Journal of Advanced Computational Intelligence and Intelligent Informatics

Lead the way for us

Similar Papers

Dynamic noise aware training for speech enhancement based on deep neural networks
Yong Xu ... Jun Du
-
Yong Xu, et. al.Yong Xu ... Jun Du
14 Sep 2014
14 Sep 2014

An Analysis of Noise-aware Features in Combination with the Size and Diversity of Training Data for DNN-based Speech Enhancement
Robert Rehr ... Timo Gerkmann
-
Robert Rehr, et. al.Robert Rehr ... Timo Gerkmann
01 May 2019
01 May 2019

Two-stage noise aware training using asymmetric deep denoising autoencoder
Kang Hyun Lee ... Shin Jae Kang
-
Kang Hyun Lee, et. al.Kang Hyun Lee ... Shin Jae Kang
01 Mar 2016
01 Mar 2016

Trajectory training considering global variance for speech synthesis based on neural networks
Kei Hashimoto ... Yoshihiko Nankaku
-
Kei Hashimoto, et. al.Kei Hashimoto ... Yoshihiko Nankaku
01 Mar 2016
01 Mar 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Improved Fully Convolutional Network Based on Post-Processing with Global Variance Equalization and Noise-Aware Training for Speech Enhancement

Abstract

Talk to us

Similar Papers

More From: Journal of Advanced Computational Intelligence and Intelligent Informatics