Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis.

Wenbo Zhang,Xuefeng Xie,Yanling Du,Dongmei Huang

doi:10.1121/10.0026219

Abstract

Speech enhancement aims to make noisy speech signals clearer. Traditional time-frequency domain methods struggle to differentiate between speech and noise, leading to a risk of speech distortion. This paper introduces an approach that combines the time domain and time-frequency domain using the W-net module to suppress noise at the front end. The module is an improved version of Wave-U-Net, called TTF-W-Net. We conducted experiments using the TIMIT speech and NOISEX-92 noise datasets to evaluate the enhancement performance achieved by integrating preprocessing networks, specifically Wave-U-Net and our TTF-W-Net, into the baseline methods: Phase, FullSubNet+, and DB-AIAT. Experimental results show that TTF-W-Net outperforms the baseline Wave-U-Net by 15.7% on the PESQ metric and the effect of the network by using our preprocessing method is improved. Consequently, the TTF-W-Net preprocessing Net offers effective speech enhancement.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis.

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: Jun 1, 2024
Citations: 1

Similar Papers

Ultrasonic test of resistance spot welds based on wavelet package analysis
Jing Liu ... Guanghao Zhou
Ultrasonics | VOL. 56
Jing Liu, et. al.Jing Liu ... Guanghao Zhou
20 Oct 2014
Ultrasonics | VOL. 56

Summary and Recommendations for Safe Mooring System Design in ULS and ALS
Siril Okkenhaug ... Torfinn Hørte
-
Siril Okkenhaug, et. al.Siril Okkenhaug ... Torfinn Hørte
25 Jun 2017
25 Jun 2017

Joint Time-Frequency and Time Domain Learning for Speech Enhancement
Chuanxin Tang ... Wenjun Zeng
-
Chuanxin Tang, et. al.Chuanxin Tang ... Wenjun Zeng
01 Jul 2020
01 Jul 2020

Time domain and frequency domain analysis of functionally graded piezoelectric harvesters subjected to random vibration: Finite element modeling
Y Amini ... H Parandvar
Composite Structures | VOL. 136
Y Amini, et. al.Y Amini ... H Parandvar
11 Nov 2015
Composite Structures | VOL. 136

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis.

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America