Abstract
Traditional speech enhancement algorithms are only suitable for dealing with stationary noise, but the noise in the stage of flight is nonstationary noise, so the traditional method is not suitable for dealing with the noise in the stage of flight. This paper proposes a speech enhancement algorithm based on a generative adversarial network: Deep Convolutional–Wasserstein Generative Adversarial Network (DWGAN). Firstly, the model integrates the deep convolutional generative adversarial network and the Wasserstein distance based on the generative adversarial network. Secondly, it introduces a conditional model to improve the enhanced speech quality, and the spectral constraint layer is used to prevent the model from falling too fast and causing collapse. Finally, the L1 loss term is introduced into the loss function to reduce the number of training times and further improve the enhanced speech quality. The experimental results show that the intrusiveness of background noise and overall processed speech quality of DWGAN are improved by about 7.6 and 9.4%, respectively, compared with WGAN in the acoustic environment of simulated aircraft operation.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.