Cross-Domain Speech Enhancement with a Neural Cascade Architecture

Heming Wang,Deliang Wang

doi:10.1109/icassp43922.2022.9747752

Cross-Domain Speech Enhancement with a Neural Cascade Architecture

Heming Wang, Deliang Wang

https://doi.org/10.1109/icassp43922.2022.9747752

Copy DOI

Export

Save

Cite

Publication Date: May 23, 2022

Citations: 2

Affiliation: The Ohio State University

#Complex Spectrogram #Cascade Architecture #Spectral Magnitude #Speech Representation #Noisy Speech #Speech Enhancement #Speech Quality #Neural Architecture #Spectral Waveform #Domains Of Representation

Abstract
Full-Text
Similar Papers

Abstract

Listen

This paper proposes a novel cascade architecture to address the monaural speech enhancement problem. We leverage three different domains of speech representation, namely spectral magnitude, waveform, and complex spectrogram, to progressively suppress the background noise within noisy speech. Our proposed neural cascade architecture consists of three modules, and each operates on the original noisy input and the output of the previous module in a distinct speech representation. During training, the network simultaneously optimizes all modules with a triple-domain loss. Experiments on the WSJ0 SI-84 corpus demonstrate that our proposed approach achieves superior enhancement results, and substantially outperforms previous baselines in terms of both speech quality and intelligibility.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Cross-Domain Speech Enhancement with a Neural Cascade Architecture

Abstract

Published Version

Talk to us

Similar Papers

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Cross-Domain Speech Enhancement with a Neural Cascade Architecture

Abstract

Published Version

Talk to us

Similar Papers