Abstract

Recently, progressive learning (PL) technology has become a hot spot in the single-channel speech enhancement field. Nevertheless, the existing PL-based methods only focus on SNR variations, which may lead to noise overestimation and speech distortion. To this end, we propose a hybrid method for single-channel speech enhancement leveraging an improved progressive deep neural network (IPDNN) and a novel masking-based harmonic regeneration (MHR). First, to make a tradeoff between noise reduction and weak-energy speech distortion, we design the IPDNN architecture by guiding each hidden layer to explicitly learn an improved progressive ratio mask (IPRM) as a target with a specific weak-unvoiced component improvement and SNR gain. Then, to further compensate for the first-level enhancement results from IPDNN and obtain refined results with more harmonic components, the MHR is proposed, in which the enhanced speech is reconstructed by merging the estimated IPRMs into the conventional harmonic regeneration procedure. Finally, compared with several reference methods, our experimental results show that the proposed method can consistently improve the perceived speech quality and intelligibility for all noise types and SNR levels.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.