Abstract

Recently, progressive learning (PL) technology has become a hot spot in the single-channel speech enhancement field. Nevertheless, the existing PL-based methods only focus on SNR variations, which may lead to noise overestimation and speech distortion. To this end, we propose a hybrid method for single-channel speech enhancement leveraging an improved progressive deep neural network (IPDNN) and a novel masking-based harmonic regeneration (MHR). First, to make a tradeoff between noise reduction and weak-energy speech distortion, we design the IPDNN architecture by guiding each hidden layer to explicitly learn an improved progressive ratio mask (IPRM) as a target with a specific weak-unvoiced component improvement and SNR gain. Then, to further compensate for the first-level enhancement results from IPDNN and obtain refined results with more harmonic components, the MHR is proposed, in which the enhanced speech is reconstructed by merging the estimated IPRMs into the conventional harmonic regeneration procedure. Finally, compared with several reference methods, our experimental results show that the proposed method can consistently improve the perceived speech quality and intelligibility for all noise types and SNR levels.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call