Abstract

We investigate single-channel speech enhancement using the double spectrum (DS), which combines pitch-synchronous and modulation transforms. We first explore the fundamentals of the proposed DS domain and its advantageous properties for pitch estimation and speech presence probability estimation. We then propose speech enhancement methods based on adaptive weighting and Wiener filtering in the DS domain. We demonstrate the effectiveness of the proposed DS-based methods compared with conventional benchmarks in the modulation and short-time Fourier transform domains. Our results show that the proposed approach achieves a good tradeoff between improved perceived quality and a slight degradation in speech intelligibility across different signal-to-noise ratios and noise types.
