Abstract
This paper proposes an algorithm that applies harmonic regeneration as a post-processing step to improve speech enhancement based on traditional Short-Time Spectral Amplitude (STSA) methods. The algorithm aims to reduce the distortion that traditional STSA enhancement introduces in the higher harmonics of the enhanced speech, and thereby to improve speech quality. We first detect the pitch (fundamental frequency) of the STSA-enhanced speech and then divide the whole spectrum into multiple sub-bands, each centered on a harmonic. A series of specially designed windows, one centered on each harmonic, is then applied to the sub-bands in order to redistribute the energy within them. The experimental results indicate that the method has both a theoretical and a practical basis.
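The sub-band windowing step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify the window design, so a raised-cosine window is assumed here, as is the choice to rescale each sub-band so that its total energy is preserved. The function names `harmonic_subbands` and `redistribute_energy` are hypothetical, and the pitch `f0_hz` is taken as already estimated.

```python
import math


def harmonic_subbands(f0_hz, sample_rate_hz, n_bins):
    """Split a one-sided spectrum (bins cover 0 .. sample_rate/2) into
    sub-bands, one per harmonic k*f0, spanning [(k-0.5)*f0, (k+0.5)*f0)."""
    bin_hz = (sample_rate_hz / 2) / (n_bins - 1)
    bands = []
    k = 1
    while k * f0_hz <= sample_rate_hz / 2:
        lo = max(0, math.ceil((k - 0.5) * f0_hz / bin_hz))
        hi = min(n_bins, math.ceil((k + 0.5) * f0_hz / bin_hz))
        if hi > lo:
            bands.append((k, lo, hi))
        k += 1
    return bands, bin_hz


def redistribute_energy(magnitudes, f0_hz, sample_rate_hz):
    """Apply a window centered on each harmonic to its sub-band.

    Assumption: a raised-cosine window (1 at the harmonic, 0 at +/- f0/2),
    followed by a rescale that keeps each sub-band's energy unchanged.
    Bins outside every sub-band are left as they are.
    """
    bands, bin_hz = harmonic_subbands(f0_hz, sample_rate_hz, len(magnitudes))
    out = list(magnitudes)
    for k, lo, hi in bands:
        center_hz = k * f0_hz
        shaped = []
        for b in range(lo, hi):
            offset = (b * bin_hz - center_hz) / f0_hz  # in [-0.5, 0.5)
            weight = 0.5 * (1.0 + math.cos(2.0 * math.pi * offset))
            shaped.append(magnitudes[b] * weight)
        energy_in = sum(m * m for m in magnitudes[lo:hi])
        energy_out = sum(s * s for s in shaped)
        if energy_out > 0:
            scale = math.sqrt(energy_in / energy_out)
            out[lo:hi] = [s * scale for s in shaped]
    return out


# Usage: a flat 257-bin magnitude spectrum at 16 kHz with a 200 Hz pitch.
flat = [1.0] * 257
shaped_spectrum = redistribute_energy(flat, f0_hz=200.0, sample_rate_hz=16000.0)
```

In this sketch the energy in each sub-band is concentrated around the harmonic while the sub-band total is held fixed. The windows used in the paper may differ in both shape and normalization.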