Abstract

A new speech enhancement algorithm using a Hilbert Transform (HT) based Time-Frequency (TF) representation of speech signal with respect to human perception is proposed. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing Critical Bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement the envelope in each CB is modified, based on the conventional spectral subtraction method, using a time varying gain function which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects of all bands using a perception model. Signal is reconstructed from the modified envelopes and the original phases of noisy signal in critical bands. Experimental results show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.