Abstract
This paper presents a new speech enhancement method based on dictionary learning that incorporates a coherence measure into the learning process. A noise estimation algorithm supplies the data needed to fit the atoms used in the sparse representation of noise, so that the noise dictionary is trained with the same data size as the speech signal. After the training step, a new method is applied to reduce the coherence between the dictionaries, yielding incoherent dictionaries. In the sparse representation of speech data, the highest-energy atoms of the noise dictionary are replaced with the lowest-energy atoms under certain conditions; a similar replacement can occur in the sparse representation of noise data. Furthermore, only one noise dictionary, selected by a classification method, is used in the speech enhancement step, which makes the algorithm faster. The simulation results are evaluated with both objective and subjective measures. According to the experimental results, the proposed algorithm is superior to earlier methods in this context in both performance and computational overhead. It also achieves significantly better results than baseline methods such as multi-band and geometric spectral subtraction.
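The abstract does not define its coherence measure. As a rough illustration only, the sketch below assumes the standard mutual coherence between two dictionaries: the largest absolute inner product between any unit-normalized atom of one dictionary and any unit-normalized atom of the other. The names `normalize`, `mutual_coherence`, `speech_dict`, and `noise_dict` are hypothetical, and the toy atoms are not taken from the paper.

```python
import math


def normalize(atom):
    """Scale an atom (a list of floats) to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in atom))
    if norm == 0.0:
        raise ValueError("cannot normalize an all-zero atom")
    return [x / norm for x in atom]


def mutual_coherence(dict_a, dict_b):
    """Largest |<a, b>| over unit-normalized atoms a in dict_a and b in dict_b.

    Assumption: each dictionary is a list of atoms of equal length. This is
    the textbook cross-dictionary coherence, not necessarily the measure
    used in the paper.
    """
    atoms_a = [normalize(a) for a in dict_a]
    atoms_b = [normalize(b) for b in dict_b]
    return max(
        abs(sum(x * y for x, y in zip(a, b)))
        for a in atoms_a
        for b in atoms_b
    )


# Toy dictionaries with three-sample atoms (illustrative values only).
speech_dict = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
noise_dict = [[0.0, 0.0, 2.0], [1.0, 1.0, 0.0]]

mu = mutual_coherence(speech_dict, noise_dict)
print(f"mutual coherence: {mu:.4f}")
```

A value near 0 means the two dictionaries span nearly orthogonal directions, while a value near 1 means at least one speech atom and one noise atom are almost parallel, which makes it harder to attribute signal energy to the right source. Reducing this quantity is the usual motivation for seeking incoherent dictionaries.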