Abstract
For robust speech recognition in real-world noisy environments, we present an algorithm that combines blind signal separation based on independent component analysis (ICA) with top-down attention processing. While ICA-based unmixing networks learn the inverse of the mixing characteristics in the frequency domain, their performance is limited by mismatches between the real-world mixing characteristics and the assumptions of the ICA algorithm. The top-down process from a multilayer perceptron (MLP) classifier provides additional information on the speech signal and fine-tunes the unmixing networks to compensate for these mismatches. For noisy speech signals recorded in a real office environment, the developed algorithm substantially improved recognition performance.
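To make the idea of an ICA-based unmixing network concrete, the following is a minimal sketch and not the algorithm developed in this work. It assumes a real-valued, instantaneous two-channel mixture of synthetic Laplacian sources and uses the standard natural-gradient ICA update with a tanh score function. The frequency-domain processing and the top-down MLP fine-tuning described above are not reproduced. The function name `natural_gradient_ica`, the mixing matrix `a`, and all parameter values are illustrative assumptions.

```python
import numpy as np


def natural_gradient_ica(x, lr=0.1, n_iter=1000):
    """Estimate an unmixing matrix for x (channels x samples).

    Hypothetical helper: real-valued, instantaneous mixing only.
    """
    n_ch, n_samp = x.shape
    w = np.eye(n_ch)  # start from the identity unmixing matrix
    for _ in range(n_iter):
        y = w @ x                      # current source estimates
        phi = np.tanh(y)               # score function for super-Gaussian sources
        # Natural-gradient update: W <- W + lr * (I - E[phi(y) y^T]) W
        w += lr * (np.eye(n_ch) - (phi @ y.T) / n_samp) @ w
    return w


# Synthetic stand-in for speech: two independent super-Gaussian sources.
rng = np.random.default_rng(0)
s = rng.laplace(size=(2, 5000))

# Assumed mixing matrix (unknown to the algorithm in a real setting).
a = np.array([[1.0, 0.6],
              [0.4, 1.0]])
x = a @ s

w = natural_gradient_ica(x)

# If separation succeeded, the global system W A is close to a scaled
# permutation matrix: one dominant entry per row and per column.
g = w @ a
print(np.round(g, 3))
```

In this toy setting the mixing model matches the ICA assumptions exactly, so the unmixing matrix alone is sufficient. The mismatches discussed above arise when real recordings violate those assumptions, which is where the top-down fine-tuning is intended to help.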