Towards Improved Room Impulse Response Estimation for Speech Recognition

Anton Ratnarajah,Dinesh Manocha,Ishwarya Ananthabhotla,Paul Calamia,Pablo Hoffmann,Vamsi Krishna Ithapu

doi:10.1109/icassp49357.2023.10094770

Towards Improved Room Impulse Response Estimation for Speech Recognition

Anton Ratnarajah, Dinesh Manocha + Show 4 more

Open Access

https://doi.org/10.1109/icassp49357.2023.10094770

Copy DOI

Publication Date: Jun 4, 2023

Citations: 8

Affiliation: University of Maryland, College Park, META Health

#Room Impulse Response Estimation #Room Impulse Response + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We propose a novel approach for blind room impulse response (RIR) estimation systems in the context of a downstream application scenario, far-field automatic speech recognition (ASR). We first draw the connection between improved RIR estimation and improved ASR performance, as a means of evaluating neural RIR estimators. We then propose a generative adversarial network (GAN) based architecture that encodes RIR features from reverberant speech and constructs an RIR from the encoded features, and uses a novel energy decay relief loss to optimize for capturing energy-based properties of the input reverberant speech. We show that our model outperforms the state-of-the-art baselines on acoustic benchmarks (by 17% on the energy decay relief and 22% on an early-reflection energy metric), as well as in an ASR evaluation task (by 6.9% in word error rate).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.