Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator

Anton Ratnarajah,Zhenyu Tang,Dinesh Manocha,Dong Yu,Shi-Xiong Zhang,Meng Yu

doi:10.1109/icassp43922.2022.9747846

Anton Ratnarajah, Zhenyu Tang + Show 4 more

Open Access

https://doi.org/10.1109/icassp43922.2022.9747846

Copy DOI

Abstract

We present a neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment. Our FAST-RIR takes rectangular room dimensions, listener and speaker positions, and reverberation time (T <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">60</inf> ) as inputs and generates specular and diffuse reflections for a given acoustic environment. Our FAST-RIR is capable of generating RIRs for a given input T <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">60</inf> with an average error of 0.02s. We evaluate our generated RIRs in automatic speech recognition (ASR) applications using Google Speech API, Microsoft Speech API, and Kaldi tools. We show that our proposed FAST-RIR with batch size 1 is 400 times faster than a state-of-the-art diffuse acoustic simulator (DAS) on a CPU and gives similar performance to DAS in ASR experiments. Our FAST-RIR is 12 times faster than an existing GPU-based RIR generator (gpuRIR). We show that our FAST-RIR outperforms gpuRIR by 2.5% in an AMI far-field ASR benchmark.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Application of automatic speech recognition (ASR) techniques for automatic speech assessment in people with aphasia
Ying Qin ... Tan Lee
Frontiers in Human Neuroscience | VOL. 12
Ying Qin, et. al.Ying Qin ... Tan Lee
01 Jan 2018
Frontiers in Human Neuroscience | VOL. 12

Gammatone sub-band magnitude-domain dereverberation for ASR
Kshitiz Kumar ... Richard Stern
-
Kshitiz Kumar, et. al.Kshitiz Kumar ... Richard Stern
01 May 2011
01 May 2011

Applications of automatic speech recognition and text-to-speech technologies for hearing assessment: a scoping review
Mohsen Fatehifar ... Kevin J Munro
International Journal of Audiology | VOL. ahead-of-print
Mohsen Fatehifar, et. al.Mohsen Fatehifar ... Kevin J Munro
12 Nov 2024
International Journal of Audiology | VOL. ahead-of-print

SR-NBS: A fast sparse representation based N-best class selector for robust phoneme classification
Armin Saeb ... Massoud Babaie-Zadeh
Engineering Applications of Artificial Intelligence | VOL. 28
Armin Saeb, et. al.Armin Saeb ... Massoud Babaie-Zadeh
27 Dec 2013
Engineering Applications of Artificial Intelligence | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast-Rir: Fast Neural Diffuse Room Impulse Response Generator

Abstract

Talk to us

Similar Papers