Convolution with spatial Room Impulse Responses can achieve realistic auralizations. When combined with interpolation between spatially distributed RIRs, this technique can be used to create navigable virtual environments. This study explores the impact of various interpolation parameters on the perceived auditory stability of a nearby static sound source with listener movements in a reverberant environment. The auditory scene was rendered via third-order Ambisonic RIR convolution combined with magnitude-least-squares binaural decoding using nonindividualized head-related transfer functions. First, the estimated direction of arrival as a function of the listener’s position within a 2D grid of RIRs under various configurations is examined as an objective metric. The perceived stability of the auditory source is then assessed through a perceptual experiment. Participants freely explored a virtual scene reproduced over headphones and a tracked head-mounted display. They were asked to rate the stability of a nonvisual source under various conditions of RIR grid density, interpolation panning method, and reverberation time. Results indicate no need to use an RIR grid size finer than 1m to optimize source stability when using a three-nearest-neighbor interpolation scheme.
Read full abstract