Head-related Impulse Responses Research Articles

Bilateral cochlear implants (CIs) greatly improve spatial hearing acuity for CI users, but substantial gaps still exist compared to normal-hearing listeners. For example, CI users have poorer localization skills, little or no binaural unmasking, and reduced spatial release from masking. Multiple factors have been identified that limit binaural hearing with CIs. These include degradation of cues due to the various sound processing stages, the viability of the electrode-neuron interface, impaired brainstem neurons, and deterioration in connectivity between different cortical layers. To help quantify the relative importance and inter-relationship between these factors, computer models can and arguably should be employed. While models exploring single stages are often in good agreement with selected experimental data, their combination often does not yield a comprehensive and accurate simulation of perception. Here, we combine information from CI sound processing with computational auditory model stages in a modular and open-source framework, resembling an artificial bilateral CI user. The main stages are (a) binaural signal generation with optional head-related impulse response filtering, (b) generic CI sound processing not restricted to a specific manufacturer, (c) electrode-to-neuron transmission, (d) binaural interaction, and (e) a decision model. The function and the outputs of different model stages are demonstrated with examples of localization experiments. However, the model framework is not tailored to a specific dataset. It offers a selection of sound coding strategies and allows for third-party model extensions or substitutions; thus, it is possible to employ the model for a wide range of binaural applications and even for educational purposes.

Read full abstract

Spatial audio has attracted more and more attention in the fields of virtual reality (VR), blind navigation and so on. The individualized head-related transfer functions (HRTFs) play an important role in generating spatial audio with accurate localization perception. Existing methods only focus on one database, and do not fully utilize the information from multiple databases. In light of this, a pre-trained-based individualization model is proposed to predict HRTFs for any target user in this paper, and a real-time spatial audio rendering system built on a wearable device is implemented to produce an immersive virtual auditory display. The proposed method first builds a pre-trained model based on multiple databases using a DNN-based model combined with an autoencoder-based dimensional reduction method. This model can capture the nonlinear relationship between user-independent HRTFs and position-dependent features. Then, fine tuning is done using a transfer learning technique at a limit number of layers based on the pre-trained model. The key idea behind fine tuning is to transfer the pre-trained user-independent model to the user-dependent one based on anthropometric features. Finally, real-time issues are discussed to guarantee a fluent auditory experience during dynamic scene update, including fine-grained head-related impulse response (HRIR) acquisition, efficient spatial audio reproduction, and parallel synthesis and playback. These techniques ensure that the system is implemented with little computational cost, thus minimizing processing delay. The experimental results show that the proposed model outperforms other methods in terms of subjective and objective metrics. Additionally, our rendering system runs on HTC Vive, with almost unnoticeable delay.

Read full abstract

Head-related Impulse Responses Research Articles

Related Topics

Articles published on Head-related Impulse Responses

DNN-based HRTF individualization for accurate spectral cues using a compact PRTF

Comparing subjective similarity ratings and quantitative errors for the evaluation of free-field binaural panning techniques

Auditory Spatial Bisection of Blind and Normally Sighted Individuals in Free Field and Virtual Acoustics.

Effects of visual stimuli on auditory separation of sound images spatially split by synthesized binaural signal

Free-field perceptual evaluation of virtual acoustic rendering algorithms using two head-related impulse response delay treatment strategies

The influence of helmets on sound localisation in motorcyclists

A model framework for simulating spatial hearing of bilateral cochlear implant users

An Algorithm for Generating Virtual Sources in Dynamic Virtual Auditory Display Based on Tensor Decomposition of Head-Related Impulse Responses

The impact of head-related impulse response delay treatment strategy on psychoacoustic cue reconstruction errors from virtual loudspeaker arrays.

Auditory model-based estimation of the effect of head-worn devices on frontal horizontal localisation

The Presence of a Floor Improves Subjective Elevation Accuracy of Binaural Stimuli Created With Non-Individualized Head-Related Impulse Responses

Improved accuracy and computational efficiency in virtual acoustic rendering using principal components-based amplitude panning

Comparing the differences in robustness between interaural time delay calculation methods

Acoustic characteristics of a miniature dynamic speaker driver unit MT006B for measurement of head-related impulse responses by reciprocal method

Perceptual implications of different Ambisonics-based methods for binaural reverberation.

Toward realistic binaural auralizations – perceptual comparison between measurement and simulation-based auralizations and the real room for a classroom scenario

Pre-Trained-Based Individualization Model for Real-Time Spatial Audio Rendering System

Speech Intelligibility and Spatial Release From Masking Improvements Using Spatial Noise Reduction Algorithms in Bimodal Cochlear Implant Users.

Speech Enhancement Based on Modulation-Domain Parametric Multichannel Kalman Filtering

Investigating the perceptual accuracy of machine-learning generated personalized head-eelated impulse responses

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Head-related Impulse Responses Research Articles

Related Topics

Articles published on Head-related Impulse Responses

DNN-based HRTF individualization for accurate spectral cues using a compact PRTF

Comparing subjective similarity ratings and quantitative errors for the evaluation of free-field binaural panning techniques

Auditory Spatial Bisection of Blind and Normally Sighted Individuals in Free Field and Virtual Acoustics.

Effects of visual stimuli on auditory separation of sound images spatially split by synthesized binaural signal

Free-field perceptual evaluation of virtual acoustic rendering algorithms using two head-related impulse response delay treatment strategies

The influence of helmets on sound localisation in motorcyclists

A model framework for simulating spatial hearing of bilateral cochlear implant users

An Algorithm for Generating Virtual Sources in Dynamic Virtual Auditory Display Based on Tensor Decomposition of Head-Related Impulse Responses

The impact of head-related impulse response delay treatment strategy on psychoacoustic cue reconstruction errors from virtual loudspeaker arrays.

Auditory model-based estimation of the effect of head-worn devices on frontal horizontal localisation

The Presence of a Floor Improves Subjective Elevation Accuracy of Binaural Stimuli Created With Non-Individualized Head-Related Impulse Responses

Improved accuracy and computational efficiency in virtual acoustic rendering using principal components-based amplitude panning

Comparing the differences in robustness between interaural time delay calculation methods

Acoustic characteristics of a miniature dynamic speaker driver unit MT006B for measurement of head-related impulse responses by reciprocal method

Perceptual implications of different Ambisonics-based methods for binaural reverberation.

Toward realistic binaural auralizations – perceptual comparison between measurement and simulation-based auralizations and the real room for a classroom scenario

Pre-Trained-Based Individualization Model for Real-Time Spatial Audio Rendering System

Speech Intelligibility and Spatial Release From Masking Improvements Using Spatial Noise Reduction Algorithms in Bimodal Cochlear Implant Users.

Speech Enhancement Based on Modulation-Domain Parametric Multichannel Kalman Filtering

Investigating the perceptual accuracy of machine-learning generated personalized head-eelated impulse responses