Binaural Enhancement Research Articles

Future wearable technology may provide for enhanced communication in noisy environments and for the ability to pick out a single talker of interest in a crowded room simply by the listener shifting their attentional focus. Such a system relies on two components: speaker separation and decoding the listener’s attention to acoustic streams in the environment. To address the former, we present a system for joint speaker separation and noise suppression, referred to as the Binaural Enhancement via Attention Masking Network (BEAMNET). The BEAMNET system is an end-to-end neural network architecture based on self-attention. Binaural input waveforms are mapped to a joint embedding space via a learned encoder, and separate multiplicative masking mechanisms are included for noise suppression and speaker separation. Pairs of output binaural waveforms are then synthesized using learned decoders, each capturing a separated speaker while maintaining spatial cues. A key contribution of BEAMNET is that the architecture contains a separation path, an enhancement path, and an autoencoder path. This paper proposes a novel loss function which simultaneously trains these paths, so that disabling the masking mechanisms during inference causes BEAMNET to reconstruct the input speech signals. This allows dynamic control of the level of suppression applied by BEAMNET via a minimum gain level, which is not possible in other state-of-the-art approaches to end-to-end speaker separation. This paper also proposes a perceptually motivated waveform distance measure. Using objective speech quality metrics, the proposed system is demonstrated to perform well at separating two equal-energy talkers, even in high levels of background noise. Subjective testing shows an improvement in speech intelligibility across a range of noise levels, for signals with artificially added head-related transfer functions and background noise.
Finally, when used as part of an auditory attention decoder (AAD) system using existing electroencephalogram (EEG) data, BEAMNET is found to maintain the decoding accuracy achieved with ideal speaker separation, even in severe acoustic conditions. These results suggest that this enhancement system can effectively support auditory attention decoding in realistic noise environments, and could lead to improved speech perception in a cognitively controlled hearing aid.
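
The minimum gain level described in the abstract above can be illustrated with a small NumPy sketch. This is not the BEAMNET implementation: the function name `apply_mask_with_floor`, the decibel parameterization, the array shapes, and the use of a simple element-wise clamp are all assumptions made for illustration. The abstract only states that the masks are multiplicative and that disabling them reconstructs the input.

```python
import numpy as np


def apply_mask_with_floor(embedding, mask, min_gain_db):
    """Apply a multiplicative mask whose values are clamped to a minimum gain.

    Hypothetical helper, not taken from the paper. `mask` is assumed to lie
    in [0, 1]; `min_gain_db` is assumed to be <= 0 (0 dB means no suppression).
    """
    if min_gain_db > 0:
        raise ValueError("min_gain_db must be <= 0 dB")
    # Convert the floor from decibels to a linear amplitude gain.
    min_gain = 10.0 ** (min_gain_db / 20.0)
    # Clamp the mask from below so no element is attenuated past the floor.
    floored_mask = np.maximum(mask, min_gain)
    return embedding * floored_mask


# Toy data standing in for an encoder output and a predicted mask.
rng = np.random.default_rng(0)
embedding = rng.standard_normal((4, 8))
mask = rng.uniform(0.0, 1.0, size=(4, 8))

# A 0 dB floor disables masking: the embedding passes through unchanged,
# which mirrors the autoencoder path described in the abstract.
passthrough = apply_mask_with_floor(embedding, mask, 0.0)

# A -20 dB floor limits attenuation to a factor of 0.1 per element.
limited = apply_mask_with_floor(embedding, mask, -20.0)
```

Under these assumptions, raising the floor toward 0 dB moves the output smoothly from the fully masked signal toward the unprocessed input, which is one plausible way a listener-adjustable suppression level could be exposed.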

While auditory cortex in non-human primates has been subdivided into multiple functionally specialized auditory cortical fields (ACFs), the boundaries and functional specialization of human ACFs have not been defined. In the current study, we evaluated whether a widely accepted primate model of auditory cortex could explain regional tuning properties of fMRI activations on the cortical surface to attended and non-attended tones of different frequency, location, and intensity. The limits of auditory cortex were defined by voxels that showed significant activations to non-attended sounds. Three centrally located fields with mirror-symmetric tonotopic organization were identified and assigned to the three core fields of the primate model while surrounding activations were assigned to belt fields following procedures similar to those used in macaque fMRI studies. The functional properties of core, medial belt, and lateral belt field groups were then analyzed. Field groups were distinguished by tonotopic organization, frequency selectivity, intensity sensitivity, contralaterality, binaural enhancement, attentional modulation, and hemispheric asymmetry. In general, core fields showed greater sensitivity to sound properties than did belt fields, while belt fields showed greater attentional modulation than core fields. Significant distinctions in intensity sensitivity and contralaterality were seen between adjacent core fields A1 and R, while multiple differences in tuning properties were evident at boundaries between adjacent core and belt fields. The reliable differences in functional properties between fields and field groups suggest that the basic primate pattern of auditory cortex organization is preserved in humans. A comparison of the sizes of functionally defined ACFs in humans and macaques reveals a significant relative expansion in human lateral belt fields implicated in the processing of speech.

Articles published on Binaural Enhancement

Multilingual non-intrusive binaural intelligibility prediction based on phone classification

Optimized Binaural Enhancement via attention masking network-based speech separation framework in digital hearing aids

Speaker separation in realistic noise environments with applications to a cognitively-controlled hearing aid

Evaluation of a method for enhancing interaural level differences at low frequencies.

Parameter-based binaural hearing aid algorithms to improve speech intelligibility and localization in complex environments.

Functional Properties of Human Auditory Cortical Fields

An Electrophysiological Measure of Binaural Hearing in Noise

Binaural enhancement of speech intelligibility metrics

The effect of broad-band noise on the binaural interaction components of human auditory brainstem-evoked potentials.

Stethoscope having pseudostereophonic binaural enhancement

Binaural and Monaural Speech Discrimination Under Reverberation
