Speech In Noisy Environments Research Articles

The goal of speech enhancement is to restore clean speech in noisy environments. Acoustic scenarios with low signal-to-noise ratios (SNR) make it challenging to extract the target speech from the noise. In this study, we propose a feature-recalibration-based multi-scale convolutional encoder-decoder architecture with a squeeze temporal convolutional network (S-TCN) bottleneck to enhance noisy speech. Each multi-scale convolutional layer in the encoder and decoder is followed by a time-frequency attention (TFA) module. The recalibration-based multi-scale 2D convolution layers extract local and contextual information. Additionally, the recalibration network is equipped with a gating mechanism that controls the flow of information among the layers, enabling weighting of the scaled features for noise suppression and speech retention. The fully connected (FC) layer in the bottleneck of the encoder-decoder contains only a few neurons, which capture global information from the multi-scale 2D convolution layer and reduce the parameter count. An S-TCN, inspired by the popular temporal convolutional neural network (TCNN), is inserted between the encoder and the decoder to model long-term dependencies in speech. The TFA is a highly efficient network component that operates through two simultaneous attentions, one focused on time frames and the other on frequency channels. These attentions work together to explicitly exploit positional information and create a two-dimensional attention map that captures the significant time-frequency distribution of speech. On the Common Voice dataset, the proposed model consistently improves on current benchmarks, as demonstrated by two widely used objective measures, PESQ and STOI.
The proposed model shows significant improvements, with average PESQ and STOI scores increasing by 45.7% and 23.8%, respectively, for seen background noises, and by 43.5% and 21.4% for unseen background noises, relative to the unprocessed noisy speech. Tests confirm that the proposed approach outperforms numerous state-of-the-art algorithms.
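
The two-branch attention described in the abstract can be illustrated with a minimal sketch. This is not the authors' TFA module: the published module presumably uses learned layers, whereas the version below is parameter-free and relies on assumptions of our own (average pooling, a plain sigmoid, and an outer product to combine the branches). The function name `time_frequency_attention` and the list-of-lists spectrogram layout are hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def time_frequency_attention(spec):
    """Parameter-free sketch of a time-frequency attention map.

    spec: list of F rows (frequency channels), each a list of T frame values.
    Returns (att_map, out), both with the same F x T shape as spec.
    Assumption: pooling + sigmoid stands in for the learned layers
    a real attention module would use.
    """
    n_freq = len(spec)
    n_time = len(spec[0])

    # Frequency branch: average each channel over all time frames.
    freq_pool = [sum(row) / n_time for row in spec]
    # Time branch: average each frame over all frequency channels.
    time_pool = [sum(spec[f][t] for f in range(n_freq)) / n_freq
                 for t in range(n_time)]

    freq_att = [sigmoid(v) for v in freq_pool]  # one weight per channel
    time_att = [sigmoid(v) for v in time_pool]  # one weight per frame

    # Combine the two 1D attentions into a 2D map via an outer product.
    att_map = [[fa * ta for ta in time_att] for fa in freq_att]
    # Reweight the input element-wise with the 2D map.
    out = [[spec[f][t] * att_map[f][t] for t in range(n_time)]
           for f in range(n_freq)]
    return att_map, out


if __name__ == "__main__":
    # Toy 2 x 3 "spectrogram": energy only in channel 0, frames 0 and 2.
    toy = [[2.0, 0.0, 2.0],
           [0.0, 0.0, 0.0]]
    att_map, out = time_frequency_attention(toy)
    for row in att_map:
        print([round(w, 3) for w in row])
```

In this toy input, the cells where the active channel and the active frames intersect receive the largest weights, which is the intuition behind using a joint two-dimensional map rather than either one-dimensional attention alone.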

Cochlear implants are among the most successful neural prosthetic devices to date, but they exhibit poor frequency selectivity and cannot consistently activate apical (low-frequency) spiral ganglion neurons. These issues can limit hearing performance in many cochlear implant patients, especially for understanding speech in noisy environments and for perceiving or appreciating more complex inputs such as music and multiple talkers. For cochlear implants, electrical current must pass through the bony wall of the cochlea, leading to widespread activation of auditory nerve fibers. Cochlear implants also cannot be implanted in some individuals with an obstruction or severe malformation of the cochlea. Alternatively, intraneural stimulation delivered via an auditory nerve implant could provide direct contact with neural fibers and thus reduce unwanted current spread. More confined current during stimulation can increase the frequency selectivity of fiber activation. Furthermore, devices such as the Utah Slanted Electrode Array can provide access to the full cross section of the auditory nerve, including low-frequency fibers that are difficult to reach using a cochlear implant. However, further scientific and preclinical research on these Utah Slanted Electrode Array devices is limited by the lack of a chronic large animal model for the auditory nerve implant, especially one that leverages a surgical approach relevant for human translation. This paper presents a newly developed transbullar translabyrinthine surgical approach for implanting the auditory nerve implant into the cat auditory nerve. In the first of a series of studies, we demonstrate in non-recovery experiments a surgical approach that enables implantation of the auditory nerve implant into the auditory nerve without damaging the device and that enables effective activation of the auditory nerve fibers, as measured by electrode impedances and electrically evoked auditory brainstem responses.
These positive results motivate future chronic cat studies to assess the long-term stability and function of these auditory nerve implant devices, as well as the development of novel stimulation strategies that can be translated to human patients.

Articles published on Speech In Noisy Environments

ESERNet: Learning spectrogram structure relationship for effective speech emotion recognition with swin transformer in classroom discourse analysis

Cochlear Synaptopathy Evaluation With Electrocochleography in Patients With Hearing Difficulty in Noise Despite Normal Hearing Levels.

Effects of Age on Responses of Principal Cells of the Mouse Anteroventral Cochlear Nucleus in Quiet and Noise.

Deep learning approach for automatic speech recognition in the presence of noise

A Fused Deep Denoising Sound Coding Strategy for Bilateral Cochlear Implants.

Using deep learning to improve the intelligibility of a target speaker in noisy multi-talker environments for people with normal hearing and hearing loss.

The Efficacy of Wireless Auditory Training in Unilateral Hearing Loss Rehabilitation.

Multi scale encoder-decoder network with Time Frequency Attention and S-TCN for single channel speech enhancement

Topography and Ensemble Activity in the Auditory Cortex of a Mouse Model of Fragile X Syndrome.

Improved tactile speech robustness to background noise with a dual-path recurrent neural network noise-reduction method

A Mixed-Rate Strategy on a Bilaterally-Synchronized Cochlear Implant Processor Offering the Opportunity to Provide Both Speech Understanding and Interaural Time Difference Cues.

Development of a feline model for preclinical research of a new translabyrinthine auditory nerve implant.

A Step Toward Precision Audiology: Individual Differences and Characteristic Profiles From Auditory Perceptual and Cognitive Abilities.

The electroacoustic performance of digital noise reduction systems in commercial hearing aids with Malay speech-plus-noise test signals

Leading and following: Noise differently affects semantic and acoustic processing during naturalistic speech comprehension

Improvements in naturalistic speech-in-noise comprehension in middle-aged and older adults after 3 weeks of computer-based speechreading training

Evaluation of the auditory findings of patients with obstructive sleep apnea syndrome

No Musician Advantage in the Perception of Degraded-Fundamental Frequency Speech in Noisy Environments.

Congenital Cytomegalovirus and Hearing Loss: The State of the Art.

Speech recognition in noise task among children and young-adults: a pupillometry study.
