A bone-conduction microphone (BCM) converts vibrations of the skull bones during speech into an electrical audio signal. Because BCMs capture speech through the speaker's skull rather than through the air, they offer better noise resistance than standard air-conduction microphones (ACMs). However, BCMs have a different frequency response from ACMs, as they capture only the low-frequency portion of speech signals. Replacing an ACM with a BCM may therefore yield satisfactory noise suppression, but speech quality and intelligibility may suffer owing to the nature of solid-borne vibration. The mismatch between BCM and ACM characteristics can also degrade automatic speech recognition (ASR) performance, and building a new ASR system from BCM voice data is not feasible. The intelligibility of bone-conducted speech depends on the bone location at which the signal is acquired and on how accurately the phonemes of words are modeled. Deep learning techniques such as neural networks have traditionally been used for speech recognition; however, neural networks incur a high computational cost and struggle to model phonemes in such signals. In this paper, the intelligibility of BCM speech signals was evaluated at three bone locations: the right ramus, the larynx, and the right mastoid. BCM signals for Tamil words were acquired, and speech intelligibility was evaluated by both human listeners and deep learning architectures such as CapsuleNet, UNet, and S-Net. As validated by the listeners and the deep learning architectures, the larynx location yields the best speech intelligibility.