Fusing Bone-conduction and Air-conduction Sensors for Complex-Domain Speech Enhancement.

Heming Wang,Xueliang Zhang,Deliang Wang

doi:10.1109/taslp.2022.3209943

Abstract

Speech enhancement aims to improve the listening quality and intelligibility of noisy speech in adverse environments. It proves to be challenging to perform speech enhancement in very low signal-to-noise ratio (SNR) conditions. Conventional speech enhancement utilizes air-conduction (AC) microphones, which are sensitive to background noise but capable of capturing full-band signals. On the other hand, bone-conduction (BC) sensors are unaffected by acoustic noise, but recorded speech has limited bandwidth. This study proposes an attention-based fusion method to combine the strengths of AC and BC signals and perform complex spectral mapping for speech enhancement. Experiments on the EMSB dataset demonstrate that the proposed approach effectively leverages the advantages of AC and BC sensors, and outperforms a recent time-domain baseline in all conditions. We also show that the sensor fusion method is superior to single-sensor counterparts, especially in low SNR conditions. As the amount of BC data is very limited, we additionally propose a semi-supervised technique to utilize both parallelly and unparallely recorded AC and BC speech signals. With additional AC speech from the AISHELL-1 dataset, we achieve similar performance to supervised learning with only 50% parallel data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fusing Bone-conduction and Air-conduction Sensors for Complex-Domain Speech Enhancement.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM transactions on audio, speech, and language processing

Lead the way for us

Journal: IEEE/ACM transactions on audio, speech, and language processing	Publication Date: Jan 1, 2022
Citations: 10

Similar Papers

Using the Callsign Acquisition Test (CAT) to compare the speech intelligibility of air versus bone conduction
M Gripper ... X Jiang
International Journal of Industrial Ergonomics | VOL. 37
M Gripper, et. al.M Gripper ... X Jiang
25 May 2007
International Journal of Industrial Ergonomics | VOL. 37

Multi-modal speech enhancement with bone-conducted speech in time domain
Mou Wang ... Susanto Rahardja
Applied Acoustics | VOL. 200
Mou Wang, et. al.Mou Wang ... Susanto Rahardja
13 Oct 2022
Applied Acoustics | VOL. 200

Presentation method as air- and bone-conducted speech for delayed auditory feedback
Teruki Toya ... Masashi Unoki
The Journal of the Acoustical Society of America | VOL. 141
Teruki Toya, et. al.Teruki Toya ... Masashi Unoki
01 May 2017
The Journal of the Acoustical Society of America | VOL. 141

Kalman Filtering with Machine Learning Methods for Speech Enhancement

-

04 May 2021
04 May 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fusing Bone-conduction and Air-conduction Sensors for Complex-Domain Speech Enhancement.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM transactions on audio, speech, and language processing