Deep Learning Based Speech Beamforming

Kaizhi Qian,Xuesong Yang,Dinei Florencio,Shiyu Chang,Yang Zhang,Mark Hasegawa-Johnson

doi:10.1109/icassp.2018.8462430

Abstract

Multi-channel speech enhancement with ad-hoc sensors has been a challenging task. Speech model guided beamforming algorithms are able to recover natural sounding speech, but the speech models tend to be oversimplified or the inference would otherwise be too complicated. On the other hand, deep learning based enhancement approaches are able to learn complicated speech distributions and perform efficient inference, but they are unable to deal with variable number of input channels. Also, deep learning approaches introduce a lot of errors, particularly in the presence of unseen noise types and settings. We have therefore proposed an enhancement framework called DEEPBEAM, which combines the two complementary classes of algorithms. DEEPBEAM introduces a beamforming filter to produce natural sounding speech, but the filter coefficients are determined with the help of a monaural speech enhancement neural network. Experiments on synthetic and real-world data show that DEEPBEAM is able to produce clean, dry and natural sounding speech, and is robust against unseen noise.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Learning Based Speech Beamforming

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speech Signal Processing

-

25 Sep 2000
25 Sep 2000

The end-product of behavioural stuttering therapy: three decades of denaturing the disorder
T Saltuklaroglu ... J Kalinowski
Disability and Rehabilitation | VOL. 24
T Saltuklaroglu, et. al.T Saltuklaroglu ... J Kalinowski
01 Jan 2002
Disability and Rehabilitation | VOL. 24

Robust Speaker Recognition Based on Single-Channel and Multi-Channel Speech Enhancement
Hassan Taherian ... Deliang Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28
Hassan Taherian, et. al.Hassan Taherian ... Deliang Wang
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28

CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing.
Pavani Cherukuru ... Mumtaz Begum Mustafa
PeerJ. Computer science | VOL. 10
Pavani Cherukuru, et. al.Pavani Cherukuru ... Mumtaz Begum Mustafa
28 Feb 2024
PeerJ. Computer science | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Learning Based Speech Beamforming

Abstract

Talk to us

Similar Papers