Deep beamforming for speech enhancement and speaker localization with an array response-aware loss function

Hsinyu Chang,Mingsian R Bai,Yicheng Hsu

doi:10.3389/frsip.2024.1413983

Abstract

Recent research advances in deep neural network (DNN)-based beamformers have shown great promise for speech enhancement under adverse acoustic conditions. Different network architectures and input features have been explored in estimating beamforming weights. In this paper, we propose a deep beamformer based on an efficient convolutional recurrent network (CRN) trained with a novel ARray RespOnse-aWare (ARROW) loss function. The ARROW loss exploits the array responses of the target and interferer by using the ground truth relative transfer functions (RTFs). The DNN-based beamforming system, trained with ARROW loss through supervised learning, is able to perform speech enhancement and speaker localization jointly. Experimental results have shown that the proposed deep beamformer, trained with the linearly weighted scale-invariant source-to-noise ratio (SI-SNR) and ARROW loss functions, achieves superior performance in speech enhancement and speaker localization compared to two baselines.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep beamforming for speech enhancement and speaker localization with an array response-aware loss function

Abstract

Talk to us

Similar Papers

More From: Frontiers in Signal Processing

Lead the way for us

Journal: Frontiers in Signal Processing	Publication Date: Sep 10, 2024
License type: CC BY 4.0

Similar Papers

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
Shengkui Zhao ... Bin Ma
-
Shengkui Zhao, et. al.Shengkui Zhao ... Bin Ma
06 Jun 2021
06 Jun 2021

Speech Enhancement via Mask-Mapping Based Residual Dense Network
Lin Zhou ... Qiuyue Zhong
Computers, Materials & Continua | VOL. 74
Lin Zhou, et. al.Lin Zhou ... Qiuyue Zhong
01 Jan 2023
Computers, Materials & Continua | VOL. 74

Speech Enhancement Using Convolutional Recurrent Neural Network with Twin Gate Units and Two-Stage Modeling
Baosheng Lv ... Yongbao Ma
-
Baosheng Lv, et. al.Baosheng Lv ... Yongbao Ma
09 Dec 2022
09 Dec 2022

Research on Speech Enhancement Algorithm of Multiresolution Cochleagram Based on Skip Connection Deep Neural Network
Chaofeng Lan ... Xiaojia Lin
Journal of Sensors | VOL. 2022
Chaofeng Lan, et. al.Chaofeng Lan ... Xiaojia Lin
09 May 2022
Journal of Sensors | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep beamforming for speech enhancement and speaker localization with an array response-aware loss function

Abstract

Talk to us

Similar Papers

More From: Frontiers in Signal Processing